Caracterización de riesgos urbanos en prensa aplicando minería de texto para el enriquecimiento de datos abiertos

Translated title of the contribution: Characterization of urban risks in the press applying text mining for the enrichment of open data Luis M. Vilches-Blázquez and Diana Comesaña Ocampo

Luis M. Vilches-Blázquez, Diana Comesaña Ocampo

Research output: Contribution to journalArticlepeer-review

Abstract

News is freely spread and widely available to Internet users much more easily than traditional media. In the news, we can find an infinite number of hidden “minor data,” that can provide valuable information not col-lected in other sources of information. In this context, we have been interested in analyzing and characteriz-ing the urban risks contained in the Uruguayan open newspapers using text mining techniques. This pro-posal makes it possible to create a news corpus based on risk events included in open data. The corpus cov-ers 2003-2019 and is built from the digital open newspapers El Eco Digital, Montevideo Portal, and La Red 21. Various text mining techniques are applied to this corpus using the QDA-MinerLite software and the Python language (concretely, through the Scattertext library) to identify, characterize, and discover insights on these events. The corpus processing results help en-rich the existing open data on risks in Uruguay, incor-porating information on their effects, actors, and asso-ciated interventions.

Translated title of the contributionCharacterization of urban risks in the press applying text mining for the enrichment of open data Luis M. Vilches-Blázquez and Diana Comesaña Ocampo
Original languageSpanish
Article numbereib0915853805
Pages (from-to)85-107
Number of pages23
JournalInvestigacion Bibliotecologica
Volume36
Issue number91
DOIs
StatePublished - 2022

Fingerprint

Dive into the research topics of 'Characterization of urban risks in the press applying text mining for the enrichment of open data Luis M. Vilches-Blázquez and Diana Comesaña Ocampo'. Together they form a unique fingerprint.

Cite this