Mining Hidden Topics from Newspaper Quotations: The COVID-19 Pandemic

Thang Hoang Ta, Abu Bakar Siddiqur Rahman, Grigori Sidorov, Alexander Gelbukh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

In this paper, we extract quotations from Al Jazeera’s news articles containing keywords related to the COVID-19 pandemic. We apply Latent Dirichlet allocation (LDA), coherence measures, and clustering algorithms to unsupervisedly explore latent topics from the dataset of about 3400 quotations to see how coronavirus impacts human beings. By combining noun phrases as inputs before the training and Cv measure for coherence values, we obtain an average coherence value of 0.66 with a least average number of topics of 24.8. The result covers some of the top issues that our world has been facing against the COVID-19 pandemic.

Original languageEnglish
Title of host publicationAdvances in Computational Intelligence - 19th Mexican International Conference on Artificial Intelligence, MICAI 2020, Proceedings
EditorsLourdes Martínez-Villaseñor, Hiram Ponce, Oscar Herrera-Alcántara, Félix A. Castro-Espinoza
PublisherSpringer Science and Business Media Deutschland GmbH
Pages51-64
Number of pages14
ISBN (Print)9783030608866
DOIs
StatePublished - 2020
Event19th Mexican International Conference on Artificial Intelligence, MICAI 2020 - Mexico City, Mexico
Duration: 12 Oct 202017 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12469 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th Mexican International Conference on Artificial Intelligence, MICAI 2020
Country/TerritoryMexico
CityMexico City
Period12/10/2017/10/20

Keywords

  • COVID-19
  • Latent Dirichlet Allocation
  • Quotation mining
  • Topic model

Fingerprint

Dive into the research topics of 'Mining Hidden Topics from Newspaper Quotations: The COVID-19 Pandemic'. Together they form a unique fingerprint.

Cite this