Web Crawler and Classifier for News Articles

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

In this work, we present a crawler that collects news articles and a classifier that identifies the section to which these articles belong. Due to a large number of available sources of information, a tool for gathering and filtering news articles about specific interests is necessary. For instance, a person might be interested in news about sports or science, and it could be necessary to check several websites to obtain this kind of news finally. Therefore, in this work, we propose a web application that uses a crawler to collect news articles from different websites automatically, then a classifier determines the section of each news article, and finally, the news articles that match the section of interest are displayed in the web application.

Original languageEnglish
Title of host publicationAdvances in Computational Intelligence - 21st Mexican International Conference on Artificial Intelligence, MICAI 2022, Proceedings
EditorsObdulia Pichardo Lagunas, Bella Martínez Seis, Juan Martínez-Miranda
PublisherSpringer Science and Business Media Deutschland GmbH
Pages127-136
Number of pages10
ISBN (Print)9783031194955
DOIs
StatePublished - 2022
Event21st Mexican International Conference on Artificial Intelligence, MICAI 2022 - Monterrey, Mexico
Duration: 24 Oct 202229 Oct 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13613 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st Mexican International Conference on Artificial Intelligence, MICAI 2022
Country/TerritoryMexico
CityMonterrey
Period24/10/2229/10/22

Keywords

  • Machine learning
  • Text classification
  • Web crawling

Fingerprint

Dive into the research topics of 'Web Crawler and Classifier for News Articles'. Together they form a unique fingerprint.

Cite this