Web Crawler and Classifier for News Articles

Research output: Chapter in book/report/conference proceeding; Conference contribution; peer-reviewed

2 Citations (Scopus)

Abstract

In this work, we present a crawler that collects news articles and a classifier that identifies the section to which each article belongs. Because of the large number of available information sources, a tool for gathering and filtering news articles on specific interests is necessary. For instance, a person interested in sports or science news might otherwise have to check several websites to find it. We therefore propose a web application in which a crawler automatically collects news articles from different websites, a classifier determines the section of each article, and the articles that match the section of interest are displayed.
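The crawl, classify, and filter steps named in the abstract can be illustrated with a rough Python sketch. It is not the authors' implementation: the abstract does not say which classifier or crawling library is used, so a simple keyword lookup stands in for the classifier, and every name here (`extract_links`, `classify`, `filter_by_section`, `SECTION_KEYWORDS`) is hypothetical. The sketch works on in-memory HTML and text, so no network access is assumed.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects absolute URLs from the <a href="..."> tags of one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, href))


def extract_links(html, base_url):
    """Crawler step: return the links found in one HTML page."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


# Assumption: toy keyword lists. A trained text classifier would
# normally take the place of this lookup.
SECTION_KEYWORDS = {
    "sports": {"match", "team", "goal", "league"},
    "science": {"study", "researchers", "experiment", "telescope"},
}


def classify(text):
    """Classifier step: pick the section with the most keyword hits."""
    words = set(text.lower().split())
    scores = {name: len(words & kw) for name, kw in SECTION_KEYWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "other"


def filter_by_section(articles, section):
    """Display step: keep only articles classified into `section`."""
    return [a for a in articles if classify(a["text"]) == section]
```

In this sketch an article is a plain dictionary with a `"text"` key; calling `filter_by_section(articles, "science")` would return the subset a reader interested in science would be shown.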

Original language: English
Title of host publication: Advances in Computational Intelligence - 21st Mexican International Conference on Artificial Intelligence, MICAI 2022, Proceedings
Editors: Obdulia Pichardo Lagunas, Bella Martínez Seis, Juan Martínez-Miranda
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 127-136
Number of pages: 10
ISBN (print): 9783031194955
DOI
Status: Published - 2022
Event: 21st Mexican International Conference on Artificial Intelligence, MICAI 2022 - Monterrey, Mexico
Duration: 24 Oct 2022 to 29 Oct 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13613 LNAI
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: 21st Mexican International Conference on Artificial Intelligence, MICAI 2022
Country/Territory: Mexico
City: Monterrey
Period: 24/10/22 to 29/10/22
