TY - GEN
T1 - Web Crawler and Classifier for News Articles
AU - García-Mendoza, Consuelo Varinia
AU - Juárez Gambino, Omar
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - In this work, we present a crawler that collects news articles and a classifier that identifies the section to which these articles belong. Due to a large number of available sources of information, a tool for gathering and filtering news articles about specific interests is necessary. For instance, a person might be interested in news about sports or science, and it could be necessary to check several websites to obtain this kind of news finally. Therefore, in this work, we propose a web application that uses a crawler to collect news articles from different websites automatically, then a classifier determines the section of each news article, and finally, the news articles that match the section of interest are displayed in the web application.
AB - In this work, we present a crawler that collects news articles and a classifier that identifies the section to which these articles belong. Due to a large number of available sources of information, a tool for gathering and filtering news articles about specific interests is necessary. For instance, a person might be interested in news about sports or science, and it could be necessary to check several websites to obtain this kind of news finally. Therefore, in this work, we propose a web application that uses a crawler to collect news articles from different websites automatically, then a classifier determines the section of each news article, and finally, the news articles that match the section of interest are displayed in the web application.
KW - Machine learning
KW - Text classification
KW - Web crawling
UR - http://www.scopus.com/inward/record.url?scp=85142816010&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-19496-2_10
DO - 10.1007/978-3-031-19496-2_10
M3 - Contribución a la conferencia
AN - SCOPUS:85142816010
SN - 9783031194955
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 127
EP - 136
BT - Advances in Computational Intelligence - 21st Mexican International Conference on Artificial Intelligence, MICAI 2022, Proceedings
A2 - Pichardo Lagunas, Obdulia
A2 - Martínez Seis, Bella
A2 - Martínez-Miranda, Juan
PB - Springer Science and Business Media Deutschland GmbH
T2 - 21st Mexican International Conference on Artificial Intelligence, MICAI 2022
Y2 - 24 October 2022 through 29 October 2022
ER -