Exploratory Data Analysis for the Automatic Detection of Question Paraphrasing in Collaborative Environments

Tania Alcantara, Hiram Calvo

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

Internet searches are a daily occurrence, but we must be aware that more than one person searches the same topic with different words, this is called paraphrasing. Paraphrasing involves syntactic changes and the overlapping of words, linked to the rules of the language in which we work. The identification is a problem of great importance for natural language processing (NLP), especially paraphrasing questions with the same intention. In addition, it has been found that for the study of similarities, some features are not taken into account, which makes the identification yield lower results. In this paper, we address the problem of automatic paraphrase identification in the Quora Question Pair (QQP) dataset, paying special attention to data’s shape through exploratory data analysis (EDA). This is in order to obtain better results in the identification tasks, as well as to compare different classifiers in collaborative environments where resources are limited.

Idioma originalInglés
Título de la publicación alojadaAdvances in Computational Intelligence - 21st Mexican International Conference on Artificial Intelligence, MICAI 2022, Proceedings
EditoresObdulia Pichardo Lagunas, Bella Martínez Seis, Juan Martínez-Miranda
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas193-211
Número de páginas19
ISBN (versión impresa)9783031194955
DOI
EstadoPublicada - 2022
Evento21st Mexican International Conference on Artificial Intelligence, MICAI 2022 - Monterrey, México
Duración: 24 oct. 202229 oct. 2022

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen13613 LNAI
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia21st Mexican International Conference on Artificial Intelligence, MICAI 2022
País/TerritorioMéxico
CiudadMonterrey
Período24/10/2229/10/22

Huella

Profundice en los temas de investigación de 'Exploratory Data Analysis for the Automatic Detection of Question Paraphrasing in Collaborative Environments'. En conjunto forman una huella única.

Citar esto