Exploratory Data Analysis for the Automatic Detection of Question Paraphrasing in Collaborative Environments

Tania Alcantara, Hiram Calvo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Internet searches are a daily occurrence, but we must be aware that more than one person searches the same topic with different words, this is called paraphrasing. Paraphrasing involves syntactic changes and the overlapping of words, linked to the rules of the language in which we work. The identification is a problem of great importance for natural language processing (NLP), especially paraphrasing questions with the same intention. In addition, it has been found that for the study of similarities, some features are not taken into account, which makes the identification yield lower results. In this paper, we address the problem of automatic paraphrase identification in the Quora Question Pair (QQP) dataset, paying special attention to data’s shape through exploratory data analysis (EDA). This is in order to obtain better results in the identification tasks, as well as to compare different classifiers in collaborative environments where resources are limited.

Original languageEnglish
Title of host publicationAdvances in Computational Intelligence - 21st Mexican International Conference on Artificial Intelligence, MICAI 2022, Proceedings
EditorsObdulia Pichardo Lagunas, Bella Martínez Seis, Juan Martínez-Miranda
PublisherSpringer Science and Business Media Deutschland GmbH
Pages193-211
Number of pages19
ISBN (Print)9783031194955
DOIs
StatePublished - 2022
Event21st Mexican International Conference on Artificial Intelligence, MICAI 2022 - Monterrey, Mexico
Duration: 24 Oct 202229 Oct 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13613 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st Mexican International Conference on Artificial Intelligence, MICAI 2022
Country/TerritoryMexico
CityMonterrey
Period24/10/2229/10/22

Fingerprint

Dive into the research topics of 'Exploratory Data Analysis for the Automatic Detection of Question Paraphrasing in Collaborative Environments'. Together they form a unique fingerprint.

Cite this