TY - JOUR
T1 - NLP-NITMZ@DPIL-FIRE2016
T2 - 2016 Forum for Information Retrieval Evaluation, FIRE 2016
AU - Sarkar, Sandip
AU - Saha, Saurav
AU - Bentham, Jereemi
AU - Pakray, Partha
AU - Das, Dipankar
AU - Gelbukh, Alexander
PY - 2016
Y1 - 2016
N2 - This paper describes the NLP-NITMZ system submitted to the DPIL shared task at the Forum for Information Retrieval Evaluation (FIRE 2016). The main aim of the DPIL shared task is to detect paraphrases in Indian languages. Paraphrase detection is an important component of Information Retrieval, Document Summarization, Question Answering, Plagiarism Detection, etc. Our approach uses a language-independent feature set based mainly on lexical similarity. The system's three features are Jaccard Similarity, length-normalized Edit Distance, and Cosine Similarity. Finally, a Probabilistic Neural Network (PNN) is trained on this feature set to detect paraphrases. With this feature set, we achieved 88.13% average accuracy in Sub-Task 1 and 71.98% average accuracy in Sub-Task 2.
KW - DPIL
KW - Jaccard similarity
KW - Plagiarism detection
KW - Probabilistic neural network (PNN)
UR - http://www.scopus.com/inward/record.url?scp=85006154542&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85006154542
SN - 1613-0073
VL - 1737
SP - 256
EP - 259
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
Y2 - 7 December 2016 through 10 December 2016
ER -