TY - GEN
T1 - Computing text similarity using Tree Edit Distance
AU - Sidorov, Grigori
AU - Gomez-Adorno, Helena
AU - Markov, Ilia
AU - Pinto, David
AU - Loya, Nahun
N1 - Publisher Copyright: © 2015 IEEE.
PY - 2015/9/29
Y1 - 2015/9/29
AB - In this paper, we propose applying the Tree Edit Distance (TED) to calculate the similarity between syntactic n-grams, for subsequent detection of soft similarity between texts. The computation of text similarity is a basic task in many natural language processing problems, and it remains an open research field. Syntactic n-grams are text features extracted from dependency trees and used to construct a Vector Space Model. Soft similarity is the application of the Vector Space Model taking into account the similarity between features. First, we discuss the advantages of applying the TED to syntactic n-grams. Then, we present a procedure based on the TED and syntactic n-grams for calculating soft similarity between texts.
KW - Computational modeling
KW - Cost function
KW - Heuristic algorithms
KW - Information retrieval
KW - Natural language processing
KW - Semantics
KW - Syntactics
UR - http://www.scopus.com/inward/record.url?scp=84961888075&partnerID=8YFLogxK
U2 - 10.1109/NAFIPS-WConSC.2015.7284129
DO - 10.1109/NAFIPS-WConSC.2015.7284129
M3 - Conference contribution
AN - SCOPUS:84961888075
T3 - Annual Conference of the North American Fuzzy Information Processing Society - NAFIPS
BT - 2015 Annual Meeting of the North American Fuzzy Information Processing Society, NAFIPS 2015
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - Annual Meeting of the North American Fuzzy Information Processing Society, NAFIPS 2015
Y2 - 17 August 2015 through 19 August 2015
ER -