TY - GEN
T1 - Soft cardinality
T2 - 1st Joint Conference on Lexical and Computational Semantics, *SEM 2012
AU - Jimenez, Sergio
AU - Becerra, Claudia
AU - Gelbukh, Alexander
N1 - Publisher Copyright: © 2012 Association for Computational Linguistics.
PY - 2012
Y1 - 2012
N2 - We present an approach for the construction of text similarity functions using a parameterized resemblance coefficient in combination with a softened cardinality function called soft cardinality. Our approach provides a consistent and recursive model, varying levels of granularity from sentences to characters. Therefore, our model was used to compare sentences divided into words, and in turn, words divided into q-grams of characters. Experimentally, we observed that a performance correlation function in a space defined by all parameters was relatively smooth and had a single maximum achievable by "hill climbing." Our approach used only surface text information, a stop-word remover, and a stemmer to tackle the semantic text similarity task 6 at SEMEVAL 2012. The proposed method ranked 3rd (average), 5th (normalized correlation), and 15th (aggregated correlation) among 89 systems submitted by 31 teams.
UR - http://www.scopus.com/inward/record.url?scp=84870725740&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84870725740
T3 - *SEM 2012 - 1st Joint Conference on Lexical and Computational Semantics
SP - 449
EP - 453
BT - Proceedings of the 6th International Workshop on Semantic Evaluation, SemEval 2012
PB - Association for Computational Linguistics (ACL)
Y2 - 7 June 2012 through 8 June 2012
ER -