SOFTCARDINALITY-CORE: Improving Text Overlap with Distributional Measures for Semantic Textual Similarity

Sergio Jimenez, Claudia Becerra, Alexander Gelbukh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Soft cardinality has been shown to be a very strong text-overlapping baseline for the task of measuring semantic textual similarity (STS), obtaining 3rd place in SemEval-2012. At ∗SEM-2013 shared task, beside the plain textoverlapping approach, we tested within soft cardinality two distributional word-similarity functions derived from the ukWack corpus. Unfortunately, we combined these measures with other features using regression, obtaining positions 18th, 22nd and 23rd among the 90 participants systems in the official ranking. Already after the release of the gold standard annotations of the test data, we observed that using only the similarity measures without combining them with other features would have obtained positions 6th, 7th and 8th; moreover, an arithmetic average of these similarity measures would have been 4th(mean=0.5747). This paper describes both the 3 systems as they were submitted and the similarity measures that would obtained those better results.

Original languageEnglish
Title of host publicationSEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Proceedings of the Main Conference and the Shared Task
Subtitle of host publicationSemantic Textual SimilaritySEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity
EditorsMona Diab, Tim Baldwin, Marco Baroni
PublisherAssociation for Computational Linguistics (ACL)
Pages194-201
Number of pages8
ISBN (Electronic)9781937284480
StatePublished - 2013
Event2nd Joint Conference on Lexical and Computational Semantics, SEM 2013 - Atlanta, United States
Duration: 13 Jun 201314 Jun 2013

Publication series

NameSEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Proceedings of the Main Conference and the Shared Task: Semantic Textual SimilaritySEM 2013 - 2nd Joint Conference on Lexical and Computational Semantics, Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity

Conference

Conference2nd Joint Conference on Lexical and Computational Semantics, SEM 2013
Country/TerritoryUnited States
CityAtlanta
Period13/06/1314/06/13

Fingerprint

Dive into the research topics of 'SOFTCARDINALITY-CORE: Improving Text Overlap with Distributional Measures for Semantic Textual Similarity'. Together they form a unique fingerprint.

Cite this