Automatic measuring of semantic distances between word senses in a Spanish explanatory dictionary

Alexander Gelbukh, Grigori Sidorov, Liliana Chanona-Hernandez

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

What a semantic distance is and how it should be measured is an interesting problem that has not been well investigated. Usually the distance is measured between words. We propose measuring the distance between different senses of the same word. One purpose of this measurement is to evaluate how plausible it is to apply word sense disambiguation (WSD) techniques in information retrieval. Namely, if word senses are too close (too similar), then, on the one hand, users will be unable to distinguish them for their information needs, and, on the other hand, WSD methods will not be reliable. Another purpose is to estimate the quality of a dictionary: if it contains many close (similar) senses, it should be revised. In our experiments, we used the Anaya dictionary of the Spanish language. Dictionary definitions were lemmatized. To measure the distance, we calculated the literal matching between two senses and the matching using synonyms. The synonyms were taken from a Spanish dictionary of synonyms. The results show that about 90% of senses are different (the distance is rather long), while about 10% are rather similar (the distance is short). Thus, in general, WSD techniques seem to be useful in information retrieval, but in the case of the Anaya dictionary, about 10% of the definitions, those of similar senses, should be revised.
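The abstract names two matching steps (literal matching and matching through synonyms) but does not give the formula, so the following is only a minimal sketch of one way such a distance could be computed. The function name `sense_distance`, the one-to-one pairing of synonyms, and the normalization by the longer definition are all assumptions made for illustration, not the authors' method. Inputs are assumed to be already lemmatized.

```python
from typing import Iterable, Mapping, Optional, Set


def sense_distance(
    def_a: Iterable[str],
    def_b: Iterable[str],
    synonyms: Optional[Mapping[str, Set[str]]] = None,
) -> float:
    """Return a distance in [0, 1] between two lemmatized sense definitions.

    Hypothetical measure: 1 - (literal matches + synonym matches) divided by
    the size of the longer definition. 0 means identical, 1 means no overlap.
    """
    a, b = set(def_a), set(def_b)
    if not a or not b:
        return 1.0  # assumption: an empty definition matches nothing
    synonyms = synonyms or {}

    # Step 1: literal matching, i.e. lemmas shared by both definitions.
    literal = a & b

    # Step 2: synonym matching on the lemmas left over after step 1.
    rest_a = a - literal
    unmatched_b = b - literal
    syn_matches = 0
    for lemma in sorted(rest_a):  # sorted only to make the pairing deterministic
        candidates = set(synonyms.get(lemma, ())) & unmatched_b
        if not candidates:
            # Also accept the reverse direction of the synonym relation.
            candidates = {w for w in unmatched_b if lemma in synonyms.get(w, ())}
        if candidates:
            unmatched_b.discard(min(candidates))  # each lemma is paired at most once
            syn_matches += 1

    matched = len(literal) + syn_matches
    return 1.0 - matched / max(len(a), len(b))
```

With made-up toy data (not taken from the Anaya dictionary), `sense_distance(["asiento", "con", "respaldo"], ["mueble", "con", "respaldo"], {"asiento": {"mueble"}})` treats "asiento" and "mueble" as a synonym match, so the two definitions come out closer than they would under literal matching alone. A threshold on this value could then separate "short" from "long" distances, although the threshold used in the paper is not stated here.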

Original language: English
Title of host publication: Proceedings of the IASTED International Conference on Computer Science and Technology
Editors: S. Sahni
Pages: 399-404
Number of pages: 6
State: Published - 2003
Event: Proceedings of the IASTED International Conference on Computer Science and Technology - Cancun, Mexico
Duration: 19 May 2003 to 21 May 2003

Publication series

Name: Proceedings of the IASTED International Conference on Computer Science and Technology

Conference

Conference: Proceedings of the IASTED International Conference on Computer Science and Technology
Country/Territory: Mexico
City: Cancun
Period: 19/05/03 to 21/05/03

Keywords

  • Computational linguistics
  • Distance in explanatory dictionary
  • Synonyms
  • Word senses
