On some optimization heuristics for lesk-like WSD algorithms

Alexander Gelbukh; Grigori Sidorov; Sang Yong Han

doi:10.1007/11428817_47

On some optimization heuristics for lesk-like WSD algorithms

Alexander Gelbukh, Grigori Sidorov, Sang Yong Han

Centro de Investigación en Computación (CIC)

Research output: Contribution to journal › Conference article › peer-review

11 Scopus citations

Abstract

For most English words, dictionaries give various senses: e.g., "bank" can stand for a financial institution, shore, set, etc. Automatic selection of the sense intended in a given text has crucial importance in many applications of text processing, such as information retrieval or machine translation: e.g., "(my account in the) bank" is to be translated into Spanish as "(mi cuenta en et) banco" whereas "(on the) bank (of the lake)" as "(en la) orilla (del logo)." To choose the optimal combination of the intended senses of all words, Lesk suggested to consider the global coherence of the text, i.e., which we mean the average relatedness between the chosen senses for all words in the text. Due to high dimensionality of the search space, heuristics are to be used to find a near-optimal configuration. In this paper, we discuss several such heuristics that differ in terms of complexity and quality of the results. In particular, we introduce a dimensionality reduction algorithm that reduces the complexity of computationally expensive approaches such as genetic algorithms.

Original language	English
Pages (from-to)	402-405
Number of pages	4
Journal	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	3513
DOIs	https://doi.org/10.1007/11428817_47
State	Published - 2005
Event	10th International Conference on Applications of Natural Language to Information Systems, NLDB 2005: Natural Language Processing and Information Systems - Alicante, Spain Duration: 15 Jun 2005 → 17 Jun 2005

Access to Document

10.1007/11428817_47

Cite this

@article{c9b5229231c345498606388d410118a8,

title = "On some optimization heuristics for lesk-like WSD algorithms",

abstract = "For most English words, dictionaries give various senses: e.g., {"}bank{"} can stand for a financial institution, shore, set, etc. Automatic selection of the sense intended in a given text has crucial importance in many applications of text processing, such as information retrieval or machine translation: e.g., {"}(my account in the) bank{"} is to be translated into Spanish as {"}(mi cuenta en et) banco{"} whereas {"}(on the) bank (of the lake){"} as {"}(en la) orilla (del logo).{"} To choose the optimal combination of the intended senses of all words, Lesk suggested to consider the global coherence of the text, i.e., which we mean the average relatedness between the chosen senses for all words in the text. Due to high dimensionality of the search space, heuristics are to be used to find a near-optimal configuration. In this paper, we discuss several such heuristics that differ in terms of complexity and quality of the results. In particular, we introduce a dimensionality reduction algorithm that reduces the complexity of computationally expensive approaches such as genetic algorithms.",

author = "Alexander Gelbukh and Grigori Sidorov and Han, {Sang Yong}",

year = "2005",

doi = "10.1007/11428817_47",

language = "Ingl{\'e}s",

volume = "3513",

pages = "402--405",

journal = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

issn = "0302-9743",

publisher = "Springer Verlag",

note = "10th International Conference on Applications of Natural Language to Information Systems, NLDB 2005: Natural Language Processing and Information Systems ; Conference date: 15-06-2005 Through 17-06-2005",

}

TY - JOUR

T1 - On some optimization heuristics for lesk-like WSD algorithms

AU - Gelbukh, Alexander

AU - Sidorov, Grigori

AU - Han, Sang Yong

PY - 2005

Y1 - 2005

N2 - For most English words, dictionaries give various senses: e.g., "bank" can stand for a financial institution, shore, set, etc. Automatic selection of the sense intended in a given text has crucial importance in many applications of text processing, such as information retrieval or machine translation: e.g., "(my account in the) bank" is to be translated into Spanish as "(mi cuenta en et) banco" whereas "(on the) bank (of the lake)" as "(en la) orilla (del logo)." To choose the optimal combination of the intended senses of all words, Lesk suggested to consider the global coherence of the text, i.e., which we mean the average relatedness between the chosen senses for all words in the text. Due to high dimensionality of the search space, heuristics are to be used to find a near-optimal configuration. In this paper, we discuss several such heuristics that differ in terms of complexity and quality of the results. In particular, we introduce a dimensionality reduction algorithm that reduces the complexity of computationally expensive approaches such as genetic algorithms.

AB - For most English words, dictionaries give various senses: e.g., "bank" can stand for a financial institution, shore, set, etc. Automatic selection of the sense intended in a given text has crucial importance in many applications of text processing, such as information retrieval or machine translation: e.g., "(my account in the) bank" is to be translated into Spanish as "(mi cuenta en et) banco" whereas "(on the) bank (of the lake)" as "(en la) orilla (del logo)." To choose the optimal combination of the intended senses of all words, Lesk suggested to consider the global coherence of the text, i.e., which we mean the average relatedness between the chosen senses for all words in the text. Due to high dimensionality of the search space, heuristics are to be used to find a near-optimal configuration. In this paper, we discuss several such heuristics that differ in terms of complexity and quality of the results. In particular, we introduce a dimensionality reduction algorithm that reduces the complexity of computationally expensive approaches such as genetic algorithms.

UR - http://www.scopus.com/inward/record.url?scp=25144488587&partnerID=8YFLogxK

U2 - 10.1007/11428817_47

DO - 10.1007/11428817_47

M3 - Artículo de la conferencia

AN - SCOPUS:25144488587

SN - 0302-9743

VL - 3513

SP - 402

EP - 405

JO - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

JF - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

T2 - 10th International Conference on Applications of Natural Language to Information Systems, NLDB 2005: Natural Language Processing and Information Systems

Y2 - 15 June 2005 through 17 June 2005

ER -

On some optimization heuristics for lesk-like WSD algorithms

Abstract

Access to Document

Other files and links

Fingerprint

Cite this