Augmenting word space models for Word Sense Discrimination using an automatic thesaurus

Hiram Calvo

doi:10.1007/978-3-540-85287-2_10

Centro de Investigación en Computación (CIC)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

This paper presents an algorithm for Word Sense Discrimination that divides the global representation of a word into a number of classes by determining, for any two occurrences, whether they belong to the same sense. We rely on the notion that words used in similar contexts have the same or a closely related meaning; thus, given a target word, we group its dependency co-occurrences in a Word Space Model. Each cluster represents a distinct meaning or sense of that word. We experiment with augmenting the bag of words of each cluster of co-occurrences, augmenting the dictionary of sense definitions, and augmenting both. We then count the intersections between the words in the bag of each clustered sense and those in the bag of each dictionary sense, following the Lesk method. Augmentation increases recall and decreases precision. However, the best F-measure is obtained by augmenting both the dictionary of senses and the bag of words from the clusters.
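The overlap-counting step described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the toy thesaurus, the sense inventory for "bank", and the helper names `augment`, `lesk_overlap`, and `assign_sense` are all assumptions made for illustration, and the clustering of dependency co-occurrences is taken as already done.

```python
from typing import Dict, Iterable, Optional, Set


def augment(bag: Iterable[str], thesaurus: Dict[str, Set[str]]) -> Set[str]:
    """Return the bag plus the thesaurus neighbours of each of its words."""
    base = set(bag)
    expanded = set(base)
    for word in base:
        expanded |= thesaurus.get(word, set())
    return expanded


def lesk_overlap(cluster_bag: Iterable[str], sense_bag: Iterable[str]) -> int:
    """Count the words shared by a cluster bag and a sense-definition bag."""
    return len(set(cluster_bag) & set(sense_bag))


def assign_sense(
    cluster_bag: Set[str],
    sense_definitions: Dict[str, Set[str]],
    thesaurus: Optional[Dict[str, Set[str]]] = None,
    augment_cluster: bool = False,
    augment_senses: bool = False,
) -> Optional[str]:
    """Pick the sense with the largest overlap, or None if nothing overlaps."""
    thesaurus = thesaurus or {}
    if augment_cluster:
        cluster_bag = augment(cluster_bag, thesaurus)
    best_sense, best_score = None, 0
    for sense, definition in sense_definitions.items():
        if augment_senses:
            definition = augment(definition, thesaurus)
        score = lesk_overlap(cluster_bag, definition)
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense


# Hypothetical toy data, not taken from the paper.
toy_thesaurus = {"money": {"cash", "funds"}, "river": {"stream"}}
toy_senses = {
    "bank_finance": {"institution", "deposit", "money"},
    "bank_river": {"slope", "land", "river"},
}
toy_cluster = {"cash", "loan"}

plain = assign_sense(toy_cluster, toy_senses)
expanded = assign_sense(toy_cluster, toy_senses, toy_thesaurus, augment_senses=True)
```

In this toy case the unaugmented bags share no words, so no sense is assigned, while expanding the sense definitions lets "cash" match the financial sense. That is one plausible reading of why augmentation raises recall; the extra words can equally produce spurious matches, which is consistent with the reported drop in precision.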

Original language	English
Title of host publication	Advances in Natural Language Processing - 6th International Conference, GoTAL 2008, Proceedings
Pages	100-107
Number of pages	8
DOI	https://doi.org/10.1007/978-3-540-85287-2_10
Publication status	Published - 2008
Event	6th International Conference on Natural Language Processing, GoTAL 2008 - Gothenburg, Sweden Duration: 25 Aug 2008 → 27 Aug 2008

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	5221 LNAI
ISSN (print)	0302-9743
ISSN (electronic)	1611-3349

Conference

Conference	6th International Conference on Natural Language Processing, GoTAL 2008
Country/Territory	Sweden
City	Gothenburg
Period	25/08/08 → 27/08/08

Access to document

10.1007/978-3-540-85287-2_10

Other files and links

Link to publication in Scopus

Cite this

Calvo, H. (2008). Augmenting word space models for Word Sense Discrimination using an automatic thesaurus. In Advances in Natural Language Processing - 6th International Conference, GoTAL 2008, Proceedings (pp. 100-107). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5221 LNAI). https://doi.org/10.1007/978-3-540-85287-2_10

@inproceedings{94d83d3f8919412f9c6fc085e2b71c1d,

title = "Augmenting word space models for Word Sense Discrimination using an automatic thesaurus",

abstract = "This paper presents an algorithm for Word Sense Discrimination that divides the global representation of a word into a number of classes by determining, for any two occurrences, whether they belong to the same sense. We rely on the notion that words used in similar contexts have the same or a closely related meaning; thus, given a target word, we group its dependency co-occurrences in a Word Space Model. Each cluster represents a distinct meaning or sense of that word. We experiment with augmenting the bag of words of each cluster of co-occurrences, augmenting the dictionary of sense definitions, and augmenting both. We then count the intersections between the words in the bag of each clustered sense and those in the bag of each dictionary sense, following the Lesk method. Augmentation increases recall and decreases precision. However, the best F-measure is obtained by augmenting both the dictionary of senses and the bag of words from the clusters.",

author = "Hiram Calvo",

note = "Funding Information: This work has been partially supported by Strategic Information and Communications R&D Promotion Programme (SCOPE) of Ministry of Internal Affairs and Communications, Japan.; 6th International Conference on Natural Language Processing, GoTAL 2008 ; Conference date: 25-08-2008 Through 27-08-2008",

year = "2008",

doi = "10.1007/978-3-540-85287-2_10",

language = "English",

isbn = "3540852867",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "100--107",

booktitle = "Advances in Natural Language Processing - 6th International Conference, GoTAL 2008, Proceedings",

}

Calvo, H 2008, Augmenting word space models for Word Sense Discrimination using an automatic thesaurus. In Advances in Natural Language Processing - 6th International Conference, GoTAL 2008, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5221 LNAI, pp. 100-107, 6th International Conference on Natural Language Processing, GoTAL 2008, Gothenburg, Sweden, 25/08/08. https://doi.org/10.1007/978-3-540-85287-2_10

Augmenting word space models for Word Sense Discrimination using an automatic thesaurus. / Calvo, Hiram.
Advances in Natural Language Processing - 6th International Conference, GoTAL 2008, Proceedings. 2008. p. 100-107 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5221 LNAI).

TY - GEN

T1 - Augmenting word space models for Word Sense Discrimination using an automatic thesaurus

AU - Calvo, Hiram

N1 - Funding Information: This work has been partially supported by Strategic Information and Communications R&D Promotion Programme (SCOPE) of Ministry of Internal Affairs and Communications, Japan.

PY - 2008

Y1 - 2008

N2 - This paper presents an algorithm for Word Sense Discrimination that divides the global representation of a word into a number of classes by determining, for any two occurrences, whether they belong to the same sense. We rely on the notion that words used in similar contexts have the same or a closely related meaning; thus, given a target word, we group its dependency co-occurrences in a Word Space Model. Each cluster represents a distinct meaning or sense of that word. We experiment with augmenting the bag of words of each cluster of co-occurrences, augmenting the dictionary of sense definitions, and augmenting both. We then count the intersections between the words in the bag of each clustered sense and those in the bag of each dictionary sense, following the Lesk method. Augmentation increases recall and decreases precision. However, the best F-measure is obtained by augmenting both the dictionary of senses and the bag of words from the clusters.

AB - This paper presents an algorithm for Word Sense Discrimination that divides the global representation of a word into a number of classes by determining, for any two occurrences, whether they belong to the same sense. We rely on the notion that words used in similar contexts have the same or a closely related meaning; thus, given a target word, we group its dependency co-occurrences in a Word Space Model. Each cluster represents a distinct meaning or sense of that word. We experiment with augmenting the bag of words of each cluster of co-occurrences, augmenting the dictionary of sense definitions, and augmenting both. We then count the intersections between the words in the bag of each clustered sense and those in the bag of each dictionary sense, following the Lesk method. Augmentation increases recall and decreases precision. However, the best F-measure is obtained by augmenting both the dictionary of senses and the bag of words from the clusters.

UR - http://www.scopus.com/inward/record.url?scp=52149096944&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-85287-2_10

DO - 10.1007/978-3-540-85287-2_10

M3 - Conference contribution

SN - 3540852867

SN - 9783540852865

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 100

EP - 107

BT - Advances in Natural Language Processing - 6th International Conference, GoTAL 2008, Proceedings

T2 - 6th International Conference on Natural Language Processing, GoTAL 2008

Y2 - 25 August 2008 through 27 August 2008

ER -

Calvo H. Augmenting word space models for Word Sense Discrimination using an automatic thesaurus. In Advances in Natural Language Processing - 6th International Conference, GoTAL 2008, Proceedings. 2008. p. 100-107. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-540-85287-2_10
