Improving unsupervised WSD with a dynamic thesaurus

Javier Tejada-Cárcamo, Hiram Calvo, Alexander Gelbukh

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

1 Cita (Scopus)

Resumen

The method proposed by Diana McCarthy et al. [1] obtains the predominant sense for an ambiguous word based on a weighted list of terms related to the ambiguous word. This list of terms is obtained using the distributional similarity method proposed by Lin [2] to obtain a thesaurus. In that method, every occurrence of the ambiguous word uses the same thesaurus, regardless of the context where it occurs. Every different word to be disambiguated uses the same thesaurus. In this paper we explore a different method that accounts for the context of a word when determining the most frequent sense of an ambiguous word. In our method the list of distributed similar words is built based on the syntactic context of the ambiguous word. We attain a precision of 69.86%, which is 7% higher than the supervised baseline of using the MFS of 90% SemCor against the remaining 10% of SemCor.

Idioma originalInglés
Título de la publicación alojadaText, Speech and Dialogue - 11th International Conference, TSD 2008, Proceedings
Páginas201-210
Número de páginas10
DOI
EstadoPublicada - 2008
Evento11th International Conference on Text, Speech and Dialogue, TSD 2008 - Brno, República Checa
Duración: 8 sep. 200812 sep. 2008

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen5246 LNAI
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia11th International Conference on Text, Speech and Dialogue, TSD 2008
País/TerritorioRepública Checa
CiudadBrno
Período8/09/0812/09/08

Huella

Profundice en los temas de investigación de 'Improving unsupervised WSD with a dynamic thesaurus'. En conjunto forman una huella única.

Citar esto