TY - JOUR
T1 - Distributional thesaurus versus WordNet
T2 - 6th International Conference, CICLing 2005
AU - Calvo, Hiram
AU - Gelbukh, Alexander
AU - Kilgarriff, Adam
PY - 2005
Y1 - 2005
N2 - Prepositional Phrase (PP) attachment can be addressed by considering frequency counts of dependency triples seen in a non-annotated corpus. However, not all triples appear even in very big corpora. To solve this problem, several techniques have been used. We evaluate two different backoff methods, one based on WordNet and the other on a distributional (automatically created) thesaurus. We work on Spanish. The thesaurus is created using the dependency triples found in the same corpus used for counting the frequency of unambiguous triples. The training corpus used for both methods is an encyclopaedia. The method based on a distributional thesaurus has higher coverage but lower precision than the WordNet method.
AB - Prepositional Phrase (PP) attachment can be addressed by considering frequency counts of dependency triples seen in a non-annotated corpus. However, not all triples appear even in very big corpora. To solve this problem, several techniques have been used. We evaluate two different backoff methods, one based on WordNet and the other on a distributional (automatically created) thesaurus. We work on Spanish. The thesaurus is created using the dependency triples found in the same corpus used for counting the frequency of unambiguous triples. The training corpus used for both methods is an encyclopaedia. The method based on a distributional thesaurus has higher coverage but lower precision than the WordNet method.
UR - http://www.scopus.com/inward/record.url?scp=24344473862&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-30586-6_17
DO - 10.1007/978-3-540-30586-6_17
M3 - Conference article
AN - SCOPUS:24344473862
SN - 0302-9743
VL - 3406
SP - 177
EP - 188
JO - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
JF - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Y2 - 13 February 2005 through 19 February 2005
ER -