On the usage of morphological tags for grammar induction

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

2 Citas (Scopus)

Resumen

We present a study on the effect of adding morphological tags to the training corpus of a grammar inductor. For this purpose, we carried out several experiments using the grammar induction system called Alignment-Based Learning (ABL) and the CAST-3LB syntactically tagged Spanish corpus for training and testing. ABL produces a set of possible constituents with a word alignment process. We developed an algorithm which converts the hypotheses generated by ABL into ordered production rules. Then our algorithm groups them into possible phrase groups (constituents). These phrase groups correspond to the syntactic tagging of the unannotated text. We compared the phrase groups obtained by our algorithm with the manually tagged groups of CAST3LB. The experiments in the grammar induction process consisted on trying three different variants for the training corpus: (1) using words; (2) using only the morphological tags; and (3) adding morphological tags to words. Our experiments show that the inclusion of morphological tags in the grammar induction process improves significantly the performance of ABL.

Idioma originalInglés
Título de la publicación alojadaMICAI 2007
Subtítulo de la publicación alojadaAdvances in Artificial Intelligence - 6th Mexican International Conference on Artificial Intelligence, Proceedings
Páginas912-921
Número de páginas10
EstadoPublicada - 2007
Evento6th Mexican International Conference on Artificial Intelligence, MICAI 2007 - Aguascalientes, México
Duración: 4 nov. 200710 nov. 2007

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen4827 LNAI
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia6th Mexican International Conference on Artificial Intelligence, MICAI 2007
País/TerritorioMéxico
CiudadAguascalientes
Período4/11/0710/11/07

Huella

Profundice en los temas de investigación de 'On the usage of morphological tags for grammar induction'. En conjunto forman una huella única.

Citar esto