A simple spanish part of speech tagger for detection and correction of accentuation error

S. N. Galicia-Haro, I. A. Bolshakov, A. F. Gelbukh

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

4 Citas (Scopus)

Resumen

One of the most frequent kind of typographic errors specific to Spanish is connected with accentuation, namely, with omission of an obligatory stress mark or insertion of a superfluous one. If such an error transforms one word to another existing one, the latter cannot be detected by usual spell-checkers, since some context analysis is necessary. A simple procedure is proposed for this task. It relies on (1) some simple heuristics that determine linear context and (2) on a small list of pairs of words that differ only in accentuation mark. This idea is applied to numerous nouns or adjectives like número that pass to quasi-homonymous personal verb forms if they lose their stress marks.

Idioma originalInglés
Título de la publicación alojadaText, Speech and Dialogue - 2nd International Workshop, TSD 1999, Proceedings
EditoresVáclav Matousek, Pavel Mautner, Jana Oceláková, Petr Sojka
EditorialSpringer Verlag
Páginas219-222
Número de páginas4
ISBN (versión impresa)3540664947, 9783540664949
DOI
EstadoPublicada - 1999
Evento2nd International Workshop on Text, Speech and Dialogue, TSD 1999 - Plzen, República Checa
Duración: 13 sep. 199917 sep. 1999

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen1692
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia2nd International Workshop on Text, Speech and Dialogue, TSD 1999
País/TerritorioRepública Checa
CiudadPlzen
Período13/09/9917/09/99

Huella

Profundice en los temas de investigación de 'A simple spanish part of speech tagger for detection and correction of accentuation error'. En conjunto forman una huella única.

Citar esto