Hybrid algorithm for word-level alignment of parallel texts

Eduardo Cendejas; Grettel Barceló; Alexander Gelbukh; Grigori Sidorov

doi:10.1007/978-3-642-12550-8_25

Hybrid algorithm for word-level alignment of parallel texts

Eduardo Cendejas, Grettel Barceló, Alexander Gelbukh, Grigori Sidorov

Centro de Investigación en Computación (CIC)

Producción científica: Capítulo del libro/informe/acta de congreso › Contribución a la conferencia › revisión exhaustiva

Resumen

Given a text in two languages, word alignment task consists of identifying in the two variants of the text specific word occurrences that are mutual translations. The majority of existing text alignment systems follow either a linguistic or a statistical approach. We argue for that both approaches are insufficient when used separately, and suggest a flexible algorithm that combines statistical and linguistic techniques.

Idioma original	Inglés
Título de la publicación alojada	Natural Language Processing and Information Systems - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Revised Papers
Páginas	293-294
Número de páginas	2
DOI	https://doi.org/10.1007/978-3-642-12550-8_25
Estado	Publicada - 2009
Evento	14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009 - Saarbrucken, Alemania Duración: 24 jun. 2009 → 26 jun. 2009

Serie de la publicación

Nombre	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen	5723 LNCS
ISSN (versión impresa)	0302-9743
ISSN (versión digital)	1611-3349

Conferencia

Conferencia	14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009
País/Territorio	Alemania
Ciudad	Saarbrucken
Período	24/06/09 → 26/06/09

Acceder al documento

10.1007/978-3-642-12550-8_25

Otros archivos y enlaces

Enlace a la publicación en Scopus

Citar esto

Cendejas, E., Barceló, G., Gelbukh, A., & Sidorov, G. (2009). Hybrid algorithm for word-level alignment of parallel texts. En Natural Language Processing and Information Systems - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Revised Papers (pp. 293-294). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5723 LNCS). https://doi.org/10.1007/978-3-642-12550-8_25

Cendejas, Eduardo ; Barceló, Grettel ; Gelbukh, Alexander et al. / Hybrid algorithm for word-level alignment of parallel texts. Natural Language Processing and Information Systems - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Revised Papers. 2009. pp. 293-294 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{89405b736a044db48bec76d3081dedf9,

title = "Hybrid algorithm for word-level alignment of parallel texts",

abstract = "Given a text in two languages, word alignment task consists of identifying in the two variants of the text specific word occurrences that are mutual translations. The majority of existing text alignment systems follow either a linguistic or a statistical approach. We argue for that both approaches are insufficient when used separately, and suggest a flexible algorithm that combines statistical and linguistic techniques.",

author = "Eduardo Cendejas and Grettel Barcel{\'o} and Alexander Gelbukh and Grigori Sidorov",

note = "Funding Information: Work done under partial support of Mexican Government (SIP-IPN 20091587 and 20090772, CONACYT 50206-H and 83270, SNI, PIFI-IPN).; 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009 ; Conference date: 24-06-2009 Through 26-06-2009",

year = "2009",

doi = "10.1007/978-3-642-12550-8_25",

language = "Ingl{\'e}s",

isbn = "3642125492",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "293--294",

booktitle = "Natural Language Processing and Information Systems - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Revised Papers",

}

Cendejas, E, Barceló, G, Gelbukh, A & Sidorov, G 2009, Hybrid algorithm for word-level alignment of parallel texts. En Natural Language Processing and Information Systems - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Revised Papers. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 5723 LNCS, pp. 293-294, 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Saarbrucken, Alemania, 24/06/09. https://doi.org/10.1007/978-3-642-12550-8_25

Hybrid algorithm for word-level alignment of parallel texts. / Cendejas, Eduardo; Barceló, Grettel; Gelbukh, Alexander et al.
Natural Language Processing and Information Systems - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Revised Papers. 2009. p. 293-294 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 5723 LNCS).

Producción científica: Capítulo del libro/informe/acta de congreso › Contribución a la conferencia › revisión exhaustiva

TY - GEN

T1 - Hybrid algorithm for word-level alignment of parallel texts

AU - Cendejas, Eduardo

AU - Barceló, Grettel

AU - Gelbukh, Alexander

AU - Sidorov, Grigori

N1 - Funding Information: Work done under partial support of Mexican Government (SIP-IPN 20091587 and 20090772, CONACYT 50206-H and 83270, SNI, PIFI-IPN).

PY - 2009

Y1 - 2009

N2 - Given a text in two languages, word alignment task consists of identifying in the two variants of the text specific word occurrences that are mutual translations. The majority of existing text alignment systems follow either a linguistic or a statistical approach. We argue for that both approaches are insufficient when used separately, and suggest a flexible algorithm that combines statistical and linguistic techniques.

AB - Given a text in two languages, word alignment task consists of identifying in the two variants of the text specific word occurrences that are mutual translations. The majority of existing text alignment systems follow either a linguistic or a statistical approach. We argue for that both approaches are insufficient when used separately, and suggest a flexible algorithm that combines statistical and linguistic techniques.

UR - http://www.scopus.com/inward/record.url?scp=78651242565&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-12550-8_25

DO - 10.1007/978-3-642-12550-8_25

M3 - Contribución a la conferencia

SN - 3642125492

SN - 9783642125492

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 293

EP - 294

BT - Natural Language Processing and Information Systems - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Revised Papers

T2 - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009

Y2 - 24 June 2009 through 26 June 2009

ER -

Cendejas E, Barceló G, Gelbukh A , Sidorov G. Hybrid algorithm for word-level alignment of parallel texts. En Natural Language Processing and Information Systems - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Revised Papers. 2009. p. 293-294. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-642-12550-8_25

Hybrid algorithm for word-level alignment of parallel texts

Resumen

Serie de la publicación

Conferencia

Acceder al documento

Otros archivos y enlaces

Huella

Citar esto