TY - GEN
T1 - Hybrid algorithm for word-level alignment of parallel texts
AU - Cendejas, Eduardo
AU - Barceló, Grettel
AU - Gelbukh, Alexander
AU - Sidorov, Grigori
N1 - Funding Information:
Work done under partial support of Mexican Government (SIP-IPN 20091587 and 20090772, CONACYT 50206-H and 83270, SNI, PIFI-IPN).
PY - 2009
Y1 - 2009
N2 - Given a text in two languages, word alignment task consists of identifying in the two variants of the text specific word occurrences that are mutual translations. The majority of existing text alignment systems follow either a linguistic or a statistical approach. We argue for that both approaches are insufficient when used separately, and suggest a flexible algorithm that combines statistical and linguistic techniques.
AB - Given a text in two languages, word alignment task consists of identifying in the two variants of the text specific word occurrences that are mutual translations. The majority of existing text alignment systems follow either a linguistic or a statistical approach. We argue for that both approaches are insufficient when used separately, and suggest a flexible algorithm that combines statistical and linguistic techniques.
UR - http://www.scopus.com/inward/record.url?scp=78651242565&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-12550-8_25
DO - 10.1007/978-3-642-12550-8_25
M3 - Contribución a la conferencia
SN - 3642125492
SN - 9783642125492
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 293
EP - 294
BT - Natural Language Processing and Information Systems - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009, Revised Papers
T2 - 14th International Conference on Applications of Natural Language to Information Systems, NLDB 2009
Y2 - 24 June 2009 through 26 June 2009
ER -