Improving Neural Machine Translation for Low Resource Languages Using Mixed Training: The Case of Ethiopian Languages

Atnafu Lambebo Tonja, Olga Kolesnikova, Muhammad Arif, Alexander Gelbukh, Grigori Sidorov

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

4 Citas (Scopus)

Resumen

Neural Machine Translation (NMT) has shown improvement for high-resource languages, but there is still a problem with low-resource languages as NMT performs well on huge parallel data available for high-resource languages. In spite of many proposals to solve the problem of low-resource languages, it continues to be a difficult challenge. The issue becomes even more complicated when few resources cover only one domain. In our attempt to combat this issue, we propose a new approach to improve NMT for low-resource languages. The proposed approach using the transformer model shows 5.3, 5.0, and 3.7 BLEU score improvement for Gamo-English, Gofa-English, and Dawuro-English language pairs, respectively, where Gamo, Gofa, and Dawuro are related low-resource Ethiopian languages. We discuss our contributions and envisage future steps in this challenging research area.

Idioma originalInglés
Título de la publicación alojadaAdvances in Computational Intelligence - 21st Mexican International Conference on Artificial Intelligence, MICAI 2022, Proceedings
EditoresObdulia Pichardo Lagunas, Bella Martínez Seis, Juan Martínez-Miranda
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas30-40
Número de páginas11
ISBN (versión impresa)9783031194955
DOI
EstadoPublicada - 2022
Evento21st Mexican International Conference on Artificial Intelligence, MICAI 2022 - Monterrey, México
Duración: 24 oct. 202229 oct. 2022

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen13613 LNAI
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia21st Mexican International Conference on Artificial Intelligence, MICAI 2022
País/TerritorioMéxico
CiudadMonterrey
Período24/10/2229/10/22

Huella

Profundice en los temas de investigación de 'Improving Neural Machine Translation for Low Resource Languages Using Mixed Training: The Case of Ethiopian Languages'. En conjunto forman una huella única.

Citar esto