TY - GEN
T1 - Addressing the Issue of Unavailability of Parallel Corpus Incorporating Monolingual Corpus on PBSMT System for English-Manipuri Translation
AU - Achom, Amika
AU - Pakray, Partha
AU - Gelbukh, Alexander
N1 - Publisher Copyright:
© 2023, Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - This research paper work establishes an important concept of improving Phrase based Statistical Machine Translation System incorporating monolingual corpus on the target side of the English to Manipuri translation language pair. However, there has been no work that focuses on translating one of the Indian Minority Tibeton-Burman Manipuri language pair. This Phrase based Statistical Machine Translation system has been developed using the Moses open-source toolkit and evaluated carefully using various automatic and human evaluation techniques. PBSMT achieves a BLEU Score of 10.15 as compared to the baseline PBSMT of BLEU Score 9.89 using the same training, tuning, and testing datasets. This research paper work addresses the issue of limited availability of parallel text corpora (English-Manipuri pair).
AB - This research paper work establishes an important concept of improving Phrase based Statistical Machine Translation System incorporating monolingual corpus on the target side of the English to Manipuri translation language pair. However, there has been no work that focuses on translating one of the Indian Minority Tibeton-Burman Manipuri language pair. This Phrase based Statistical Machine Translation system has been developed using the Moses open-source toolkit and evaluated carefully using various automatic and human evaluation techniques. PBSMT achieves a BLEU Score of 10.15 as compared to the baseline PBSMT of BLEU Score 9.89 using the same training, tuning, and testing datasets. This research paper work addresses the issue of limited availability of parallel text corpora (English-Manipuri pair).
KW - Adequacy
KW - Automatic-bilingual evaluation understudy score (BLEU)
KW - English to manipuri parallel corpora
KW - Human evaluation-fluency
KW - Overall rating
KW - Phrase based statistical machine translation
UR - http://www.scopus.com/inward/record.url?scp=85149667733&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-23793-5_25
DO - 10.1007/978-3-031-23793-5_25
M3 - Contribución a la conferencia
AN - SCOPUS:85149667733
SN - 9783031237928
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 299
EP - 319
BT - Computational Linguistics and Intelligent Text Processing - 19th International Conference, CICLing 2018, Revised Selected Papers
A2 - Gelbukh, Alexander
PB - Springer Science and Business Media Deutschland GmbH
T2 - 19th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2018
Y2 - 18 March 2018 through 24 March 2018
ER -