Resource building and Parts-of-Speech (POS) tagging for the Mizo language

Partha Pakray, Arunagshu Pal, Goutam Majumder, Alexander Gelbukh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

19 Scopus citations

Abstract

The main goal of this work is to build resources and a part-of-speech (POS) tagger for Mizo Language. The Mizo Language is the official language of the Mizoram state of India. The Mizo language is also known as Lushai language. The Mizo language belongs to the Kukish branch of the Sino-Tibetan language family. The paper describes the development of a Mizo-to-English dictionary and a part-of-speech tagger. In our Mizo-to-English dictionary, we have collected 26,407 entries, both automatically and manually. We started from studying the Mizo parts of speech and generated the POS tag list. For POS tagging of the Mizo Language, we built a 24-item tag set. The dictionary and the POS tag set will be used for building an automatic POS tagger for the Mizo language.

Original languageEnglish
Title of host publicationProceedings - 14th Mexican International Conference on Artificial Intelligence
Subtitle of host publicationAdvances in Artificial Intelligence, MICAI 2015
EditorsGustavo Arroyo Figueroa, Grigori Sidorov, Sofia N. Galicia Haro, Oscar Herrera Alcantara, Obdulia Pichardo Lagunas
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3-7
Number of pages5
ISBN (Electronic)9781509003235
DOIs
StatePublished - 8 Mar 2016
Event14th Mexican International Conference on Artificial Intelligence, MICAI 2015 - Cuernavaca, Morelos, Mexico
Duration: 25 Oct 201531 Oct 2015

Publication series

NameProceedings - 14th Mexican International Conference on Artificial Intelligence: Advances in Artificial Intelligence, MICAI 2015

Conference

Conference14th Mexican International Conference on Artificial Intelligence, MICAI 2015
Country/TerritoryMexico
CityCuernavaca, Morelos
Period25/10/1531/10/15

Keywords

  • Mizo language
  • Part-of-speech Tagging
  • lexical

Fingerprint

Dive into the research topics of 'Resource building and Parts-of-Speech (POS) tagging for the Mizo language'. Together they form a unique fingerprint.

Cite this