Towards Burmese (Myanmar) morphological analysis: Syllable-based tokenization and part-of-speech tagging

C Ding, HTZ Aye, WP Pa, KT Nwet, KM Soe… - ACM Transactions on …, 2019 - dl.acm.org
This article presents a comprehensive study on two primary tasks in Burmese (Myanmar)
morphological analysis: tokenization and part-of-speech (POS) tagging. Twenty thousand
Burmese sentences of newswire are annotated with two-layer tokenization and POS-tagging
information, as one component of the Asian Language Treebank Project. The annotated
corpus has been released under a CC BY-NC-SA license, and it is the largest open-access
database of annotated Burmese when this manuscript was prepared in 2017. Detailed …

[PDF][PDF] Towards Burmese (Myanmar) Morphological Analysis: Syllable-based Tokenization and Part-of-speech Tagging

HTHUZAR AYE, W PA, KT NWET, KMAR SOE - 2019 - academia.edu
The specific processing within morphological analysis is diverse and depends on different
types of languages or, more formally, on linguistic typology. Consequently, most references
pertaining to morphological analysis in NLP are language specific, because the features of
languages largely affect engineering tasks. As for most inflected Indo-European languages,
the processing includes related tasks such as stemming, lemmatization, and POS-tagging of
words, where the core part revolves around the identification of stems and a xes. Also, this is …
Showing the best results for this search. See all results