Morphological Analysis Based Part-of-Speech Tagging for Uyghur Speech Synthesis

Author(s):  
Guljamal Mamateli ◽  
Askar Rozi ◽  
Gulnar Ali ◽  
Askar Hamdulla
2017 ◽  
Vol 68 (2) ◽  
pp. 396-403
Author(s):  
Hana Žižková

Abstract Compound adverbs represent an interesting issue in terms of Automatic Morphological Analysis (AMA). The reason is that compound adverbs in Czech are expressions formed by compounding existing words that are different parts of speech without any change in their form. An indicative sign of compound adverbs is that they can always be decomposed again. Compound adverbs may be written as one word but sometimes a multiword form coexists. A word that is originally a different part of speech gains an adverbial meaning and becomes an adverb. This article presents the results of a corpus probe aimed at mapping expressions that are demonstrably compound adverbs and were not recognized by AMA or were incorrectly tagged by AMA as another part of speech. Analysis of data obtained from the Czech National Corpus (ČNK) SYN v3 show that the unrecognized and incorrectly tagged units can be divided into several groups. Based on knowledge of these groups it is possible to refine part of speech tagging by AMA. The corpus probe examined units written in accordance with the current codification as well as substandard units.


2014 ◽  
Vol 519-520 ◽  
pp. 784-787
Author(s):  
Zhi Qiang Wu ◽  
Hong Zhi Yu ◽  
Shu Hui Wan

It’s a basic work for Tibetan information processing to tag the Tibetan parts of speech,the results can be used in machine translation, speech synthesis and so on. By studying the Tibetan language grammar and the classification of Tibetan parts of speech, established the Tibetan parts of speech tagging sets, and tagged the corpus, used the CRFs to solve the problem that automatic tagging of Tibetan parts of speech, the experimental results show that in the closed test set, part-of-speech tagging accuracy is 94.2%, and in the opening set, the accuracy is 91.5%.


Sign in / Sign up

Export Citation Format

Share Document