phoneme classification
Recently Published Documents


TOTAL DOCUMENTS

107
(FIVE YEARS 13)

H-INDEX

11
(FIVE YEARS 1)

2021 ◽  
Vol 34 (2) ◽  
Author(s):  
Haakon S. Krohn

This paper presents the phonological vowel systems of 16 Chibchan languages, according to existing descriptions. Its purpose is to provide a basic and systematic overview of the vowel systems found in this family. For each language, the distinctive features needed to distinguish the vowel phonemes are discussed, as well as their phonetic realizations when this is relevant. Whereas the phoneme classification is uncontroversial for many of the languages, there are also some Chibchan languages that have been subject to very different analyses over the years; in these cases, a more thorough discussion is provided. At the end of the paper, the Chibchan vowel systems are compared from a quantitative perspective.


2021 ◽  
Vol 11 (1) ◽  
pp. 428
Author(s):  
Donghoon Oh ◽  
Jeong-Sik Park ◽  
Ji-Hwan Kim ◽  
Gil-Jin Jang

Speech recognition consists of converting input sound into a sequence of phonemes, then finding text for the input using language models. Therefore, phoneme classification performance is a critical factor for the successful implementation of a speech recognition system. However, correctly distinguishing phonemes with similar characteristics is still a challenging problem even for state-of-the-art classification methods, and the classification errors are hard to be recovered in the subsequent language processing steps. This paper proposes a hierarchical phoneme clustering method to exploit more suitable recognition models to different phonemes. The phonemes of the TIMIT database are carefully analyzed using a confusion matrix from a baseline speech recognition model. Using automatic phoneme clustering results, a set of phoneme classification models optimized for the generated phoneme groups is constructed and integrated into a hierarchical phoneme classification method. According to the results of a number of phoneme classification experiments, the proposed hierarchical phoneme group models improved performance over the baseline by 3%, 2.1%, 6.0%, and 2.2% for fricative, affricate, stop, and nasal sounds, respectively. The average accuracy was 69.5% and 71.7% for the baseline and proposed hierarchical models, showing a 2.2% overall improvement.


Phoneme recognition is an intricate problem lying under non-linear systems. Most research in this area revolve around try to model the pattern of features observed in the speech spectra with the use of Hidden Markov Models (HMM), various types of neural networks like deep recurrent neural networks, time delay neural networks, etc. for efficient phoneme recognition. In this paper, we study the effectiveness of the hybrid architecture, the Adaptive Neuro-Fuzzy Inference System (ANFIS) for capturing the spectral features of the speech signal to handle the problem of Phoneme Recognition. In spite of a wide range of research in this field, here we examine the power of ANFIS for least explored Tamil phoneme recognition problem. The experimental results have shown the ability of the model to learn the patterns associated with various phonetic classes, indicated with recognition improvement in terms of accuracy to its counterparts.


Sign in / Sign up

Export Citation Format

Share Document