RECOGNITION OF LEXICAL TONES FOR ISOLATED SYLLABLES AND DISYLLABLES IN MANDARIN SPEECH

This study proposes a method for recognizing lexical tones in Mandarin speech. The method is based on vector quantization (VQ) and hidden Markov models (HMMs). Pitch periods are extracted to derive feature vectors that represent pitch height and pitch contour slope. For each tone, one HMM is trained on the feature vectors of monosyllables. The HMMs are then used to recognize the tones of monosyllables and disyllables. For monosyllables, the accuracy reaches 93.75% in the speaker-independent case. For disyllables, the accuracy is 93% for the first syllable and 90% for the second syllable. The lower accuracy on the second syllable suggests that its tone may be affected by the preceding syllable, and it reflects the tone variation that occurs in Mandarin speech.
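
The pipeline described in the abstract (pitch features, vector quantization, one discrete HMM per tone, maximum-likelihood decision) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names, the use of log-F0 relative to the syllable mean as "pitch height", the frame-to-frame difference as "slope", and all codebook and model values are assumptions chosen for the example. In practice the codebook and HMM parameters would be trained from data.

```python
import math

def pitch_features(f0):
    """Map a voiced F0 track (Hz, i.e. 1 / pitch period) to (height, slope) pairs.

    Assumption: height is log-F0 relative to the syllable mean and slope is
    the frame-to-frame log-F0 difference.
    """
    logf = [math.log(v) for v in f0]
    mean = sum(logf) / len(logf)
    return [(logf[i] - mean, logf[i] - logf[i - 1]) for i in range(1, len(logf))]

def quantize(vec, codebook):
    """Return the index of the nearest codeword (squared Euclidean distance)."""
    def dist(k):
        return sum((a - b) ** 2 for a, b in zip(vec, codebook[k]))
    return min(range(len(codebook)), key=dist)

def forward_log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | discrete HMM)."""
    n = len(start)
    alpha = [start[s] * emit[s][obs[0]] for s in range(n)]
    loglik = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [
                sum(alpha[r] * trans[r][s] for r in range(n)) * emit[s][obs[t]]
                for s in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence impossible under this model
        loglik += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return loglik

def classify_tone(obs, models):
    """Pick the tone whose HMM gives the highest log-likelihood."""
    return max(models, key=lambda tone: forward_log_likelihood(obs, *models[tone]))

# Toy (hypothetical) codebook and two-state left-to-right models.
codebook = [(-0.1, -0.05), (0.0, 0.0), (0.1, 0.05)]
left_to_right = [[0.6, 0.4], [0.0, 1.0]]
models = {
    "rising": ([1.0, 0.0], left_to_right, [[0.1, 0.3, 0.6], [0.05, 0.15, 0.8]]),
    "falling": ([1.0, 0.0], left_to_right, [[0.6, 0.3, 0.1], [0.8, 0.15, 0.05]]),
}
f0_track = [180.0, 190.0, 200.0, 212.0, 225.0]
symbols = [quantize(v, codebook) for v in pitch_features(f0_track)]
predicted = classify_tone(symbols, models)
```

For a disyllable, the same per-tone models would be scored separately on each syllable's symbol sequence, which is consistent with the abstract's observation that models trained on monosyllables perform slightly worse on second syllables.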

On the Application of Vector Quantization and Hidden Markov Models to Speaker-Independent, Isolated Word Recognition

Bell System Technical Journal, 10.1002/j.1538-7305.1983.tb03115.x, 1983, Vol 62 (4), pp. 1075-1105
Cited By ~ 204
Author(s): L. R. Rabiner, S. E. Levinson, M. M. Sondhi
Keyword(s): Word Recognition, Hidden Markov Models, Vector Quantization, Markov Models, Hidden Markov, Speaker Independent, Isolated Word, Isolated Word Recognition

Speaker-independent isolated-digit recognition based on hidden Markov models and multiple vocabulary specific vector quantization

[1991] IEEE Pacific Rim Conference on Communications, Computers and Signal Processing Conference Proceedings, 10.1109/pacrim.1991.160702, 2002
Cited By ~ 1
Author(s): L. Cossette, E. Velez, V. Cuperman
Keyword(s): Hidden Markov Models, Vector Quantization, Markov Models, Hidden Markov, Digit Recognition, Speaker Independent

Speaker-independent isolated word recognition using word-based vector quantization and hidden Markov models

10.1109/icassp.1987.1169792, 2005
Cited By ~ 1
Author(s): Y. Cheung, S. Leung
Keyword(s): Word Recognition, Hidden Markov Models, Vector Quantization, Markov Models, Hidden Markov, Speaker Independent, Isolated Word, Isolated Word Recognition

Speaker-independent French digits recognition using word-based vector quantization and hidden Markov models

10.1109/icassp.1986.1168582, 2005
Cited By ~ 2
Author(s): A. Tassy, L. Miclet
Keyword(s): Hidden Markov Models, Vector Quantization, Markov Models, Hidden Markov, Speaker Independent

Supported Diagnosis of Attention Deficit and Hyperactivity Disorder from EEG Based on Interpretable Kernels for Hidden Markov Models

International Journal of Neural Systems, 10.1142/s0129065722500083, 2022
Author(s): M. C. Maya-Piedrahita, P. M. Herrera-Gomez, L. Berrío-Mesa, D. A. Cárdenas-Peña, A. A. Orozco-Gutierrez
Keyword(s): Hidden Markov Models, Attention Deficit, Markov Models, Hidden Markov, Stop Signal Task, Support Vector, Stochastic Dynamic, Eeg Signals, Accuracy Rate, Hyperactivity Disorder

As a neurodevelopmental pathology, Attention Deficit Hyperactivity Disorder (ADHD) mainly arises during childhood. ADHD is characterized by persistent patterns of generalized inattention, impulsivity, or hyperactivity that may persist into adulthood. The conventional diagnosis relies on clinical observation, which yields high rates of overdiagnosis because of varying interpretations among specialists or missing information. Although several studies have designed objective behavioral features to overcome this issue, they lack significance. Although electroencephalography (EEG) analyses extract alternative biomarkers using signal processing techniques, the nonlinearity and nonstationarity of EEG signals limit the performance and generalization of hand-crafted features. This work proposes a methodology to support ADHD diagnosis by characterizing EEG signals with hidden Markov models (HMMs), classifying subjects based on similarity measures for probability functions, and spatially interpreting the results using graphic embeddings of stochastic dynamic models. The methodology learns a single HMM for the EEG signal of each patient, thereby favoring inter-subject variability. Then, the Probability Product Kernel, specifically developed for assessing the similarity between HMMs, feeds a support vector machine that classifies subjects according to their stochastic dynamics. Lastly, the kernel variant of Principal Component Analysis provides a means to visualize the EEG transitions in a two-dimensional space, evidencing dynamic differences between ADHD and Healthy Control children. From the electrophysiological perspective, we recorded EEG under the Stop Signal Task modified with reward levels, which considers cognitive features of interest such as insufficient recruitment of motivational circuits.
The methodology compares the supported diagnosis in two EEG channel setups (the whole channel set and channels of interest in the frontocentral area) and four frequency bands (Theta, Alpha, and Beta rhythms, and a wideband). Results show an accuracy rate of 97.0% in the Beta band and in the channels where previous works found error-related negativity events. This accuracy rate strongly supports the dual pathway hypothesis and the motivational deficit concerning the pathophysiology of ADHD. It also demonstrates the utility of combining inhibitory and motivational paradigms with dynamic EEG analysis into a noninvasive and affordable diagnostic tool for ADHD patients.
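
To give a feel for the similarity measure mentioned in the abstract, the sketch below computes a probability product kernel between discrete probability distributions, K(p, q) = sum over x of p(x)^rho * q(x)^rho, and builds the Gram matrix that a kernel classifier would consume. This is a deliberate simplification and an assumption of this example: the paper applies the kernel to whole HMMs, which requires summing over observation sequences, whereas here each subject is stood in for by a single hypothetical discrete distribution. The function names and the toy values are illustrative only.

```python
def probability_product_kernel(p, q, rho=0.5):
    """Probability product kernel between two discrete distributions.

    With rho = 0.5 this is the Bhattacharyya kernel, so K(p, p) = 1
    for any normalized p.
    """
    if len(p) != len(q):
        raise ValueError("distributions must share the same support")
    return sum((a ** rho) * (b ** rho) for a, b in zip(p, q))

def gram_matrix(dists, rho=0.5):
    """Pairwise kernel values, usable as a precomputed kernel for an SVM."""
    return [[probability_product_kernel(p, q, rho) for q in dists] for p in dists]

# Hypothetical per-subject distributions over three discrete states.
subjects = [
    [0.7, 0.2, 0.1],
    [0.6, 0.3, 0.1],
    [0.1, 0.2, 0.7],
]
K = gram_matrix(subjects)
```

The resulting matrix is symmetric with ones on the diagonal when rho is 0.5, and distributions that place mass on the same states receive larger kernel values than those that do not.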
