RECOGNITION OF LEXICAL TONES FOR ISOLATED SYLLABLES AND DISYLLABLES IN MANDARIN SPEECH

Author(s):  
WU-JI YANG ◽  
JYH-CHYANG LEE ◽  
YUEH-CHIN CHANG ◽  
HSIAO-CHUAN WANG

This study purposes a method for recognizing the lexical tones in Mandarin speech. The method is based on Vector Quantization (VQ) and Hidden Markov Models (HMM). The pitch periods are extracted to derive the feature vectors which represent pitch height and pitch contour slope. One HMM is trained by the feature vectors of monosyllables for each tone. Then the HMMs are used to recognize the tone of monosyllables and disyllables. For the monosyllables, the accuracy rate can be 93.75% for speaker-independent cases. For the disyllables, the accuracy rates are 93% for the first syllables and 90% for the second syllables. It shows that the tone of the second syllable may be affected by the preceding syllable. This degradation also reveals the fact of tone variation in Mandarin speech.

Author(s):  
M. C. Maya-Piedrahita ◽  
P. M. Herrera-Gomez ◽  
L. Berrío-Mesa ◽  
D. A. Cárdenas-Peña ◽  
A. A. Orozco-Gutierrez

As a neurodevelopmental pathology, Attention Deficit Hyperactivity Disorder (ADHD) mainly arises during childhood. Persistent patterns of generalized inattention, impulsivity, or hyperactivity characterize ADHD that may persist into adulthood. The conventional diagnosis relies on clinical observational processes yielding high rates of overdiagnosis due to varying interpretations among specialists or missing information. Although several studies have designed objective behavioral features to overcome such an issue, they lack significance. Despite electroencephalography (EEG) analyses extracting alternative biomarkers using signal processing techniques, the nonlinearity and nonstationarity of EEG signals restrain performance and generalization of hand-crafted features. This work proposes a methodology to support ADHD diagnosis by characterizing EEG signals from hidden Markov models (HMM), classifying subjects based on similarity measures for probability functions, and spatially interpreting the results using graphic embeddings of stochastic dynamic models. The methodology learns a single HMM for EEG signal from each patient, so favoring the inter-subject variability. Then, the Probability Product Kernel, specifically developed for assessing the similarity between HMMs, fed a support vector machine that classifies subjects according to their stochastic dynamics. Lastly, the kernel variant of Principal Component Analysis provided a means to visualize the EEG transitions in a two-dimensional space, evidencing dynamic differences between ADHD and Healthy Control children. From the electrophysiological perspective, we recorded EEG under the Stop Signal Task modified with reward levels, which considers cognitive features of interest as insufficient motivational circuits recruitment. The methodology compares the supported diagnosis in two EEG channel setups (whole channel set and channels of interest in frontocentral area) and four frequency bands (Theta, Alpha, Beta rhythms, and a wideband). Results evidence an accuracy rate of 97.0% in the Beta band and in the channels where previous works found error-related negativity events. Such accuracy rate strongly supports the dual pathway hypothesis and motivational deficit concerning the pathophysiology of ADHD. It also demonstrates the utility of joining inhibitory and motivational paradigms with dynamic EEG analysis into a noninvasive and affordable diagnostic tool for ADHD patients.


Sign in / Sign up

Export Citation Format

Share Document