scholarly journals Comparisons of Speech Parameterisation Techniques for Classification of Intellectual Disability Using Machine Learning

2022 ◽  
pp. 828-847
Author(s):  
Gaurav Aggarwal ◽  
Latika Singh

Classification of intellectually disabled children through manual assessment of speech at an early age is inconsistent, subjective, time-consuming and prone to error. This study attempts to classify the children with intellectual disabilities using two speech feature extraction techniques: Linear Predictive Coding (LPC) based cepstral parameters, and Mel-frequency cepstral coefficients (MFCC). Four different classification models: k-nearest neighbour (k-NN), support vector machine (SVM), linear discriminant analysis (LDA) and radial basis function neural network (RBFNN) are employed for classification purposes. 48 speech samples of each group are taken for analysis, from subjects with a similar age and socio-economic background. The effect of the different frame length with the number of filterbanks in the MFCC and different frame length with the order in the LPC is also examined for better accuracy. The experimental outcomes show that the projected technique can be used to help speech pathologists in estimating intellectual disability at early ages.

Author(s):  
Gaurav Aggarwal ◽  
Latika Singh

Classification of intellectually disabled children through manual assessment of speech at an early age is inconsistent, subjective, time-consuming and prone to error. This study attempts to classify the children with intellectual disabilities using two speech feature extraction techniques: Linear Predictive Coding (LPC) based cepstral parameters, and Mel-frequency cepstral coefficients (MFCC). Four different classification models: k-nearest neighbour (k-NN), support vector machine (SVM), linear discriminant analysis (LDA) and radial basis function neural network (RBFNN) are employed for classification purposes. 48 speech samples of each group are taken for analysis, from subjects with a similar age and socio-economic background. The effect of the different frame length with the number of filterbanks in the MFCC and different frame length with the order in the LPC is also examined for better accuracy. The experimental outcomes show that the projected technique can be used to help speech pathologists in estimating intellectual disability at early ages.


Electrocardiogram (ECG) examination via computer techniques that involve feature extraction, pre-processing and post-processing was implemented due to its significant advantages. Extracting ECG signal standard features that requires high processing operation level was the main focusing point for many studies. In this paper, up to 6 different ECG signal classes are accurately predicted in the absence of ECG feature extraction. The corner stone of the proposed technique in this paper is the Linear predictive coding (LPC) technique that regress and normalize the signal during the pre-processing phase. Prior to the feature extraction using Wavelet energy (WE), a direct Wavelet transform (DWT) is implemented that converted ECG signal to frequency domain. In addition, the dataset was divided into two parts , one for training and the other for testing purposes Which have been classified in this proposed algorithm using support vector machine (SVM). Moreover, using MIT AI2 Companion was developed by MIT Center for Mobile Learning, the classification result was shared to the patient mobile phone that can call the ambulance and send the location in case of serious emergency. Finally, the confusion matrix values are used to measure the proposed classification performance. For 6 different ECG classes, an accuracy ration of about 98.15% was recorded. This ratio became 100% for 3 ECG signal classes and decreases to 97.95% by increasing ECG signal to 7 classes.


2017 ◽  
Vol 24 (2) ◽  
pp. 17-26
Author(s):  
Mustafa Yagimli ◽  
Huseyin Kursat Tezer

Abstract The real-time voice command recognition system used for this study, aims to increase the situational awareness, therefore the safety of navigation, related especially to the close manoeuvres of warships, and the courses of commercial vessels in narrow waters. The developed system, the safety of navigation that has become especially important in precision manoeuvres, has become controllable with voice command recognition-based software. The system was observed to work with 90.6% accuracy using Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) parameters and with 85.5% accuracy using Linear Predictive Coding (LPC) and DTW parameters.


2018 ◽  
Vol 61 (5) ◽  
pp. 1497-1504
Author(s):  
Zhenjie Wang ◽  
Ke Sun ◽  
Lihui Du ◽  
Jian Yuan ◽  
Kang Tu ◽  
...  

Abstract. In this study, computer vision was used for the identification and classification of fungi on moldy paddy. To develop a rapid and efficient method for the classification of common fungal species found in stored paddy, computer vision was used to acquire images of individual colonies of growing fungi for three consecutive days. After image processing, the color, shape, and texture features were acquired and used in a subsequent discriminant analysis. Both linear (i.e., linear discriminant analysis and partial least squares discriminant analysis) and nonlinear (i.e., random forest and support vector machine [SVM]) pattern recognition models were employed for the classification of fungal colonies, and the results were compared. The results indicate that when using all of the features for three consecutive days, the performance of the nonlinear tools was superior to that of the linear tools, especially in the case of the SVM models, which achieved an accuracy of 100% on the calibration sets and an accuracy of 93.2% to 97.6% on the prediction sets. After sequential selection of projection algorithm, ten common features were selected for building the classification models. The results showed that the SVM model achieved an overall accuracy of 95.6%, 98.3%, and 99.0% on the prediction sets on days 2, 3, and 4, respectively. This work demonstrated that computer vision with several features is suitable for the identification and classification of fungi on moldy paddy based on the form of the individual colonies at an early growth stage during paddy storage. Keywords: Classification, Computer vision, Fungal colony, Feature selection, SVM.


Sensors ◽  
2020 ◽  
Vol 20 (16) ◽  
pp. 4629 ◽  
Author(s):  
Ciaran Cooney ◽  
Attila Korik ◽  
Raffaella Folli ◽  
Damien Coyle

Classification of electroencephalography (EEG) signals corresponding to imagined speech production is important for the development of a direct-speech brain–computer interface (DS-BCI). Deep learning (DL) has been utilized with great success across several domains. However, it remains an open question whether DL methods provide significant advances over traditional machine learning (ML) approaches for classification of imagined speech. Furthermore, hyperparameter (HP) optimization has been neglected in DL-EEG studies, resulting in the significance of its effects remaining uncertain. In this study, we aim to improve classification of imagined speech EEG by employing DL methods while also statistically evaluating the impact of HP optimization on classifier performance. We trained three distinct convolutional neural networks (CNN) on imagined speech EEG using a nested cross-validation approach to HP optimization. Each of the CNNs evaluated was designed specifically for EEG decoding. An imagined speech EEG dataset consisting of both words and vowels facilitated training on both sets independently. CNN results were compared with three benchmark ML methods: Support Vector Machine, Random Forest and regularized Linear Discriminant Analysis. Intra- and inter-subject methods of HP optimization were tested and the effects of HPs statistically analyzed. Accuracies obtained by the CNNs were significantly greater than the benchmark methods when trained on both datasets (words: 24.97%, p < 1 × 10–7, chance: 16.67%; vowels: 30.00%, p < 1 × 10–7, chance: 20%). The effects of varying HP values, and interactions between HPs and the CNNs were both statistically significant. The results of HP optimization demonstrate how critical it is for training CNNs to decode imagined speech.


2012 ◽  
Vol 8 (S295) ◽  
pp. 180-180
Author(s):  
He Ma ◽  
Yanxia Zhang ◽  
Yongheng Zhao ◽  
Bo Zhang

AbstractIn this work, two different algorithms: Linear Discriminant Analysis (LDA) and Support Vector Machines (SVMs) are combined for the classification of unresolved sources from SDSS DR8 and UKIDSS DR8. The experimental result shows that this joint approach is effective for our case.


OENO One ◽  
2019 ◽  
Vol 53 (4) ◽  
Author(s):  
Giuseppina P. Parpinello ◽  
Arianna Ricci ◽  
Panagiotis Arapitsas ◽  
Andrea Curioni ◽  
Luigi Moio ◽  
...  

Aim: The aim of this study was to investigate the application of mid-infrared (MIR) spectroscopy combined with multivariate analysis, to provide a rapid screening tool for discriminating among different Italian monovarietal red wines based on the relationship between grape variety and wine composition in particular phenolic compounds.Methods and results: The MIR spectra (from 4000 to 700 cm‒1) of 110 monovarietal Italian red wines, vintage 2016, were collected and evaluated by selected multivariate data analyses, including principal component analysis (PCA), linear discriminant analysis (DA), support vector machine (SVM), and soft intelligent modelling of class analogy (SIMCA). Samples were collected directly from companies across different regions of Italy and included 11 grape varieties: Sangiovese, Nebbiolo, Aglianico, Nerello Mascalese, Primitivo, Raboso, Cannonau, Teroldego, Sagrantino, Montepulciano and Corvina. PCA showed five wavelengths that mainly contributed to the PC1, including a much-closed peak at 1043 cm‒1, which correspond to the C–O stretch absorption bands that are important regions for glycerol, whereas the ethanol peaks at around 1085 cm‒1. The band at 877 cm‒1 are related to the C–C stretching vibration of organic molecules, whereas the asymmetric stretching for C–O in the aromatic –OH group of polyphenols is within spectral regions from 1050 to 1165 cm‒1. In particular, the (1175)–1100–1060 cm‒1 vibrational bands are combination bands, involving C–O stretching and O–H deformation of phenolic rings. The 1166–1168 cm‒1 peak is attributable to in-plane bending deformations of C–H and C–O groups of polyphenols, respectively, for which polymerisation may cause a slight peak shift due to the formation of H-bridges.The best result was obtained with the SVM, which achieved an overall correct classification for up to 72.2% of the training set, and 44.4% for the validation set of wines, respectively. The Sangiovese wines (n=19) were split into two sub-groups (Sang-Romagna, n=12 and Sang-Tuscany, n=7) considering the indeterminacy of its origins, which is disputed between Romagna and Tuscany. Although the classification of three grape varieties was problematic (Nerello Mascalese, Raboso and Primitivo), the remaining wines were almost correctly assigned to their actual classes.Conclusions: MIR spectroscopy coupled with chemometrics represents an interesting approach for the classification of monovarietal Italian red wines, which is important in quality control and authenticity monitoring.Significance and impact of the study: Authenticity is a main issue in winemaking in terms of quality evaluation and adulteration, in particular for origin certified/protected wines, for which the added marketing value is related to the link of grape variety with the area of origin. This study is part of the D-wine project “The diversity of tannins in Italian red wines”.


2020 ◽  
Vol 10 (19) ◽  
pp. 6724
Author(s):  
Youngwook Seo ◽  
Ahyeong Lee ◽  
Balgeum Kim ◽  
Jongguk Lim

(1) Background: The general use of food-processing facilities in the agro-food industry has increased the risk of unexpected material contamination. For instance, grain flours have similar colors and shapes, making their detection and isolation from each other difficult. Therefore, this study is aimed at verifying the feasibility of detecting and isolating grain flours by using hyperspectral imaging technology and developing a classification model of grain flours. (2) Methods: Multiple hyperspectral images were acquired through line scanning methods from reflectance of visible and near-infrared wavelength (400–1000 nm), reflectance of shortwave infrared wavelength (900–1700 nm), and fluorescence (400–700 nm) by 365 nm ultraviolet (UV) excitation. Eight varieties of grain flours were prepared (rice: 4, starch: 4), and the particle size and starch damage content were measured. To develop the classification model, four multivariate analysis methods (linear discriminant analysis (LDA), partial least-square discriminant analysis, support vector machine, and classification and regression tree) were implemented with several pre-processing methods, and their classification results were compared with respect to accuracy and Cohen’s kappa coefficient obtained from confusion matrices. (3) Results: The highest accuracy was achieved as 97.43% through short-wavelength infrared with normalization in the spectral domain. The submission of the developed classification model to the hyperspectral images showed that the fluorescence method achieves the highest accuracy of 81% using LDA. (4) Conclusions: In this study, the potential of non-destructive classification of rice and starch flours using multiple hyperspectral modalities and chemometric methods were demonstrated.


Sign in / Sign up

Export Citation Format

Share Document