On the Comparison of Line Spectral Frequencies and Mel-Frequency Cepstral Coefficients Using Feedforward Neural Network for Language Identification

Author(s):  
Teddy Surya Gunawan ◽  
Mira Kartiwi

Of the many audio features available, this paper focuses on a comparison of the two most popular features, namely line spectral frequencies (LSF) and Mel-frequency cepstral coefficients (MFCC). We trained a feedforward neural network with various numbers of hidden layers and hidden nodes to identify five languages: Arabic, Chinese, English, Korean, and Malay. LSF, MFCC, and the combination of both were extracted as feature vectors. Systematic experiments were conducted to find the optimum parameters, i.e. sampling frequency, frame size, model order, and neural network structure. The recognition rate per frame was converted to a recognition rate per audio file using majority voting. On average, the recognition rates for LSF, MFCC, and the combination of both features are 96%, 92%, and 96%, respectively. Therefore, LSF is the most suitable feature for language identification with a feedforward neural network classifier.
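
As a rough illustration of the feature-extraction and majority-voting steps described above, the sketch below computes per-frame LSF (via an LPC-to-LSF conversion) and MFCC vectors with librosa and then collapses per-frame predictions into a single label per file. The sampling rate, frame size, LPC order, MFCC dimension, and the dummy per-frame predictions are assumptions for illustration, not the parameters tuned in the paper.

```python
import numpy as np
import librosa


def lpc_to_lsf(a):
    """Convert LPC coefficients [1, a1, ..., ap] to line spectral frequencies (radians)."""
    ext = np.append(a, 0.0)                     # A(z) padded to degree p+1
    P = ext + ext[::-1]                         # symmetric polynomial
    Q = ext - ext[::-1]                         # antisymmetric polynomial
    roots = np.concatenate([np.roots(P), np.roots(Q)])
    angles = np.angle(roots)
    # Keep one angle per conjugate pair, dropping the trivial roots at 0 and pi
    return np.sort(angles[(angles > 1e-4) & (angles < np.pi - 1e-4)])


# A synthetic tone stands in for a real utterance (illustrative settings only)
sr = 8000
y = librosa.tone(440, sr=sr, duration=1.0) + 0.01 * np.random.default_rng(0).normal(size=sr)
frame_len = int(0.03 * sr)                      # 30 ms frames, no overlap

frames = librosa.util.frame(y, frame_length=frame_len, hop_length=frame_len).T
lsf = np.array([lpc_to_lsf(librosa.lpc(f, order=10)) for f in frames])
mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13, n_fft=frame_len, hop_length=frame_len).T

n = min(len(lsf), len(mfcc))
features = np.hstack([lsf[:n], mfcc[:n]])       # combined LSF + MFCC feature vector per frame

# Majority voting: per-frame predictions from the trained network -> one label per file.
# `frame_preds` holds dummy values in place of real feedforward-network outputs.
frame_preds = np.array([0, 0, 2, 0, 1, 0, 0])
file_label = np.bincount(frame_preds).argmax()
print("predicted language index:", file_label)
```

The `features` matrix would be fed to the feedforward classifier; majority voting then assigns the whole file the language predicted for most of its frames.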

2021 ◽  
Author(s):  
Yunfan Chen ◽  
Chong Zhang ◽  
Chengyu Liu ◽  
Yiming Wang ◽  
Xiangkui Wan

Atrial fibrillation is one of the most common arrhythmias in the clinic and has a great impact on people's physical and mental health. Electrocardiogram (ECG) based arrhythmia detection is widely used for early atrial fibrillation detection. However, ECGs need to be checked manually in clinical practice, which is time-consuming and labor-intensive, so an automatic atrial fibrillation detection system is needed. Recent research has demonstrated that deep learning can improve the performance of automatic ECG classification models. To this end, this work proposes an effective deep learning based method to detect atrial fibrillation automatically. First, novel preprocessing algorithms based on the wavelet transform and sliding window filtering (SWF) are introduced to reduce noise in the ECG signal and to filter its high-frequency components, respectively. Then, a robust R-wave detection algorithm is developed, which achieves 99.22% detection sensitivity, a 98.55% positive recognition rate, and 2.25% deviance on the MIT-BIH arrhythmia database. In addition, we propose a feedforward neural network (FNN) to detect atrial fibrillation from ECG records. Experiments verified with a 10-fold cross-validation strategy show that the proposed model achieves competitive detection performance and can be applied to wearable detection devices. The proposed atrial fibrillation detection model achieves an accuracy of 84.00%, a detection sensitivity of 84.26%, a specificity of 93.23%, and an area under the receiver operating characteristic curve of 89.40% on a mixed dataset composed of the Challenge2017 and MIT-BIH arrhythmia databases.
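
The sketch below shows one plausible reading of the preprocessing and R-wave detection steps: wavelet-based soft-threshold denoising with PyWavelets, a moving-average sliding window filter, and a simple amplitude-plus-refractory-period peak picker from SciPy. The wavelet family, threshold rule, window width, and peak criterion are assumptions, not the paper's exact algorithm.

```python
import numpy as np
import pywt                             # PyWavelets
from scipy.signal import find_peaks


def wavelet_denoise(ecg, wavelet="db4", level=4):
    """Soft-threshold DWT detail coefficients to suppress ECG noise
    (wavelet, level and threshold rule are illustrative assumptions)."""
    coeffs = pywt.wavedec(ecg, wavelet, level=level)
    sigma = np.median(np.abs(coeffs[-1])) / 0.6745           # noise estimate from the finest scale
    thr = sigma * np.sqrt(2 * np.log(len(ecg)))               # universal threshold
    coeffs[1:] = [pywt.threshold(c, thr, mode="soft") for c in coeffs[1:]]
    return pywt.waverec(coeffs, wavelet)[: len(ecg)]


def sliding_window_filter(ecg, width=5):
    """Moving-average smoothing as one plausible reading of the SWF step."""
    return np.convolve(ecg, np.ones(width) / width, mode="same")


def detect_r_peaks(ecg, fs=360):
    """Naive R-wave picker: adaptive amplitude threshold plus a 200 ms refractory
    period (fs=360 Hz matches the MIT-BIH arrhythmia database)."""
    clean = sliding_window_filter(wavelet_denoise(ecg))
    height = clean.mean() + 1.5 * clean.std()                 # illustrative threshold
    peaks, _ = find_peaks(clean, height=height, distance=int(0.2 * fs))
    return peaks


# Quick check on a synthetic 60 bpm spike train standing in for a real ECG record
fs = 360
rng = np.random.default_rng(0)
ecg = np.zeros(10 * fs)
ecg[::fs] = 1.0                                               # one "R wave" per second
ecg += 0.05 * rng.normal(size=ecg.size)
print("R peaks found:", len(detect_r_peaks(ecg, fs=fs)))
```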


2010 ◽  
Vol 20 (1) ◽  
pp. 33-38 ◽  
Author(s):  
Rafal Pietruch ◽  
Antoni Grzanka

The paper addresses the problem of isolated vowel recognition in patients after total laryngectomy. The visual and acoustic speech modalities were incorporated separately into the machine learning algorithms. The authors used Mel-frequency cepstral coefficients (MFCC) as acoustic descriptors of the speech signal. The lip contour was extracted from video of the speaking faces using the OpenCV software library. In the vowel recognition procedure, three types of classifiers were compared: Artificial Neural Networks, Support Vector Machines, and Naive Bayes. The highest recognition rate was obtained with Support Vector Machines. For a group of laryngectomees with varying speech quality, the authors achieved recognition rates of 75% for the acoustic modality and 40% for the visual modality. This is a higher recognition rate than in previous research, in which 10 cross-sectional areas of the vocal tract were estimated. With the presented image processing algorithm, the visual features can be extracted automatically from the video signal.
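
A minimal sketch of the three-way classifier comparison, using scikit-learn stand-ins (MLPClassifier for the ANN, SVC for the SVM, and GaussianNB for Naive Bayes). The feature matrix is random data in place of the real MFCC or lip-contour descriptors, and the hyperparameters are assumptions rather than the authors' settings.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# X: per-vowel feature vectors (MFCCs or lip-contour descriptors), y: vowel labels.
# Random data stands in for the real recordings of six Polish-like vowel classes.
rng = np.random.default_rng(0)
X, y = rng.normal(size=(120, 13)), rng.integers(0, 6, size=120)

classifiers = {
    "ANN": MLPClassifier(hidden_layer_sizes=(32,), max_iter=2000, random_state=0),
    "SVM": SVC(kernel="rbf", C=1.0),
    "Naive Bayes": GaussianNB(),
}
for name, clf in classifiers.items():
    # 5-fold cross-validated recognition rate for each classifier
    scores = cross_val_score(make_pipeline(StandardScaler(), clf), X, y, cv=5)
    print(f"{name}: mean recognition rate {scores.mean():.2f}")
```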

