Acoustic Classification of Singing Insects Based on MFCC/LFCC Fusion

This work introduces a new approach to the automatic identification of crickets, katydids and cicadas from their acoustic signals, with the aim of building a tool for identifying this biodiversity. The study proposes a sound parameterization technique designed specifically for the identification and classification of insect acoustic signals, using Mel Frequency Cepstral Coefficients (MFCC) and Linear Frequency Cepstral Coefficients (LFCC). The two sets of coefficients are evaluated individually, as in previous studies, and compared with the fusion proposed in this work. The fusion yields a marked increase in identification and classification accuracy at the species level, reaching a success rate of 98.07% on 343 insect species.
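As a rough illustration of what fusing MFCC and LFCC features can look like, the sketch below computes both coefficient sets for a single frame and concatenates them. It is not the authors' pipeline: the filter count, coefficient count, Hamming window, concatenation as the fusion rule, and all function names are assumptions made for the example.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def triangular_filterbank(n_filters, n_fft, sample_rate, scale):
    """Triangular filters with edges equally spaced on the mel or linear scale."""
    nyquist = sample_rate / 2.0
    if scale == "mel":
        edges_hz = mel_to_hz(np.linspace(0.0, hz_to_mel(nyquist), n_filters + 2))
    else:
        edges_hz = np.linspace(0.0, nyquist, n_filters + 2)
    bin_freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    bank = np.zeros((n_filters, bin_freqs.size))
    for i in range(n_filters):
        lo, mid, hi = edges_hz[i], edges_hz[i + 1], edges_hz[i + 2]
        rising = (bin_freqs - lo) / (mid - lo)
        falling = (hi - bin_freqs) / (hi - mid)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

def dct2(x, n_coeffs):
    """First n_coeffs terms of an unnormalised DCT-II."""
    n = x.shape[-1]
    k = np.arange(n_coeffs)[:, None]
    basis = np.cos(np.pi * k * (np.arange(n) + 0.5) / n)
    return x @ basis.T

def cepstral_coeffs(frame, sample_rate, scale, n_filters=20, n_coeffs=13):
    """Cepstral coefficients of one frame; scale is 'mel' (MFCC) or 'linear' (LFCC)."""
    windowed = frame * np.hamming(frame.size)
    power = np.abs(np.fft.rfft(windowed)) ** 2
    bank = triangular_filterbank(n_filters, frame.size, sample_rate, scale)
    log_energy = np.log(bank @ power + 1e-10)  # small floor avoids log(0)
    return dct2(log_energy, n_coeffs)

def fused_features(frame, sample_rate):
    """Assumed fusion rule: concatenate the MFCC and LFCC vectors."""
    mfcc = cepstral_coeffs(frame, sample_rate, "mel")
    lfcc = cepstral_coeffs(frame, sample_rate, "linear")
    return np.concatenate([mfcc, lfcc])

# Synthetic 5 kHz tone standing in for an insect call (illustrative only).
sample_rate = 44100
t = np.arange(1024) / sample_rate
frame = np.sin(2 * np.pi * 5000.0 * t)
features = fused_features(frame, sample_rate)
print(features.shape)  # (26,)
```

The linear filterbank keeps uniform resolution at high frequencies, where the mel scale is coarse, which is one plausible reason the two feature sets could complement each other.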


SMCS: Automatic Real-Time Classification of Ambient Sounds, Based on a Deep Neural Network and Mel Frequency Cepstral Coefficients

Communications in Computer and Information Science - Applied Technologies ◽

10.1007/978-3-030-42520-3_20 ◽

2020 ◽

pp. 245-253

Author(s):

María José Mora-Regalado ◽

Omar Ruiz-Vivanco ◽

Alexandra González-Eras ◽

Pablo Torres-Carrión

Keyword(s):

Neural Network ◽

Real Time ◽

Deep Neural Network ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients ◽

Real Time Classification


Voice Disorder Classification Based on Multitaper Mel Frequency Cepstral Coefficients Features

Computational and Mathematical Methods in Medicine ◽

10.1155/2015/956249 ◽

2015 ◽

Vol 2015 ◽

pp. 1-12 ◽

Cited By ~ 7

Author(s):

Ömer Eskidere ◽

Ahmet Gürhanlı

Keyword(s):

Voice Disorder ◽

Mel Frequency Cepstral Coefficients ◽

Essential Information ◽

Audio Processing ◽

Voice Signal ◽

Window Technique ◽

Normal Voice ◽

Cepstral Coefficients ◽

Better Than

Mel Frequency Cepstral Coefficients (MFCCs) are widely used to extract essential information from a voice signal and have become a popular feature extractor in audio processing. However, MFCC features are usually calculated from a single window (taper), which gives spectral estimates with large variance. This study investigates reducing that variance for the classification of two voice qualities (normal voice and disordered voice) using multitaper MFCC features. We also compare their performance with that of newly proposed windowing techniques and the conventional single-taper technique. The results demonstrate that the adapted weighted Thomson multitaper method distinguishes between normal and disordered voice better than the conventional single-taper (Hamming window) technique and two newly proposed windowing methods. The multitaper MFCC features may be helpful in identifying voices at risk for a real pathology that has to be proven later.
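The variance argument can be checked numerically with a small sketch. Note the substitution: instead of the Thomson (DPSS) tapers with adapted weights described in the abstract, this example uses sine tapers with uniform weights, because they can be written in a few lines of NumPy. The taper count, frame length and function names are assumptions.

```python
import numpy as np

def sine_tapers(n_samples, n_tapers):
    """Orthonormal sine tapers (a simple stand-in for Thomson/DPSS tapers)."""
    n = np.arange(1, n_samples + 1)
    k = np.arange(1, n_tapers + 1)[:, None]
    return np.sqrt(2.0 / (n_samples + 1)) * np.sin(np.pi * k * n / (n_samples + 1))

def multitaper_power(frame, n_tapers=6):
    """Average the power spectra obtained with each taper (uniform weights)."""
    tapers = sine_tapers(frame.size, n_tapers)
    spectra = np.abs(np.fft.rfft(tapers * frame, axis=1)) ** 2
    return spectra.mean(axis=0)

def single_taper_power(frame):
    """Conventional estimate with one unit-energy Hamming window."""
    window = np.hamming(frame.size)
    window = window / np.sqrt(np.sum(window ** 2))
    return np.abs(np.fft.rfft(window * frame)) ** 2

def relative_spread(power):
    """Standard deviation across frequency bins divided by the mean."""
    return power.std() / power.mean()

# White noise has a flat true spectrum, so any spread is estimation noise.
rng = np.random.default_rng(0)
noise = rng.standard_normal(512)
tapers = sine_tapers(512, 6)
single = single_taper_power(noise)
multi = multitaper_power(noise)
print(relative_spread(single), relative_spread(multi))
```

In a multitaper MFCC front end, an averaged spectrum of this kind would replace the single-window periodogram before the mel filterbank and DCT are applied.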


Particle Swarm Optimisation of Mel-frequency Cepstral Coefficients computation for the classification of asphyxiated infant cry

2010 3rd International Conference on Biomedical Engineering and Informatics ◽

10.1109/bmei.2010.5639674 ◽

2010 ◽

Cited By ~ 4

Author(s):

A. Zabidi ◽

W. Mansor ◽

Y. K. Lee ◽

A. I. Mohd Yassin ◽

R. Sahak

Keyword(s):

Particle Swarm ◽

Particle Swarm Optimisation ◽

Mel Frequency Cepstral Coefficients ◽

Infant Cry ◽

Cepstral Coefficients


Fusion of Linear and Mel Frequency Cepstral Coefficients for Automatic Classification of Reptiles

Applied Sciences ◽

10.3390/app7020178 ◽

2017 ◽

Vol 7 (2) ◽

pp. 178 ◽

Cited By ~ 5

Author(s):

Juan Noda ◽

Carlos Travieso ◽

David Sánchez-Rodríguez

Keyword(s):

Automatic Classification ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients


Classification of Historical Documents Based on LBP and LPQ Techniques

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.c8579.019320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 1534-1539

Keyword(s):

Support Vector ◽

Historical Documents ◽

Automatic Identification ◽

Document Images ◽

K Nearest Neighbors ◽

New Approach ◽

Modern Language ◽

Handwritten Document ◽

Feature Values

Historical documents are an important source for understanding culture, language, social activities, educational systems, etc. They are written in different languages that evolved over centuries into present-day modern languages, which motivates tasks such as classifying documents into eras and recognizing words. In this paper, we propose a new approach to the automatic identification of the age of historical handwritten document images based on the LBP (Local Binary Pattern) and LPQ (Local Phase Quantization) algorithms. The experiments use the publicly available standard dataset of historical handwritten document images known as MPS (Medieval Paleographic Scale). LBP and LPQ descriptors are used to extract features from the document images. The documents are then classified on the basis of the discriminating feature values using K-NN (K-Nearest Neighbors) and SVM (Support Vector Machine) classifiers. The classification accuracies obtained with K-NN and SVM are 90.7% and 92.8%, respectively.
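A minimal sketch of the LBP-plus-nearest-neighbour idea follows. It covers only the basic 8-neighbour LBP histogram and a plain k-NN vote; LPQ, the SVM classifier and the MPS dataset are not reproduced. The striped toy images, labels and function names are invented for illustration.

```python
import numpy as np

def lbp_histogram(image):
    """Normalised 256-bin histogram of basic 3x3 Local Binary Pattern codes."""
    img = np.asarray(image, dtype=float)
    h, w = img.shape
    centre = img[1:-1, 1:-1]
    # Neighbour offsets (dy, dx), clockwise from the top-left pixel.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]
    codes = np.zeros(centre.shape, dtype=np.int64)
    for bit, (dy, dx) in enumerate(offsets):
        neighbour = img[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx]
        codes += (neighbour >= centre).astype(np.int64) << bit
    hist = np.bincount(codes.ravel(), minlength=256).astype(float)
    return hist / hist.sum()

def knn_predict(train_x, train_y, query, k=1):
    """Majority vote among the k training vectors closest to the query."""
    dists = np.linalg.norm(train_x - query, axis=1)
    nearest = np.argsort(dists)[:k]
    labels, counts = np.unique(train_y[nearest], return_counts=True)
    return labels[np.argmax(counts)]

def stripes(orientation, phase, size=16):
    """Toy texture: one-pixel stripes, horizontal or vertical."""
    ys, xs = np.mgrid[0:size, 0:size]
    return ((ys if orientation == "horizontal" else xs) + phase) % 2

# Two hypothetical texture classes stand in for document eras.
train_x = np.array([lbp_histogram(stripes("horizontal", 0)),
                    lbp_histogram(stripes("vertical", 0))])
train_y = np.array(["horizontal", "vertical"])
prediction = knn_predict(train_x, train_y, lbp_histogram(stripes("horizontal", 1)))
print(prediction)  # horizontal
```

Because the histogram discards pixel positions, the shifted query texture maps to the same feature vector as its training counterpart, which is the property that makes LBP histograms usable as texture descriptors.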


Exploring the front fricative contrast in Greek: A study of acoustic variability based on cepstral coefficients

Journal of the International Phonetic Association ◽

10.1017/s002510031900029x ◽

2020 ◽

pp. 1-32

Author(s):

Jason Lilley ◽

Laura Spinu ◽

Angeliki Athanasopoulou

Keyword(s):

Markov Models ◽

Hidden Markov ◽

Multinomial Logistic Regression ◽

Classification Method ◽

Multinomial Logistic Regression Analysis ◽

English Vowels ◽

The Mean ◽

Cepstral Coefficients ◽

Acoustic Classification

In the current study, we explore the factors underlying the well-known difficulty in acoustic classification of front nonsibilant fricatives (Maniwa, Jongman & Wade 2009, McMurray & Jongman 2011) by applying a novel classification method to the production of Greek speakers. The Greek fricative inventory [f v θ ð s z ç ʝ x ɣ] includes voiced and voiceless segments from five distinct places of articulation. Our corpus contains all of the Greek fricatives produced by 29 monolingual speakers, but our focus is on the distinction between the front nonsibilant fricatives [f v θ ð]. For comparison, we also discuss the other places of articulation where relevant. We apply a relatively novel classification method based on cepstral coefficients, previously successful in categorizing English obstruent bursts (Bunnell, Polikoff & McNicholas 2004), English vowels (Ferragne & Pellegrino 2010), Romanian fricatives (Spinu & Lilley 2016), and Russian fricatives (Spinu, Kochetov & Lilley 2018). For this study, fricative boundaries were automatically aligned using Hidden Markov Models (HMMs) and then manually checked. Six Bark-frequency cepstral coefficients (c0–c5) were extracted from 20-millisecond Hann windows. HMMs were used to divide the fricatives and adjacent vowels into three regions of internally minimized variance. A multinomial logistic regression analysis then used the mean cepstral coefficients from each region as predictors for classification by consonant identity. Our method yields highly successful classification rates, exceeding the performance of previous methods. We discuss these results in light of the differences of the phonemic distributions of fricatives between English and Greek.
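The feature layout described above (six coefficients from 20-millisecond Hann windows, averaged within three regions) can be sketched as follows. Several simplifications are assumed: the cepstra are computed on a linear frequency axis rather than the Bark scale, the three regions are equal thirds of the token rather than HMM-derived segments, the 10 ms hop is a guess, and the regression step is omitted. All names are hypothetical.

```python
import numpy as np

def frame_cepstra(signal, sample_rate, win_ms=20.0, hop_ms=10.0, n_coeffs=6):
    """Cepstral coefficients c0..c5 for each Hann-windowed frame.

    Linear-frequency cepstra are used here; the study uses Bark-frequency ones.
    """
    win = int(round(sample_rate * win_ms / 1000.0))
    hop = int(round(sample_rate * hop_ms / 1000.0))
    window = np.hanning(win)
    n_bins = win // 2 + 1
    k = np.arange(n_coeffs)[:, None]
    basis = np.cos(np.pi * k * (np.arange(n_bins) + 0.5) / n_bins)  # DCT-II
    rows = []
    for start in range(0, signal.size - win + 1, hop):
        frame = signal[start:start + win] * window
        log_power = np.log(np.abs(np.fft.rfft(frame)) ** 2 + 1e-10)
        rows.append(basis @ log_power)
    return np.array(rows)

def region_means(cepstra, n_regions=3):
    """Mean coefficients per region; equal thirds stand in for HMM regions."""
    regions = np.array_split(cepstra, n_regions, axis=0)
    return np.concatenate([r.mean(axis=0) for r in regions])

# 200 ms of noise as a placeholder for one fricative token.
sample_rate = 16000
rng = np.random.default_rng(1)
token = rng.standard_normal(int(0.2 * sample_rate))
cepstra = frame_cepstra(token, sample_rate)
predictors = region_means(cepstra)
print(cepstra.shape, predictors.shape)  # (19, 6) (18,)
```

Each token then contributes one 18-element predictor vector, which a multinomial logistic regression could take as input for classification by consonant identity.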
