scholarly journals Acoustic Classification of Singing Insects Based on MFCC/LFCC Fusion

2019 ◽  
Vol 9 (19) ◽  
pp. 4097 ◽  
Author(s):  
Juan J. Noda ◽  
Carlos M. Travieso-González ◽  
David Sánchez-Rodríguez ◽  
Jesús B. Alonso-Hernández

This work introduces a new approach for automatic identification of crickets, katydids and cicadas analyzing their acoustic signals. We propose the building of a tool to identify this biodiversity. The study proposes a sound parameterization technique designed specifically for identification and classification of acoustic signals of insects using Mel Frequency Cepstral Coefficients (MFCC) and Linear Frequency Cepstral Coefficients (LFCC). These two sets of coefficients are evaluated individually as has been done in previous studies and have been compared with the fusion proposed in this work, showing an outstanding increase in identification and classification at species level reaching a success rate of 98.07% on 343 insect species.

2015 ◽  
Vol 2015 ◽  
pp. 1-12 ◽  
Author(s):  
Ömer Eskidere ◽  
Ahmet Gürhanlı

The Mel Frequency Cepstral Coefficients (MFCCs) are widely used in order to extract essential information from a voice signal and became a popular feature extractor used in audio processing. However, MFCC features are usually calculated from a single window (taper) characterized by large variance. This study shows investigations on reducing variance for the classification of two different voice qualities (normal voice and disordered voice) using multitaper MFCC features. We also compare their performance by newly proposed windowing techniques and conventional single-taper technique. The results demonstrate that adapted weighted Thomson multitaper method could distinguish between normal voice and disordered voice better than the results done by the conventional single-taper (Hamming window) technique and two newly proposed windowing methods. The multitaper MFCC features may be helpful in identifying voices at risk for a real pathology that has to be proven later.


Historical documents are important source for knowing culture, language, social activities, educational system, etc. The historical documents are in different languages and evolved over centuries and transformed to present modern language, classification of documents into various eras, recognition of words etc. In this paper, we have proposed a new approach to automatic identification of the age of the historical handwritten document images based on LBP (Local Binary Pattern) and LPQ (Local Phase Quantization) algorithm. The standard historical handwritten document images named as MPS (Medieval Paleographic Scale) dataset which is publicly available is used to experiment. LBP and LPQ descriptors are used to extract the features of the historical document images. Further, documents are classified based on the discriminating feature values using classifiers namely K-NN (K-Nearest Neighbors) and SVM (Support Vector Machine) classifier. The accuracy of historical handwritten document images by K-NN and SVM are 90.7% and 92.8% respectively.


Author(s):  
Jason Lilley ◽  
Laura Spinu ◽  
Angeliki Athanasopoulou

In the current study, we explore the factors underlying the well-known difficulty in acoustic classification of front nonsibilant fricatives (Maniwa, Jongman & Wade 2009, McMurray & Jongman 2011) by applying a novel classification method to the production of Greek speakers. The Greek fricative inventory [f v θ ð s z ç ʝ x ɣ] includes voiced and voiceless segments from five distinct places of articulation. Our corpus contains all of the Greek fricatives produced by 29 monolingual speakers, but our focus is on the distinction between the front nonsibilant fricatives [f v θ ð]. For comparison, we also discuss the other places of articulation where relevant. We apply a relatively novel classification method based on cepstral coefficients, previously successful in categorizing English obstruent bursts (Bunnell, Polikoff & McNicholas 2004), English vowels (Ferragne & Pellegrino 2010), Romanian fricatives (Spinu & Lilley 2016), and Russian fricatives (Spinu, Kochetov & Lilley 2018). For this study, fricative boundaries were automatically aligned using Hidden Markov Models (HMMs) and then manually checked. Six Bark-frequency cepstral coefficients (c0–c5) were extracted from 20-millisecond Hann windows. HMMs were used to divide the fricatives and adjacent vowels into three regions of internally minimized variance. A multinomial logistic regression analysis then used the mean cepstral coefficients from each region as predictors for classification by consonant identity. Our method yields highly successful classification rates, exceeding the performance of previous methods. We discuss these results in light of the differences of the phonemic distributions of fricatives between English and Greek.


Sign in / Sign up

Export Citation Format

Share Document