Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling

As mentioned in Chapter II, there are two kinds of LDA approaches: classification- oriented LDA and feature extraction-oriented LDA. In most chapters of this session of the book, we focus our attention on the feature extraction aspect of LDA for SSS problems. On the other hand,, with this chapter we present our studies on the pattern classification aspect of LDA for SSS problems. In this chapter, we present three novel classification-oriented linear discriminant criteria. The first one is large margin linear projection (LMLP) which makes full use of the characteristic of the SSS problems. The second one is the minimum norm minimum squared-error criterion which is a modification of the minimum squared-error discriminant criterion. The third one is the maximum scatter difference which is a modification of the Fisher discriminant criterion.

Download Full-text

DNN acoustic modeling with modular multi-lingual feature extraction networks

2013 IEEE Workshop on Automatic Speech Recognition and Understanding ◽

10.1109/asru.2013.6707754 ◽

2013 ◽

Cited By ~ 6

Author(s):

Jonas Gehring ◽

Quoc Bao Nguyen ◽

Florian Metze ◽

Alex Waibel

Keyword(s):

Feature Extraction ◽

Acoustic Modeling

Download Full-text

Low-Rank Preserving t-Linear Projection for Robust Image Feature Extraction

IEEE Transactions on Image Processing ◽

10.1109/tip.2020.3031813 ◽

2021 ◽

Vol 30 ◽

pp. 108-120

Author(s):

Xiaolin Xiao ◽

Yongyong Chen ◽

Yue-Jiao Gong ◽

Yicong Zhou

Keyword(s):

Feature Extraction ◽

Image Feature ◽

Low Rank ◽

Image Feature Extraction ◽

Linear Projection ◽

Robust Image

Download Full-text

Performance Analysis of various Front-end and Back End Amalgamations for Noise-robust DNN-based ASR

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813999200730225301 ◽

2020 ◽

Vol 13 ◽

Author(s):

Mohit Dua ◽

Pawandeep Singh Sethi ◽

Vinam Agrawal ◽

Raghav Chawla

Keyword(s):

Feature Extraction ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Gaussian Mixture ◽

Performance Comparison ◽

Acoustic Modeling ◽

Extraction Techniques ◽

Front End ◽

Noise Robust ◽

Asr System

Introduction: An Automatic Speech Recognition (ASR) system enables to recognize the speech utterances and thus can be used to convert speech into text for various purposes. These systems are deployed in different environments such as clean or noisy and are used by all ages or types of people. These also present some of the major difficulties faced in the development of an ASR system. Thus, an ASR system need to be efficient, while also being accurate and robust. Our main goal is to minimize the error rate during training as well as testing phases, while implementing an ASR system. Performance of ASR depends upon different combinations of feature extraction techniques and back-end techniques. In this paper, using a continuous speech recognition system, the performance comparison of different combinations of feature extraction techniques and various types of back-end techniques has been presented Methods: Hidden Markov Models (HMMs), Subspace Gaussian Mixture Models (SGMMs) and Deep Neural Networks (DNNs) with DNN-HMM architecture, namely Karel's, Dan's and Hybrid DNN-SGMM architecture are used at the back-end of the implemented system. Mel frequency Cepstral Coefficient (MFCC), Perceptual Linear Prediction (PLP), and Gammatone Frequency Cepstral coefficients (GFCC) are used as feature extraction techniques at the front-end of the proposed system. Kaldi toolkit has been used for the implementation of the proposed work. The system is trained on the Texas Instruments-Massachusetts Institute of Technology (TIMIT) speech corpus for English language Results: The experimental results show that MFCC outperforms GFCC and PLP in noiseless conditions, while PLP tends to outperform MFCC and GFCC in noisy conditions. Furthermore, the hybrid of Dan's DNN implementation along with SGMM performs the best for the back-end acoustic modeling. The proposed architecture with PLP feature extraction technique in the front end and hybrid of Dan's DNN implementation along with SGMM at the back end outperforms the other combinations in a noisy environment. Conclusion: Automatic Speech recognition has numerous applications in our lives like Home automation, Personal assistant, Robotics etc. It is highly desirable to build an ASR system with good performance. The performance Automatic Speech Recognition is affected by various factors which include vocabulary size, whether system is speaker dependent or independent, whether speech is isolated, discontinuous or continuous, adverse conditions like noise. The paper presented an ensemble architecture that uses PLP for feature extraction at the front end and a hybrid of SGMM + Dan's DNN in the backend to build a noise robust ASR system Discussion: The presented work in this paper discusses the performance comparison of continuous ASR systems developed using different combinations of front-end feature extraction (MFCC, PLP, and GFCC) and back-end acoustic modeling (mono-phone, tri-phone, SGMM, DNN and hybrid DNN-SGMM) techniques. Each type of front-end technique is tested in combination with each type of back-end technique. Finally, it compares the results of the combinations thus formed, to find out the best performing combination in noisy and clean conditions

Download Full-text

Feature extraction and acoustic modeling: an approach for improved generalization across languages and accents

IEEE Workshop on Automatic Speech Recognition and Understanding, 2005. ◽

10.1109/asru.2005.1566527 ◽

2005 ◽

Cited By ~ 13

Author(s):

S. Dupont ◽

C. Ris ◽

O. Deroo ◽

S. Poitoux

Keyword(s):

Feature Extraction ◽

Acoustic Modeling

Download Full-text

Feature Fusion Using Complex Descriminator

Computational Intelligence and its Applications - Biometric Image Discrimination Technologies ◽

10.4018/978-1-59140-830-7.ch013 ◽

2006 ◽

pp. 329-350

Author(s):

David Zhang ◽

Xiao-Yuan Jing ◽

Jian Yang

Keyword(s):

Symmetry Property ◽

Feature Fusion ◽

Detailed Comparison ◽

Comparison Analysis ◽

Linear Projection ◽

Analysis Methods ◽

Projection Analysis ◽

Complex Linear ◽

Parallel Feature

This chapter describes feature fusion techniques using complex discriminator. After the introduction, we first introduce serial and parallel feature fusion strategies. Then, the complex linear projection analysis methods, complex PCA and complex LDA, are developed. Next, some feature preprocessing techniques are given. The symmetry property of parallel feature fusion is analyzed and revealed. Then, the proposed methods are applied to biometric applications, related experiments are performed and the detailed comparison analysis is exhibited. Finally, a summary is given.

Download Full-text

An Improved Quantitative Image Analyser

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100078663 ◽

1977 ◽

Vol 35 ◽

pp. 226-227

Author(s):

J.P. Fallon ◽

P.J. Gregory ◽

C.J. Taylor

Keyword(s):

Feature Extraction ◽

Quantitative Analysis ◽

Frequency Content ◽

General Sense ◽

Physical Measurements ◽

Quantitative Image ◽

Image Analyser ◽

Automatic Methods ◽

Quantitative Results

Quantitative image analysis systems have been used for several years in research and quality control applications in various fields including metallurgy and medicine. The technique has been applied as an extension of subjective microscopy to problems requiring quantitative results and which are amenable to automatic methods of interpretation.Feature extraction. In the most general sense, a feature can be defined as a portion of the image which differs in some consistent way from the background. A feature may be characterized by the density difference between itself and the background, by an edge gradient, or by the spatial frequency content (texture) within its boundaries. The task of feature extraction includes recognition of features and encoding of the associated information for quantitative analysis.Quantitative Analysis. Quantitative analysis is the determination of one or more physical measurements of each feature. These measurements may be straightforward ones such as area, length, or perimeter, or more complex stereological measurements such as convex perimeter or Feret's diameter.

Download Full-text

570 Catheter ablation for chronic atrial fibrillation using carto navigated circumferential and complex linear lesions

EP Europace ◽

10.1016/s1099-5129(05)80372-1 ◽

2005 ◽

Vol 7 ◽

pp. 126-126

Keyword(s):

Atrial Fibrillation ◽

Catheter Ablation ◽

Chronic Atrial Fibrillation ◽

Linear Lesions ◽

Complex Linear

Download Full-text

Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling

Data augmentation and feature extraction using variational autoencoder for acoustic modeling

Acoustic monitoring and classification of bee swarm activity using MFCC feature extraction and HMM acoustic modeling

Discriminant Criteria for Pattern Classification

DNN acoustic modeling with modular multi-lingual feature extraction networks

Low-Rank Preserving t-Linear Projection for Robust Image Feature Extraction

Performance Analysis of various Front-end and Back End Amalgamations for Noise-robust DNN-based ASR

Feature extraction and acoustic modeling: an approach for improved generalization across languages and accents

Feature Fusion Using Complex Descriminator

An Improved Quantitative Image Analyser

570 Catheter ablation for chronic atrial fibrillation using carto navigated circumferential and complex linear lesions

Export Citation Format