Rapid Discriminative Learning

Jun Rokui;

doi:10.20965/jaciii.2004.p0108

Rapid Discriminative Learning

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2004.p0108 ◽

2004 ◽

Vol 8 (2) ◽

pp. 108-114

Author(s):

Jun Rokui ◽

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Hierarchical Model ◽

Variable Length ◽

Discriminative Learning ◽

Learning Method ◽

Recognition Method ◽

Highly Effective ◽

Hierarchical Neural Network

This paper presents MCE/GPD using GPD that is known as a highly effective discriminative learning method. MCE/GPD is an excellent recognition method that is applicable especially to speech recognition, since it excels in recognizing performance and can be used to deal with variable-length vectors. MCE/GPD involves a problem of calculation resulting from c omplicated algorithms making it impractical. In this paper, we propose a learning method to increase speed at learning based on a hierarchical model. We used a hierarchical neural network to evaluate the method’s performance.

Download Full-text

Exploiting variable length segments with coarticulation effect in online speech recognition based on deep bidirectional recurrent neural network and context-sensitive segment

International Journal of Speech Technology ◽

10.1007/s10772-021-09885-1 ◽

2021 ◽

Author(s):

Song-Il Mun ◽

Chol-Jin Han ◽

Hye-Song Hong

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Recurrent Neural Network ◽

Variable Length ◽

Context Sensitive

Download Full-text

Speech recognition method based on genetic vector quantization and BP neural network

10.1117/12.836816 ◽

2009 ◽

Author(s):

Li'ai Gao ◽

Lihua Li ◽

Jian Zhou ◽

Qiuxia Zhao

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Vector Quantization ◽

Bp Neural Network ◽

Recognition Method

Download Full-text

A hierarchical neural network model based on a C/V segmentation algorithm for isolated Mandarin speech recognition

IEEE Transactions on Signal Processing ◽

10.1109/78.134458 ◽

1991 ◽

Vol 39 (9) ◽

pp. 2141-2146 ◽

Cited By ~ 48

Author(s):

Jhing-Fa Wang ◽

Chung-Hsien Wu ◽

Shih-Hung Chang ◽

Jau-Yien Lee Lee

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Network Model ◽

Neural Network Model ◽

Segmentation Algorithm ◽

Model Based ◽

Hierarchical Neural Network ◽

Mandarin Speech Recognition

Download Full-text

Speech recognition apparatus using neural network and learning method therefor

The Journal of the Acoustical Society of America ◽

10.1121/1.426923 ◽

1999 ◽

Vol 105 (5) ◽

pp. 2553

Author(s):

Mitsuhiro Inazumi

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Learning Method

Download Full-text

An Improved Tibetan Lhasa Speech Recognition Method Based on Deep Neural Network

2017 10th International Conference on Intelligent Computation Technology and Automation (ICICTA) ◽

10.1109/icicta.2017.74 ◽

2017 ◽

Cited By ~ 3

Author(s):

Wenbin Ruan ◽

Zhenye Gan ◽

Bin Liu ◽

Yin Guo

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Deep Neural Network ◽

Recognition Method

Download Full-text

A speech recognition method based clustering neural network integration

2011 International Conference on Electric Information and Control Engineering ◽

10.1109/iceice.2011.5777537 ◽

2011 ◽

Author(s):

Jing Zhang ◽

Min Zhang

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Network Integration ◽

Recognition Method

Download Full-text

Domain-Adversarial Based Model with Phonological Knowledge for Cross-Lingual Speech Recognition

Electronics ◽

10.3390/electronics10243172 ◽

2021 ◽

Vol 10 (24) ◽

pp. 3172

Author(s):

Qingran Zhan ◽

Xiang Xie ◽

Chenguang Hu ◽

Juan Zuluaga-Gomez ◽

Jing Wang ◽

...

Keyword(s):

Neural Network ◽

Neural Networks ◽

Speech Recognition ◽

Training Data ◽

Target Language ◽

Learning Method ◽

Acoustic Features ◽

Adversarial Learning ◽

Phonological Knowledge ◽

Cross Lingual

Phonological-based features (articulatory features, AFs) describe the movements of the vocal organ which are shared across languages. This paper investigates a domain-adversarial neural network (DANN) to extract reliable AFs, and different multi-stream techniques are used for cross-lingual speech recognition. First, a novel universal phonological attributes definition is proposed for Mandarin, English, German and French. Then a DANN-based AFs detector is trained using source languages (English, German and French). When doing the cross-lingual speech recognition, the AFs detectors are used to transfer the phonological knowledge from source languages (English, German and French) to the target language (Mandarin). Two multi-stream approaches are introduced to fuse the acoustic features and cross-lingual AFs. In addition, the monolingual AFs system (i.e., the AFs are directly extracted from the target language) is also investigated. Experiments show that the performance of the AFs detector can be improved by using convolutional neural networks (CNN) with a domain-adversarial learning method. The multi-head attention (MHA) based multi-stream can reach the best performance compared to the baseline, cross-lingual adaptation approach, and other approaches. More specifically, the MHA-mode with cross-lingual AFs yields significant improvements over monolingual AFs with the restriction of training data size and, which can be easily extended to other low-resource languages.

Download Full-text