Bridgenets: Student-Teacher Transfer Learning Based on Recursive Neural Networks and Its Application to Distant Speech Recognition

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2018.8462137 ◽

2018 ◽

Author(s):

Jaeyoung Kim ◽

Mostafa El-Khamy ◽

Jungwon Lee

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Transfer Learning ◽

Student Teacher ◽

Teacher Transfer ◽

Recursive Neural Networks

Download Full-text

A unified approach to transfer learning of deep neural networks with applications to speaker adaptation in automatic speech recognition

Neurocomputing ◽

10.1016/j.neucom.2016.09.018 ◽

2016 ◽

Vol 218 ◽

pp. 448-459 ◽

Author(s):

Zhen Huang ◽

Sabato Marco Siniscalchi ◽

Chin-Hui Lee

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Transfer Learning ◽

Automatic Speech Recognition ◽

Deep Neural Networks ◽

Unified Approach

Download Full-text

Speech Recognition Using Neural Networks

International Institute of Engineers (IIE) May 22-23, 2015 Dubai (UAE) ◽

10.15242/iie.e0515043 ◽

2015 ◽

Keyword(s):

Neural Networks ◽

Speech Recognition

Download Full-text

Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home

10.21437/interspeech.2017-1510 ◽

2017 ◽

Author(s):

Chanwoo Kim ◽

Ananya Misra ◽

Kean Chin ◽

Thad Hughes ◽

Arun Narayanan ◽

...

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Large Scale ◽

Deep Neural Networks ◽

Download Full-text

Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition

10.21437/interspeech.2016-515 ◽

2016 ◽

Author(s):

Wei-Ning Hsu ◽

Yu Zhang ◽

Ann Lee ◽

James Glass

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks

Download Full-text

Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks

10.21437/interspeech.2019-2629 ◽

2019 ◽

Author(s):

Muhammad Umar Farooq ◽

Farah Adeeba ◽

Sahar Rauf ◽

Sarmad Hussain

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Recognition System ◽

Speech Recognition System ◽

Large Vocabulary

Download Full-text

Deep bidirectional neural networks for robust speech recognition under heavy background noise

Materials Today Proceedings ◽

10.1016/j.matpr.2021.02.640 ◽

2021 ◽

Author(s):

Jeevan Reddy Koya ◽

S.P. Venu Madhava Rao

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Background Noise ◽

Robust Speech Recognition

Download Full-text

Interpretation of Swedish Sign Language Using Convolutional Neural Networks and Transfer Learning

SN Computer Science ◽

10.1007/s42979-021-00612-w ◽

2021 ◽

Vol 2 (3) ◽

Author(s):

Gustaf Halvardsson ◽

Johanna Peterson ◽

César Soto-Valero ◽

Benoit Baudry

Keyword(s):

Neural Networks ◽

Sign Language ◽

Transfer Learning ◽

Convolutional Neural Networks ◽

Web Application ◽

Training Dataset ◽

Motion Processing ◽

Image Perception ◽

Sign Languages ◽

AbstractThe automatic interpretation of sign languages is a challenging task, as it requires the usage of high-level vision and high-level motion processing systems for providing accurate image perception. In this paper, we use Convolutional Neural Networks (CNNs) and transfer learning to make computers able to interpret signs of the Swedish Sign Language (SSL) hand alphabet. Our model consists of the implementation of a pre-trained InceptionV3 network, and the usage of the mini-batch gradient descent optimization algorithm. We rely on transfer learning during the pre-training of the model and its data. The final accuracy of the model, based on 8 study subjects and 9400 images, is 85%. Our results indicate that the usage of CNNs is a promising approach to interpret sign languages, and transfer learning can be used to achieve high testing accuracy despite using a small training dataset. Furthermore, we describe the implementation details of our model to interpret signs as a user-friendly web application.

Download Full-text

Improving Semi-Supervised Learning for Audio Classification with FixMatch

Electronics ◽

10.3390/electronics10151807 ◽

2021 ◽

Vol 10 (15) ◽

pp. 1807

Author(s):

Sascha Grollmisch ◽

Estefanía Cano

Keyword(s):

Neural Networks ◽

Supervised Learning ◽

Transfer Learning ◽

Data Transfer ◽

State Of The Art ◽

Training Data ◽

Audio Classification ◽

Image Domain ◽

Full Dataset ◽

Including unlabeled data in the training process of neural networks using Semi-Supervised Learning (SSL) has shown impressive results in the image domain, where state-of-the-art results were obtained with only a fraction of the labeled data. The commonality between recent SSL methods is that they strongly rely on the augmentation of unannotated data. This is vastly unexplored for audio data. In this work, SSL using the state-of-the-art FixMatch approach is evaluated on three audio classification tasks, including music, industrial sounds, and acoustic scenes. The performance of FixMatch is compared to Convolutional Neural Networks (CNN) trained from scratch, Transfer Learning, and SSL using the Mean Teacher approach. Additionally, a simple yet effective approach for selecting suitable augmentation methods for FixMatch is introduced. FixMatch with the proposed modifications always outperformed Mean Teacher and the CNNs trained from scratch. For the industrial sounds and music datasets, the CNN baseline performance using the full dataset was reached with less than 5% of the initial training data, demonstrating the potential of recent SSL methods for audio data. Transfer Learning outperformed FixMatch only for the most challenging dataset from acoustic scene classification, showing that there is still room for improvement.

Download Full-text

Speech Recognition and Machine Translation Using Neural Networks

2021 International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM) ◽

10.1109/icieam51226.2021.9446474 ◽

2021 ◽

Author(s):

R. F. Gibadullin ◽

M. Yu. Perukhin ◽

A V Ilin

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Machine Translation

Download Full-text

Algorithm Selection Framework for Legalization Using Deep Convolutional Neural Networks and Transfer Learning

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems ◽

10.1109/tcad.2021.3079126 ◽

2021 ◽

pp. 1-1

Author(s):

Renan Netto ◽

Sheiny Fabre ◽

Tiago Augusto Fontana ◽

Vinicius Livramento ◽

Laercio L. Pilla ◽

...

Keyword(s):

Neural Networks ◽

Transfer Learning ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks ◽

Algorithm Selection ◽

Selection Framework

Download Full-text