An End-to-End Dialect Identification System with Transfer Learning from a Multilingual Automatic Speech Recognition Model

Cross-Language Transfer Learning and Domain Adaptation for End-to-End Automatic Speech Recognition

2021 IEEE International Conference on Multimedia and Expo (ICME) ◽

10.1109/icme51207.2021.9428334 ◽

2021 ◽

Author(s):

Jian Luo ◽

Jianzong Wang ◽

Ning Cheng ◽

Edward Xiao ◽

Jing Xiao ◽

...

Keyword(s):

Speech Recognition ◽

Transfer Learning ◽

Automatic Speech Recognition ◽

Domain Adaptation ◽

Language Transfer ◽

End To End ◽

Cross Language

Automatic Speech Recognition for Indian Accent Lectures contents using End-to-End Speech Recognition model

10.4108/eai.7-12-2021.2314531 ◽

2021 ◽

Author(s):

Ashok Kumar L ◽

Karthika Renuka D ◽

Raajkumar G

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Recognition Model ◽

End To End

Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition

10.21437/interspeech.2020-1930 ◽

2020 ◽

Author(s):

Ryo Masumura ◽

Naoki Makishima ◽

Mana Ihori ◽

Akihiko Takashima ◽

Tomohiro Tanaka ◽

...

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Large Scale ◽

End To End

Low-Complexity DNN-Based End-to-End Automatic Speech Recognition using Low-Rank Approximation

2020 International SoC Design Conference (ISOCC) ◽

10.1109/isocc50952.2020.9332970 ◽

2020 ◽

Author(s):

Jongmin Park ◽

Youngjoo Lee

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Low Complexity ◽

Low Rank ◽

Low Rank Approximation ◽

Rank Approximation ◽

End To End

Improving Mispronunciation Detection of Mandarin for Tibetan Students Based on the End-To-End Speech Recognition Model

10.1109/isaiam53259.2021.00039 ◽

2021 ◽

Author(s):

Zhenye Gan ◽

Xin Zhao ◽

Shihua Zhou ◽

Rui Wang

Keyword(s):

Speech Recognition ◽

Recognition Model ◽

End To End

Bridging automatic speech recognition and psycholinguistics: Extending Shortlist to an end-to-end model of human speech recognition (L)

The Journal of the Acoustical Society of America ◽

10.1121/1.1624065 ◽

2003 ◽

Vol 114 (6) ◽

pp. 3032-3035 ◽

Cited By ~ 9

Author(s):

Odette Scharenborg ◽

Louis ten Bosch ◽

Lou Boves ◽

Dennis Norris

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Human Speech ◽

End To End

Combining De-noising Auto-encoder and Recurrent Neural Networks in End-to-End Automatic Speech Recognition for Noise Robustness

2018 IEEE Spoken Language Technology Workshop (SLT) ◽

10.1109/slt.2018.8639597 ◽

2018 ◽

Author(s):

Tzu-Hsuan Ting ◽

Chia-Ping Chen

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Recurrent Neural Networks ◽

Noise Robustness ◽

End To End

Cross-Language End-to-End Speech Recognition Research Based on Transfer Learning for the Low-Resource Tujia Language

Symmetry ◽

10.3390/sym11020179 ◽

2019 ◽

Vol 11 (2) ◽

pp. 179 ◽

Cited By ~ 4

Author(s):

Chongchong Yu ◽

Yunbing Chen ◽

Yueqiao Li ◽

Meng Kang ◽

Shixuan Xu ◽

...

Keyword(s):

Speech Recognition ◽

Transfer Learning ◽

Short Term Memory ◽

Recognition System ◽

Language Recognition ◽

Low Resource ◽

End To End ◽

The Cross ◽

Hidden Layer ◽

Cross Language

To help rescue and preserve an endangered language, this paper studies an end-to-end speech recognition model based on sample transfer learning for the low-resource Tujia language. At the level of International Phonetic Alphabet (IPA) labels, a Chinese corpus is used to extend the Tujia data, which addresses the insufficient size of the Tujia corpus and yields a cross-language corpus and an IPA dictionary unified across Chinese and Tujia. A convolutional neural network (CNN) and a bi-directional long short-term memory (BiLSTM) network extract cross-language acoustic features and train shared hidden-layer weights for the Tujia and Chinese phonetic corpora. Automatic speech recognition for Tujia is realized with an end-to-end method that consists of symmetric encoding and decoding, and transfer learning is used to build the cross-language end-to-end Tujia recognition system. In the experiments, the proposed model reaches a recognition error rate of 46.19%, which is 2.11% lower than that of the model trained only on Tujia data. The authors conclude that the approach is feasible and effective.
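
The abstract reports a recognition error rate but does not state how it was computed. As a rough illustration only, the sketch below assumes the common edit-distance definition: substitutions, deletions, and insertions between the reference and predicted label sequences, divided by the number of reference labels. The function names `edit_distance` and `error_rate` and the toy label sequences are hypothetical and do not come from the paper.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two label sequences."""
    # prev holds the previous row of the dynamic-programming table:
    # prev[j] is the distance between ref[:i-1] and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning ref[:i] into an empty hypothesis needs i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(refs: Sequence[Sequence[str]],
               hyps: Sequence[Sequence[str]]) -> float:
    """Corpus-level label error rate in percent (assumed definition)."""
    total_errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total_labels = sum(len(r) for r in refs)
    if total_labels == 0:
        raise ValueError("reference sequences contain no labels")
    return 100.0 * total_errors / total_labels


if __name__ == "__main__":
    # Toy placeholder labels, not real Tujia or Chinese IPA transcriptions.
    refs = [["a", "b", "c", "d"]]
    hyps = [["a", "x", "c"]]  # one substitution (b -> x) and one deletion (d)
    print(error_rate(refs, hyps))  # 2 errors over 4 reference labels
```

Under this assumed definition the rate can exceed 100% when the hypothesis contains many insertions, so it is an error measure rather than a strict percentage of wrong labels.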

Speech Assistance for Persons With Speech Impediments Using Artificial Neural Networks

Volume 3: Biomedical and Biotechnology Engineering ◽

10.1115/imece2017-71027 ◽

2017 ◽

Author(s):

Ramy Mounir ◽

Redwan Alqasemi ◽

Rajiv Dubey

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Deep Learning ◽

Speech Recognition ◽

Automatic Speech Recognition ◽

Challenging Problem ◽

Speech Impairment ◽

Recognition Model ◽

Wide Range ◽

Speech Variability

This work focuses on enabling individuals with speech impairments to use speech-to-text software to recognize and dictate their speech. Automatic speech recognition (ASR) is a challenging problem because of the wide range of speech variability, including differences in accent, pronunciation, speed, and volume. Training an end-to-end speech recognition model on speech from people with speech impediments is very difficult, both because sufficiently large datasets are lacking and because a speech disorder pattern is hard to generalize across all users with speech impediments. This work highlights the deep learning techniques used for ASR and how they can be modified to recognize and dictate speech from individuals with speech impediments.

Fast offline transformer‐based end‐to‐end automatic speech recognition for real‐world applications

ETRI Journal ◽

10.4218/etrij.2021-0106 ◽

2021 ◽

Author(s):

Yoo Rhee Oh ◽

Kiyoung Park ◽

Jeon Gue Park

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Real World ◽

Real World Applications ◽

End To End
