scholarly journals Automatic speech recognition errors detection using supervised learning techniques

Author(s):  
Rahhal Errattahi ◽  
Asmaa El Hannani ◽  
Hassan Ouahmane ◽  
Thomas Hain
2021 ◽  
Vol 11 (19) ◽  
pp. 8872
Author(s):  
Iván G. Torre ◽  
Mónica Romero ◽  
Aitor Álvarez

Automatic speech recognition in patients with aphasia is a challenging task for which studies have been published in a few languages. Reasonably, the systems reported in the literature within this field show significantly lower performance than those focused on transcribing non-pathological clean speech. It is mainly due to the difficulty of recognizing a more unintelligible voice, as well as due to the scarcity of annotated aphasic data. This work is mainly focused on applying novel semi-supervised learning methods to the AphasiaBank dataset in order to deal with these two major issues, reporting improvements for the English language and providing the first benchmark for the Spanish language for which less than one hour of transcribed aphasic speech was used for training. In addition, the influence of reinforcing the training and decoding processes with out-of-domain acoustic and text data is described by using different strategies and configurations to fine-tune the hyperparameters and the final recognition systems. The interesting results obtained encourage extending this technological approach to other languages and scenarios where the scarcity of annotated data to train recognition models is a challenging reality.


2020 ◽  
Vol 8 (6) ◽  
pp. 1667-1671

Speech is the most proficient method of correspondence between people groups. Discourse acknowledgment is an interdisciplinary subfield of computational phonetics that creates approaches and advances that empowers the acknowledgment and interpretation of communicated in language into content by PCs. It is otherwise called programmed discourse acknowledgment (ASR), PC discourse acknowledgment or discourse to content (STT). It consolidates information and research in the etymology, software engineering, and electrical building fields. This, being the best methodology of correspondence, could likewise be a helpful interface to speak with machines. Machine learning consists of supervised and unsupervised learning among which supervised learning is used for the speech recognition objectives. Supervised learning is that the data processing task of inferring a perform from labeled coaching information. Speech recognition is the current trend that has gained focus over the decades. Most automation technologies use speech and speech recognition for various perspectives. This paper offers a diagram of major innovative point of view and valuation for the fundamental advancement of speech recognitionand offers review method created in each phase of discourse acknowledgment utilizing supervised learning. The project will use ANN to recognize speeches using magnitudes with large datasets.


2021 ◽  
Author(s):  
Tomohiro Tanaka ◽  
Ryo Masumura ◽  
Mana Ihori ◽  
Akihiko Takashima ◽  
Shota Orihashi ◽  
...  

Symmetry ◽  
2019 ◽  
Vol 11 (8) ◽  
pp. 1018 ◽  
Author(s):  
Dong Wang ◽  
Xiaodong Wang ◽  
Shaohe Lv

Automatic speech recognition, especially large vocabulary continuous speech recognition, is an important issue in the field of machine learning. For a long time, the hidden Markov model (HMM)-Gaussian mixed model (GMM) has been the mainstream speech recognition framework. But recently, HMM-deep neural network (DNN) model and the end-to-end model using deep learning has achieved performance beyond HMM-GMM. Both using deep learning techniques,


2018 ◽  
Vol 128 ◽  
pp. 32-37 ◽  
Author(s):  
Rahhal Errattahi ◽  
Asmaa El Hannani ◽  
Hassan Ouahmane

Sign in / Sign up

Export Citation Format

Share Document