The System for Speech Recognition on the Basis of the Neural Network

2004 ◽  
Vol 62 (1-6) ◽  
pp. 131-142
Author(s):  
V. A. Pimenov
2019 ◽  
Vol 1 (1) ◽  
pp. 87-95
Author(s):  
Bishon Lamichanne ◽  
Hari K.C.

Speech is one of the most natural ways to communicate between people. It plays an important role in our daily lives. To make machines able to talk with people is a challenging but very useful task. A crucial step is to enable machines to recognize and understand what people are saying. Hence, speech recognition becomes a key technique providing an interface for communication between machines and humans. There has been a long research history on speech recognition. Neural network is known as a technique that has ability to classify nonlinear problem. Today, lots of research are going in the field of speech recognition with the help of the Neural Network. Even though positive results have been obtained from continuous study, research on minimizing the error rate is still gaining lots attention. The English language offers a number of challenges for speech recognition. This paper implements the RNN to analyze and recognize speech from the set of spoken words.


Author(s):  
Yedilkhan Amirgaliyev ◽  
Kuanyshbay Kuanyshbay ◽  
Aisultan Shoiynbek

This paper evaluates and compares the performances of three well-known optimization algorithms (Adagrad, Adam, Momentum) for faster training the neural network of CTC algorithm for speech recognition. For CTC algorithms recurrent neural network has been used, specifically Long-Short-Term memory. LSTM is effective and often used model. Data has been downloaded from VCTK corpus of Edinburgh University. The results of optimization algorithms have been evaluated by the Label error rate and CTC loss.


2021 ◽  
Vol 5 (2) ◽  
pp. 460
Author(s):  
Nur Azis ◽  
Herwanto Herwanto ◽  
Fathurrahman Ramadhani

The process of manually prescribing drugs by doctors can cause several problems, including doctors not knowing what drugs are available and it takes time to find out what drugs are available in the pharmacy. Speech recognition is now widely used in various ways, which can help facilitate work. The application of speech recognition can be done in the e-prescribing application with the neural network method using the Convolutional Neural Network (CNN) algorithm, which is the basic method of deep learning. This study aims to facilitate physicians in filling out drug data in e-prescribing applications using speech recognition. The data used in this study were obtained from the open source dataset provided by Google and collected independent datasets. From the results of experiments that have been carried out, the accuracy achieved with 40 epochs and 40 direct impressions with different words is 90%. Where words are successfully recognized 36 words out of 40 words


2014 ◽  
Vol 539 ◽  
pp. 136-140
Author(s):  
Jing Zhai Zhang ◽  
Xiang Dong Qiao ◽  
Peng Zhou Zhang

As a newly cross subject which began in the 1940 s, the neural network plays an important part in human intelligencehuman intelligencehuman intelligence studies, has been a attention and research hotspot in many subjects such as information science, brain science, psychology, mathematics and physics. The neural network has well aabstract categoriesabstract categoriesaaa bstract categories capability, which has been applied to the research and development of speech recognition system, and become an effective tool for resolving the identification problem. This paper mainly analyzes the philosophy and procedures of speech recognition, and modeling theory and characteristics of the neural network, discusses the application of neural network in speech recognition.


Sensors ◽  
2021 ◽  
Vol 21 (20) ◽  
pp. 6744
Author(s):  
Darya Vorontsova ◽  
Ivan Menshikov ◽  
Aleksandr Zubov ◽  
Kirill Orlov ◽  
Peter Rikunov ◽  
...  

In this work, we focus on silent speech recognition in electroencephalography (EEG) data of healthy individuals to advance brain–computer interface (BCI) development to include people with neurodegeneration and movement and communication difficulties in society. Our dataset was recorded from 270 healthy subjects during silent speech of eight different Russia words (commands): `forward’, `backward’, `up’, `down’, `help’, `take’, `stop’, and `release’, and one pseudoword. We began by demonstrating that silent word distributions can be very close statistically and that there are words describing directed movements that share similar patterns of brain activity. However, after training one individual, we achieved 85% accuracy performing 9 words (including pseudoword) classification and 88% accuracy on binary classification on average. We show that a smaller dataset collected on one participant allows for building a more accurate classifier for a given subject than a larger dataset collected on a group of people. At the same time, we show that the learning outcomes on a limited sample of EEG-data are transferable to the general population. Thus, we demonstrate the possibility of using selected command-words to create an EEG-based input device for people on whom the neural network classifier has not been trained, which is particularly important for people with disabilities.


1994 ◽  
Vol 33 (01) ◽  
pp. 157-160 ◽  
Author(s):  
S. Kruse-Andersen ◽  
J. Kolberg ◽  
E. Jakobsen

Abstract:Continuous recording of intraluminal pressures for extended periods of time is currently regarded as a valuable method for detection of esophageal motor abnormalities. A subsequent automatic analysis of the resulting motility data relies on strict mathematical criteria for recognition of pressure events. Due to great variation in events, this method often fails to detect biologically relevant pressure variations. We have tried to develop a new concept for recognition of pressure events based on a neural network. Pressures were recorded for over 23 hours in 29 normal volunteers by means of a portable data recording system. A number of pressure events and non-events were selected from 9 recordings and used for training the network. The performance of the trained network was then verified on recordings from the remaining 20 volunteers. The accuracy and sensitivity of the two systems were comparable. However, the neural network recognized pressure peaks clearly generated by muscular activity that had escaped detection by the conventional program. In conclusion, we believe that neu-rocomputing has potential advantages for automatic analysis of gastrointestinal motility data.


1997 ◽  
Vol 36 (04/05) ◽  
pp. 349-351
Author(s):  
H. Mizuta ◽  
K. Kawachi ◽  
H. Yoshida ◽  
K. Iida ◽  
Y. Okubo ◽  
...  

Abstract:This paper compares two classifiers: Pseudo Bayesian and Neural Network for assisting in making diagnoses of psychiatric patients based on a simple yes/no questionnaire which is provided at the outpatient’s first visit to the hospital. The classifiers categorize patients into three most commonly seen ICD classes, i.e. schizophrenic, emotional and neurotic disorders. One hundred completed questionnaires were utilized for constructing and evaluating the classifiers. Average correct decision rates were 73.3% for the Pseudo Bayesian Classifier and 77.3% for the Neural Network classifier. These rates were higher than the rate which an experienced psychiatrist achieved based on the same restricted data as the classifiers utilized. These classifiers may be effectively utilized for assisting psychiatrists in making their final diagnoses.


2020 ◽  
Vol 2020 (10) ◽  
pp. 54-62
Author(s):  
Oleksii VASYLIEV ◽  

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.


Sign in / Sign up

Export Citation Format

Share Document