The System for Speech Recognition on the Basis of the Neural Network

V. A. Pimenov

doi:10.1615/telecomradeng.v62.i2.40

Performance Analysis and Recognition of Speech using Recurrent Neural Network

Technical Journal ◽

10.3126/tj.v1i1.27596 ◽

2019 ◽

Vol 1 (1) ◽

pp. 87-95

Author(s):

Bishon Lamichanne ◽

Hari K.C.

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Performance Analysis ◽

Recurrent Neural Network ◽

Error Rate ◽

English Language ◽

Daily Lives ◽

Crucial Step ◽

The Neural Network ◽

Positive Results

Speech is one of the most natural ways to communicate between people. It plays an important role in our daily lives. To make machines able to talk with people is a challenging but very useful task. A crucial step is to enable machines to recognize and understand what people are saying. Hence, speech recognition becomes a key technique providing an interface for communication between machines and humans. There has been a long research history on speech recognition. Neural network is known as a technique that has ability to classify nonlinear problem. Today, lots of research are going in the field of speech recognition with the help of the Neural Network. Even though positive results have been obtained from continuous study, research on minimizing the error rate is still gaining lots attention. The English language offers a number of challenges for speech recognition. This paper implements the RNN to analyze and recognize speech from the set of spoken words.

Download Full-text

COMPARISON OF OPTIMIZATION ALGORITHMS OF CONNECTIONIST TEMPORAL CLASSIFIER FOR SPEECH RECOGNITION SYSTEM

Informatyka Automatyka Pomiary w Gospodarce i Ochronie Środowiska ◽

10.35784/iapgos.234 ◽

2019 ◽

Vol 9 (3) ◽

pp. 54-57

Author(s):

Yedilkhan Amirgaliyev ◽

Kuanyshbay Kuanyshbay ◽

Aisultan Shoiynbek

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Short Term Memory ◽

Optimization Algorithms ◽

Recognition System ◽

Model Data ◽

Short Term ◽

Term Memory ◽

The Neural Network ◽

Long Short Term Memory

This paper evaluates and compares the performances of three well-known optimization algorithms (Adagrad, Adam, Momentum) for faster training the neural network of CTC algorithm for speech recognition. For CTC algorithms recurrent neural network has been used, specifically Long-Short-Term memory. LSTM is effective and often used model. Data has been downloaded from VCTK corpus of Edinburgh University. The results of optimization algorithms have been evaluated by the Label error rate and CTC loss.

Download Full-text

Implementasi Speech Recognition Pada Aplikasi E-Prescribing Menggunakan Algoritme Convolutional Neural Network

JURNAL MEDIA INFORMATIKA BUDIDARMA ◽

10.30865/mib.v5i2.2841 ◽

2021 ◽

Vol 5 (2) ◽

pp. 460

Author(s):

Nur Azis ◽

Herwanto Herwanto ◽

Fathurrahman Ramadhani

Keyword(s):

Neural Network ◽

Deep Learning ◽

Speech Recognition ◽

Convolutional Neural Network ◽

Open Source ◽

Basic Method ◽

Neural Network Method ◽

The Neural Network ◽

Network Method

The process of manually prescribing drugs by doctors can cause several problems, including doctors not knowing what drugs are available and it takes time to find out what drugs are available in the pharmacy. Speech recognition is now widely used in various ways, which can help facilitate work. The application of speech recognition can be done in the e-prescribing application with the neural network method using the Convolutional Neural Network (CNN) algorithm, which is the basic method of deep learning. This study aims to facilitate physicians in filling out drug data in e-prescribing applications using speech recognition. The data used in this study were obtained from the open source dataset provided by Google and collected independent datasets. From the results of experiments that have been carried out, the accuracy achieved with 40 epochs and 40 direct impressions with different words is 90%. Where words are successfully recognized 36 words out of 40 words

Download Full-text

Research of Speech Recognition Based on Neural Network

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.539.136 ◽

2014 ◽

Vol 539 ◽

pp. 136-140

Author(s):

Jing Zhai Zhang ◽

Xiang Dong Qiao ◽

Peng Zhou Zhang

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Information Science ◽

Identification Problem ◽

Recognition System ◽

Brain Science ◽

The Neural Network ◽

Intelligence Studies ◽

Modeling Theory ◽

Mathematics And Physics

As a newly cross subject which began in the 1940 s, the neural network plays an important part in human intelligencehuman intelligencehuman intelligence studies, has been a attention and research hotspot in many subjects such as information science, brain science, psychology, mathematics and physics. The neural network has well aabstract categoriesabstract categoriesaaa bstract categories capability, which has been applied to the research and development of speech recognition system, and become an effective tool for resolving the identification problem. This paper mainly analyzes the philosophy and procedures of speech recognition, and modeling theory and characteristics of the neural network, discusses the application of neural network in speech recognition.

Download Full-text

Silent EEG-Speech Recognition Using Convolutional and Recurrent Neural Network with 85% Accuracy of 9 Words Classification

Sensors ◽

10.3390/s21206744 ◽

2021 ◽

Vol 21 (20) ◽

pp. 6744

Author(s):

Darya Vorontsova ◽

Ivan Menshikov ◽

Aleksandr Zubov ◽

Kirill Orlov ◽

Peter Rikunov ◽

...

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Brain Activity ◽

Binary Classification ◽

Input Device ◽

Limited Sample ◽

The Neural Network ◽

Eeg Data ◽

Communication Difficulties ◽

Silent Speech

In this work, we focus on silent speech recognition in electroencephalography (EEG) data of healthy individuals to advance brain–computer interface (BCI) development to include people with neurodegeneration and movement and communication difficulties in society. Our dataset was recorded from 270 healthy subjects during silent speech of eight different Russia words (commands): `forward’, `backward’, `up’, `down’, `help’, `take’, `stop’, and `release’, and one pseudoword. We began by demonstrating that silent word distributions can be very close statistically and that there are words describing directed movements that share similar patterns of brain activity. However, after training one individual, we achieved 85% accuracy performing 9 words (including pseudoword) classification and 88% accuracy on binary classification on average. We show that a smaller dataset collected on one participant allows for building a more accurate classifier for a given subject than a larger dataset collected on a group of people. At the same time, we show that the learning outcomes on a limited sample of EEG-data are transferable to the general population. Thus, we demonstrate the possibility of using selected command-words to create an EEG-based input device for people on whom the neural network classifier has not been trained, which is particularly important for people with disabilities.

Download Full-text

Identifying the neural network for selective attention to visceral sensation using functional magnetic resonance imaging

Gastroenterology ◽

10.1016/s0016-5085(01)80112-8 ◽

2001 ◽

Vol 120 (5) ◽

pp. A23-A23

Author(s):

L GREGORY ◽

L YAGUEZ ◽

C ALTMANN ◽

S WILLIAMS ◽

D THOMPSON ◽

...

Keyword(s):

Neural Network ◽

Magnetic Resonance Imaging ◽

Magnetic Resonance ◽

Functional Magnetic Resonance Imaging ◽

Selective Attention ◽

Visceral Sensation ◽

Functional Magnetic Resonance ◽

Resonance Imaging ◽

The Neural Network

Download Full-text

Neural Network for Automatic Analysis of Motility Data

Methods of Information in Medicine ◽

10.1055/s-0038-1634978 ◽

1994 ◽

Vol 33 (01) ◽

pp. 157-160 ◽

Cited By ~ 2

Author(s):

S. Kruse-Andersen ◽

J. Kolberg ◽

E. Jakobsen

Keyword(s):

Neural Network ◽

Muscular Activity ◽

Recording System ◽

Automatic Analysis ◽

Biologically Relevant ◽

System A ◽

The Neural Network ◽

Valuable Method ◽

Motor Abnormalities ◽

Trained Network

Abstract:Continuous recording of intraluminal pressures for extended periods of time is currently regarded as a valuable method for detection of esophageal motor abnormalities. A subsequent automatic analysis of the resulting motility data relies on strict mathematical criteria for recognition of pressure events. Due to great variation in events, this method often fails to detect biologically relevant pressure variations. We have tried to develop a new concept for recognition of pressure events based on a neural network. Pressures were recorded for over 23 hours in 29 normal volunteers by means of a portable data recording system. A number of pressure events and non-events were selected from 9 recordings and used for training the network. The performance of the trained network was then verified on recordings from the remaining 20 volunteers. The accuracy and sensitivity of the two systems were comparable. However, the neural network recognized pressure peaks clearly generated by muscular activity that had escaped detection by the conventional program. In conclusion, we believe that neu-rocomputing has potential advantages for automatic analysis of gastrointestinal motility data.

Download Full-text

Decision Support for Psychiatric Diagnosis Based on a Simple Questionnaire

Methods of Information in Medicine ◽

10.1055/s-0038-1636858 ◽

1997 ◽

Vol 36 (04/05) ◽

pp. 349-351

Author(s):

H. Mizuta ◽

K. Kawachi ◽

H. Yoshida ◽

K. Iida ◽

Y. Okubo ◽

...

Keyword(s):

Neural Network ◽

Decision Support ◽

Psychiatric Diagnosis ◽

Psychiatric Patients ◽

Bayesian Classifier ◽

Neural Network Classifier ◽

Correct Decision ◽

Neurotic Disorders ◽

The Neural Network

Abstract:This paper compares two classifiers: Pseudo Bayesian and Neural Network for assisting in making diagnoses of psychiatric patients based on a simple yes/no questionnaire which is provided at the outpatient’s first visit to the hospital. The classifiers categorize patients into three most commonly seen ICD classes, i.e. schizophrenic, emotional and neurotic disorders. One hundred completed questionnaires were utilized for constructing and evaluating the classifiers. Average correct decision rates were 73.3% for the Pseudo Bayesian Classifier and 77.3% for the Neural Network classifier. These rates were higher than the rate which an experienced psychiatrist achieved based on the same restricted data as the classifiers utilized. These classifiers may be effectively utilized for assisting psychiatrists in making their final diagnoses.

Download Full-text

Development of HMM/Neural Network‐Based Medium‐Vocabulary Isolated‐Word Lithuanian Speech Recognition System

Informatica ◽

10.15388/informatica.2004.073 ◽

2004 ◽

Vol 15 (4) ◽

pp. 465-474 ◽

Cited By ~ 1

Author(s):

Mark Filipovič ◽

Antanas Lipeika

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Isolated Word

Download Full-text

SCORING MODELING BASED ON NEURAL NETWORKS FOR DETERMINING A BANK BORROWER'S RATING

Economy of Ukraine ◽

10.15407/economyukr.2020.10.054 ◽

2020 ◽

Vol 2020 (10) ◽

pp. 54-62

Author(s):

Oleksii VASYLIEV ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Network Architecture ◽

Statistical Data ◽

Activation Function ◽

Decision Making Process ◽

Neural Network Architecture ◽

Acceptable Accuracy ◽

The Neural Network ◽

Sigmoid Activation Function

The problem of applying neural networks to calculate ratings used in banking in the decision-making process on granting or not granting loans to borrowers is considered. The task is to determine the rating function of the borrower based on a set of statistical data on the effectiveness of loans provided by the bank. When constructing a regression model to calculate the rating function, it is necessary to know its general form. If so, the task is to calculate the parameters that are included in the expression for the rating function. In contrast to this approach, in the case of using neural networks, there is no need to specify the general form for the rating function. Instead, certain neural network architecture is chosen and parameters are calculated for it on the basis of statistical data. Importantly, the same neural network architecture can be used to process different sets of statistical data. The disadvantages of using neural networks include the need to calculate a large number of parameters. There is also no universal algorithm that would determine the optimal neural network architecture. As an example of the use of neural networks to determine the borrower's rating, a model system is considered, in which the borrower's rating is determined by a known non-analytical rating function. A neural network with two inner layers, which contain, respectively, three and two neurons and have a sigmoid activation function, is used for modeling. It is shown that the use of the neural network allows restoring the borrower's rating function with quite acceptable accuracy.

Download Full-text