A Critical Analysis of Speech Recognition of Tamil and Malay Language Through Artificial Neural Network

Author(s):  
Kingston Pal Thamburaj ◽  
Kartheges Ponniah ◽  
Ilangkumaran Sivanathan ◽  
Muniisvaran Kumar

Human–computer interaction has become part of our day-to-day life, and speech is one of the most natural and comfortable ways of interacting with devices as well as with other human beings. Devices, particularly smartphones, carry multiple sensors such as cameras and microphones. Speech recognition is the process of converting an acoustic signal captured by a smartphone into a set of words. An efficient speech recognition system greatly enhances the interaction between humans and machines by making the latter more receptive to user needs. The recognized words can be applied in many applications, such as command and control, data entry, and document preparation. This research paper highlights speech recognition through an ANN (Artificial Neural Network). A hybrid model is also proposed for audio-visual speech recognition of the Tamil and Malay languages through a SOM (Self-Organizing Map) and an MLP (Multilayer Perceptron). The effectiveness of the different NN (Neural Network) models used in speech recognition will be examined.
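The abstract does not include an implementation of the proposed SOM stage, so the following is only an illustrative sketch in NumPy of how a self-organizing map could quantize speech feature vectors (e.g. MFCC frames) into map-node indices, which a downstream MLP classifier would then consume. The function names `train_som` and `quantize` and all parameter values are assumptions for illustration, not the authors' code.

```python
import numpy as np

def train_som(data, grid=(4, 4), epochs=50, lr0=0.5, sigma0=1.5, seed=0):
    """Train a small self-organizing map on feature vectors (e.g. MFCC frames)."""
    rng = np.random.default_rng(seed)
    n_nodes = grid[0] * grid[1]
    w = rng.normal(size=(n_nodes, data.shape[1]))
    # Grid coordinates of each node, used by the neighbourhood function.
    coords = np.array([(i, j) for i in range(grid[0]) for j in range(grid[1])], float)
    for t in range(epochs):
        lr = lr0 * (1 - t / epochs)                # learning rate decays over epochs
        sigma = sigma0 * (1 - t / epochs) + 0.3    # neighbourhood radius shrinks
        for x in data:
            bmu = np.argmin(((w - x) ** 2).sum(axis=1))     # best-matching unit
            d2 = ((coords - coords[bmu]) ** 2).sum(axis=1)  # grid distance to BMU
            h = np.exp(-d2 / (2 * sigma ** 2))              # neighbourhood weights
            w += lr * h[:, None] * (x - w)                  # pull nodes toward x
    return w

def quantize(w, x):
    """Map a feature vector to the index of its best-matching SOM node."""
    return int(np.argmin(((w - x) ** 2).sum(axis=1)))
```

In a hybrid pipeline of this kind, the sequence of node indices (or node activations) produced by `quantize` would form the compressed input representation fed to the MLP for word classification.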

2017 ◽  
Vol 7 (1) ◽  
pp. 48-57
Author(s):  
Cigdem Bakir

Currently, technological developments are accompanied by a number of associated problems, among which security takes first place. In particular, biometric systems such as authentication constitute a significant fraction of the security problem, because sound recordings connected with various crimes must be analysed for forensic purposes. Authentication systems require the transmission, design, and classification of biometric data in a secure manner. The aim of this study is to realise an automatic voice and speech recognition system using the wavelet transform, taking Turkish sound forms and properties into consideration. Approximately 3740 Turkish voice samples of words and clauses of differing lengths were collected from 25 males and 25 females. The features of these voice samples were extracted using Mel-frequency cepstral coefficients (MFCCs), Mel-frequency discrete wavelet coefficients (MFDWCs), and linear prediction cepstral coefficients (LPCCs). The resulting feature vectors were trained with k-means, an artificial neural network (ANN), and a hybrid model formed by combining k-means clustering with the ANN. In the first phase of this model, k-means partitioned the voice feature vectors into subsets; in the second phase, training and test sets were formed from these sub-clusters. Training on more suitable data obtained by clustering thus increased the accuracy. In the test phase, the owner of a given voice sample was identified by comparison with the trained voice samples. The results and performance of the classification algorithms are also presented in a comparative manner. Keywords: Speech recognition, hybrid model, k-means, artificial neural network (ANN).
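The first phase of the hybrid model described above, partitioning voice feature vectors with k-means before training a classifier on each sub-cluster, can be sketched in NumPy as follows. This is a generic Lloyd's-algorithm sketch, not the study's code; the function name `kmeans`, the 13-dimensional MFCC-like vectors in the usage below, and the iteration settings are assumptions.

```python
import numpy as np

def kmeans(X, k, iters=100, seed=0):
    """Lloyd's k-means: partition feature vectors X into k clusters.

    Returns the (k, d) centroid matrix and a per-sample cluster label array.
    """
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]  # random init
    for _ in range(iters):
        # Squared distance of every sample to every centroid.
        d = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        # Recompute each centroid as the mean of its assigned samples.
        new = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                        else centroids[j] for j in range(k)])
        if np.allclose(new, centroids):
            break
        centroids = new
    return centroids, labels
```

In the paper's scheme, the label array would be used to split the feature vectors into sub-clusters, and separate training and test sets would then be drawn from each sub-cluster before the ANN is trained.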


Author(s):  
Hunny Pahuja ◽  
Priya Ranjan ◽  
Amit Ujlayan ◽  
Ayush Goyal

Introduction: This paper introduces a novel and reliable approach to assist speech-impaired people in communicating effectively in real time. A deep learning technique, the convolutional neural network (CNN), is used as the classifier. With this algorithm, words are recognized from visual speech input alone, disregarding any audible or acoustic property. Methods: The network extracts features from mouth stances and the corresponding images. Non-audible mouth stances are taken as input from a source and then segregated into subsets to obtain the desired output. The complete datum is then arranged to recognize the word as an affricate. Results: The convolutional neural network is one of the most effective algorithms for extracting features, performing classification, and producing the desired output from input images in a speech recognition system. Conclusion: Recognizing syllables in real time from visual mouth-stance input is the main objective of the proposed method. When tested, the accuracy of the data and the quantity of the training sets gave satisfactory output. A small dataset was used as a first step of learning; in future work, a larger dataset can be considered for analysis. Discussion: The network proposed in this paper was tested on different types of data to obtain its precision level. The network identifies syllables but fails when syllables belong to the same set. Higher-end graphics processing units are required to reduce time consumption and increase the efficiency of the network.
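The feature-extraction step a CNN applies to a mouth-stance image, convolution, nonlinearity, and pooling, can be illustrated with a minimal NumPy forward pass. This is a generic single-filter sketch for intuition, not the paper's network: the function names, the toy 8×8 image, and the Sobel-like edge kernel in the usage example are all assumptions.

```python
import numpy as np

def conv2d(img, kernel):
    """Valid 2-D cross-correlation of a grayscale image with one filter."""
    kh, kw = kernel.shape
    H, W = img.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = (img[i:i + kh, j:j + kw] * kernel).sum()
    return out

def relu(x):
    """Zero out negative activations."""
    return np.maximum(x, 0)

def max_pool(x, size=2):
    """Non-overlapping max pooling, downsampling each size×size window."""
    H, W = x.shape
    H2, W2 = H // size, W // size
    return x[:H2 * size, :W2 * size].reshape(H2, size, W2, size).max(axis=(1, 3))

# Toy usage: a vertical-edge filter responding to an 8×8 image with an edge.
img = np.zeros((8, 8))
img[:, 4:] = 1.0
kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], float)
fmap = max_pool(relu(conv2d(img, kx)))  # 6×6 activation map pooled to 3×3
```

A real CNN classifier stacks many such learned filters across several layers and ends in fully connected layers that map the pooled feature maps to syllable or word classes.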

