Artificial Neural Network based Automatic Speech Recognition Engine for Voice Controlled Micro Air Vehicles

Author(s):  
Sushma. M. Gowda ◽  
D.K Rahul ◽  
Anush Anand ◽  
S. Veena ◽  
Vinod B Durdi
2021 ◽  
pp. 83-92
Author(s):  
Shoeb Hussain ◽  
Ronaq Nazir ◽  
Urooj Javeed ◽  
Shoaib Khan ◽  
Rumaisa Sofi

2017 ◽  
Vol 7 (1) ◽  
pp. 48-57
Author(s):  
Cigdem Bakir

Currently, technological developments are accompanied by a number of associated problems. Security takes the first place amongst such problems. In particular, biometric systems such as authentication constitute a significant fraction of the security problem. This is because sound recordings having connection with various crimes are required to be analysed for forensic purposes. Authentication systems necessitate transmission, design and classification of biometric data in a secure manner. The aim of this study is to actualise an automatic voice and speech recognition system using wavelet transform, taking Turkish sound forms and properties into consideration. Approximately 3740 Turkish voice samples of words and clauses of differing lengths were collected from 25 males and 25 females. The features of these voice samples were obtained using Mel-frequency cepstral coefficients (MFCCs), Mel-frequency discrete wavelet coefficients (MFDWCs) and linear prediction cepstral coefficient (LPCC). Feature vectors of the voice samples obtained were trained with k-means, artificial neural network (ANN) and hybrid model. The hybrid model was formed by combining with k-means clustering and ANN. In the first phase of this model, k-means performed subsets obtained with voice feature vectors. In the second phase, a set of training and tests were formed from these sub-clusters. Thus, for being trained more suitable data by clustering increased the accuracy. In the test phase, the owner of a given voice sample was identified by taking the trained voice samples into consideration. The results and performance of the algorithms used for classification are also demonstrated in a comparative manner. Keywords: Speech recognition, hybrid model, k-means, artificial neural network (ANN).


Author(s):  
Lam D. Pham ◽  
Hieu M. Nguyen ◽  
Du N. N. T. Nguyen ◽  
Trang Hoang

Artificial Neural Network (ANN) is promoted to one of major schemes applied in pattern recognition area. Indeed, many approaches to software-based platforms have proven great performance of ANN. However, developing pattern recognition systems integrating ANN hardware-based architecture has been limited not only by the silicon requirements such as frequency, area, power, or resource but also by high accuracy and real-time applications strictly. Although a considerable number of ANN hardware-based architectures have been proposed currently, they have experienced a deprivation of functions due to both small configurations and ability of reconfiguration. Consequently, achieving an effective ANN hardware-based architecture so as to adapt to not only strict accuracy, enormous configures, or silicon area but also real-time criterion in pattern recognition systems has been really challenged. To tackle these issues, this work has proposed a dynamic structure of three-layer ANN architecture being able to reconfigure for adapting to various real-time applications. What is more, a complete SOPC system integrating proposed ANN hardware has also implemented to apply Vietnamese speech recognition automatically to confirm high recognition probability around 95.2 % towards 20 Vietnamese discrete words. Moreover, experiment results on such ASIC-based architecture have witnessed maximum frequency at 250 MHz on 130nm technology as well as great ability of reconfiguration.


Sign in / Sign up

Export Citation Format

Share Document