scholarly journals Improving the Performance of Automatic Speech Recognition Using Blind Source Separation

In real world applications, Speech recognition system have grown due its significance in various online and offline applications such as security, robotic application, speech translator etc. These systems are widely used now-a-days where acquisition of signal is performed using various instruments which causes noise, source mixing and other impurities which affects the performance of speech recognition system. In this work, issue of source mixing in original speech signal is addressed which causes performance degradation. In order to overcome this we propose a new approach which utilizes non-negative matrix factorization modelling. This method utilizes scattering transform by applying wavelet filter bank and pyramid scattering to estimate the source and minimization of unwanted signals. After estimation the signals or sources, source separation algorithm is implemented using optimization process based on the training and testing method. Proposed approach is compared with other existing method by computing performance measurement matrices which shows the better performance

Author(s):  
Lery Sakti Ramba

The purpose of this research is to design home automation system that can be controlled using voice commands. This research was conducted by studying other research related to the topics in this research, discussing with competent parties, designing systems, testing systems, and conducting analyzes based on tests that have been done. In this research voice recognition system was designed using Deep Learning Convolutional Neural Networks (DL-CNN). The CNN model that has been designed will then be trained to recognize several kinds of voice commands. The result of this research is a speech recognition system that can be used to control several electronic devices connected to the system. The speech recognition system in this research has a 100% success rate in room conditions with background intensity of 24dB (silent), 67.67% in room conditions with 42dB background noise intensity, and only 51.67% in room conditions with background intensity noise 52dB (noisy). The percentage of the success of the speech recognition system in this research is strongly influenced by the intensity of background noise in a room. Therefore, to obtain optimal results, the speech recognition system in this research is more suitable for use in rooms with low intensity background noise.


Author(s):  
Shansong Liu ◽  
Shoukang Hu ◽  
Xurong Xie ◽  
Mengzhe Geng ◽  
Mingyu Cui ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document