Speech recognition system using enhanced mel frequency cepstral coefficient with windowing and framing method

Kannada is the regional language of India spoken in Karnataka. This paper presents development of continuous kannada speech recognition system using monophone modelling and triphone modelling using HTK. Mel Frequency Cepstral Coefficient (MFCC) is used as feature extractor, exploits cepstral and perceptual frequency scale leads good recognition accuracy. Hidden Markov Model is used as classifier. In this paper Gaussian mixture splitting is done that captures the variations of the phones. The paper presents performance of continuous Kannada Automatic Speech Recognition (ASR) system with respect to 2, 4,8,16 and 32 Gaussian mixtures with monophone and context dependent tri-phone modelling. The experimental result shows that good recognition accuracy is achieved for context dependent tri-phone modelling than monophone modelling as the number Gaussian mixture is increased.

Download Full-text

Developing Speech Recognition System for Quranic Verse Recitation Learning Software

IJID (International Journal on Informatics for Development) ◽

10.14421/ijid.2012.01203 ◽

2012 ◽

Vol 1 (2) ◽

pp. 14

Author(s):

Budiman Putra ◽

B. Atmaja ◽

D. Prananto

Keyword(s):

Speech Recognition ◽

Learning Process ◽

Gaussian Mixture ◽

Recognition System ◽

Speech Recognition System ◽

Multimedia Software ◽

Experiment Data ◽

Learning Flexibility ◽

Learning Software ◽

Mel Frequency Cepstral Coefficient

Quran as holy book for Muslim consists of many rules which are needed to be considered in reading Quran verse properly. If the recitation does not meet all of those rules, the meaning of Quran verse recited will be different with its origins. Intensive learning is needed to be able to do correct recitation. However, the limitation of teachers and time to study Quran verse recitation together in a class could be an obstacle in Quran recitation learning. In order to minimize the obstacle and to ease the learning process we implement speech recognition techniques based on Mel Frequency Cepstral Coefficient (MFCC) features and Gaussian Mixture Model (GMM) modeling, we have successfully designed and developed Quran verse recitation learning software in prototype stage. This software is interactive multimedia software which has many features for learning flexibility and effectiveness. This paper explains the developing of speech recognition system for Quran learning software which is built with the ability to perform evaluation and correction in Quran recitation. In this paper, the authors present clearly the built and tested prototype of the system based on experiment data.

Download Full-text

Direct Recovery of Clean Speech Using a Hybrid Noise Suppression Algorithm for Robust Speech Recognition System

ISRN Signal Processing ◽

10.5402/2012/306305 ◽

2012 ◽

Vol 2012 ◽

pp. 1-9

Author(s):

Peng Dai ◽

Ing Yann Soon ◽

Rui Tao

Keyword(s):

Speech Recognition ◽

Noise Suppression ◽

Recognition Rate ◽

Nonlinear Function ◽

Recognition System ◽

Speech Recognition System ◽

Direct Solution ◽

Power Domain ◽

Discontinuity Problem ◽

Mel Frequency Cepstral Coefficient

A new log-power domain feature enhancement algorithm named NLPS is developed. It consists of two parts, direct solution of nonlinear system model and log-power subtraction. In contrast to other methods, the proposed algorithm does not need prior speech/noise statistical model. Instead, it works by direct solution of the nonlinear function derived from the speech recognition system. Separate steps are utilized to refine the accuracy of estimated cepstrum by log-power subtraction, which is the second part of the proposed algorithm. The proposed algorithm manages to solve the speech probability distribution function (PDF) discontinuity problem caused by traditional spectral subtraction series algorithms. The effectiveness of the proposed filter is extensively compared using the standard database, AURORA2. The results show that significant improvement can be achieved by incorporating the proposed algorithm. The proposed algorithm reaches a recognition rate of over 86% for noisy speech (average from SNR 0 dB to 20 dB), which means a 48% error reduction over the baseline Mel-frequency Cepstral Coefficient (MFCC) system.

Download Full-text

Development of HMM/Neural Network‐Based Medium‐Vocabulary Isolated‐Word Lithuanian Speech Recognition System

Informatica ◽

10.15388/informatica.2004.073 ◽

2004 ◽

Vol 15 (4) ◽

pp. 465-474 ◽

Cited By ~ 1

Author(s):

Mark Filipovič ◽

Antanas Lipeika

Keyword(s):

Neural Network ◽

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Isolated Word

Download Full-text

Design Of A Voice Controlled Home Automation System Using Deep Learning Convolutional Neural Network (DL-CNN)

Telekontran : Jurnal Ilmiah Telekomunikasi, Kendali dan Elektronika Terapan ◽

10.34010/telekontran.v8i1.3078 ◽

2020 ◽

Vol 8 (1) ◽

pp. 57-73

Author(s):

Lery Sakti Ramba

Keyword(s):

Deep Learning ◽

Speech Recognition ◽

Background Noise ◽

Electronic Devices ◽

Recognition System ◽

Background Intensity ◽

Automation System ◽

Home Automation ◽

Speech Recognition System ◽

Home Automation System

The purpose of this research is to design home automation system that can be controlled using voice commands. This research was conducted by studying other research related to the topics in this research, discussing with competent parties, designing systems, testing systems, and conducting analyzes based on tests that have been done. In this research voice recognition system was designed using Deep Learning Convolutional Neural Networks (DL-CNN). The CNN model that has been designed will then be trained to recognize several kinds of voice commands. The result of this research is a speech recognition system that can be used to control several electronic devices connected to the system. The speech recognition system in this research has a 100% success rate in room conditions with background intensity of 24dB (silent), 67.67% in room conditions with 42dB background noise intensity, and only 51.67% in room conditions with background intensity noise 52dB (noisy). The percentage of the success of the speech recognition system in this research is strongly influenced by the intensity of background noise in a room. Therefore, to obtain optimal results, the speech recognition system in this research is more suitable for use in rooms with low intensity background noise.

Download Full-text

The ZTSpeech system for CHiME-5 Challenge: A far-field speech recognition system with front-end and robust back-end

10.21437/chime.2018-13 ◽

2018 ◽

Author(s):

Chenxing Li ◽

Tieqiang Wang

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Far Field ◽

Speech Recognition System ◽

Front End

Download Full-text

Development of Assamese Continuous Speech Recognition System

10.21437/sltu.2018-45 ◽

2018 ◽

Author(s):

Barsha Deka ◽

Nirmala S.R. ◽

Samudravijaya K.

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Continuous Speech ◽

Continuous Speech Recognition

Download Full-text

TDNN-based Multilingual Speech Recognition System for Low Resource Indian Languages

10.21437/interspeech.2018-2117 ◽

2018 ◽

Cited By ~ 7

Author(s):

Noor Fathima ◽

Tanvina Patel ◽

Mahima C ◽

Anuroop Iyengar

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Indian Languages ◽

Low Resource ◽

Multilingual Speech Recognition

Download Full-text

Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks

10.21437/interspeech.2019-2629 ◽

2019 ◽

Cited By ~ 2

Author(s):

Muhammad Umar Farooq ◽

Farah Adeeba ◽

Sahar Rauf ◽

Sarmad Hussain

Keyword(s):

Neural Networks ◽

Speech Recognition ◽

Deep Neural Networks ◽

Recognition System ◽

Speech Recognition System ◽

Large Vocabulary

Download Full-text

Development of Large Vocabulary Speech Recognition System with Keyword Search for Manipuri

10.21437/interspeech.2018-2133 ◽

2018 ◽

Cited By ~ 1

Author(s):

Tanvina Patel ◽

Krishna D N ◽

Noor Fathima ◽

Nisar Shah ◽

Mahima C ◽

...

Keyword(s):

Speech Recognition ◽

Keyword Search ◽

Recognition System ◽

Speech Recognition System ◽

Large Vocabulary ◽

Large Vocabulary Speech Recognition

Download Full-text