Experiments With Fast Fourier Transform, Linear Predictive and Cepstral Coefficients in Dysarthric Speech Recognition Algorithms Using Hidden Markov Model

Abstract - The main lesson in learning to read the Qur'an is recognizing and reciting the Hijaiyah letters. Incorrect pronunciation can change the literal meaning of what is read. Speech recognition technology can be used to check for mistakes in pronouncing Hijaiyah letters by recognizing the voice or speech and converting it into data that the system can process. The purpose of this study is to implement speech recognition with a Hidden Markov Model for Hijaiyah letter pronunciation when learning to read the Qur'an. Speech recognition and Hidden Markov Models were used to develop a voice-based machine interface system. The study uses the Fast Fourier Transform (FFT) to extract features and a Hidden Markov Model (HMM) in the training process, which produces distinctive characteristics for each Hijaiyah letter. Euclidean Distance (ED) is then used for the final classification when detecting Hijaiyah letter pronunciation. The results show an accuracy of 100% when testing on the same Hijaiyah letters and 54.6% when testing on different letters. The study is thus intended to help students who are learning to read the Qur'an to recognize and recite the Hijaiyah letters.

Keywords - Speech Recognition, Hidden Markov Model, Recognizing, Reciting, Letter Hijaiyah
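The feature extraction and final classification steps named in the abstract (FFT features, then Euclidean Distance against per-letter characteristics) can be illustrated with a minimal sketch. This is not the authors' implementation: the HMM training stage is omitted entirely, the "utterances" are synthetic sine tones, and the labels `letter_a` and `letter_b` as well as all function names are hypothetical placeholders chosen for illustration.

```python
import cmath
import math


def fft(samples):
    """Recursive radix-2 Cooley-Tukey FFT; the input length must be a power of two."""
    n = len(samples)
    if n == 1:
        return [complex(samples[0])]
    if n % 2:
        raise ValueError("length must be a power of two")
    even = fft(samples[0::2])
    odd = fft(samples[1::2])
    half = n // 2
    out = [0j] * n
    for k in range(half):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + half] = even[k] - twiddle
    return out


def fft_features(samples):
    """Magnitude spectrum of bins 0..n/2, normalised by the frame length."""
    n = len(samples)
    return [abs(c) / n for c in fft(samples)[: n // 2 + 1]]


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify(features, templates):
    """Return the label whose template is closest to the given features."""
    return min(templates, key=lambda label: euclidean_distance(features, templates[label]))


def tone(freq_bin, n=64):
    """Synthetic stand-in for a recorded utterance: a sine centred on one FFT bin."""
    return [math.sin(2 * math.pi * freq_bin * t / n) for t in range(n)]


# Hypothetical reference templates (placeholder labels, synthetic signals).
templates = {
    "letter_a": fft_features(tone(4)),
    "letter_b": fft_features(tone(9)),
}

# A quieter copy of the first tone with a small DC offset, used as the test input.
probe = [0.9 * s + 0.05 for s in tone(4)]
predicted = classify(fft_features(probe), templates)
print(predicted)  # prints: letter_a
```

In a real system the templates would come from trained models of recorded speech rather than single synthetic frames, and features would be computed over many short overlapping frames instead of one.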

Dysarthric Speech Recognition Using Kullback-Leibler Divergence-Based Hidden Markov Model

10.21437/interspeech.2016-776 ◽ 2016 ◽ Cited By ~ 5

Author(s): Myungjong Kim ◽ Jun Wang ◽ Hoirin Kim

Keyword(s): Speech Recognition ◽ Markov Model ◽ Hidden Markov Model ◽ Hidden Markov ◽ Leibler Divergence ◽ Dysarthric Speech

Implementation of Android Based Speech Recognition for Indonesian Geography Dictionary

Jurnal ULTIMA Computing ◽ 10.31937/sk.v7i2.296 ◽ 2016 ◽ Vol 7 (2) ◽ pp. 76-82

Author(s): Hugeng Hugeng ◽ Edbert Hansel

Keyword(s): Speech Recognition ◽ Markov Model ◽ Hidden Markov Model ◽ Hidden Markov ◽ Spoken Word ◽ Silent Condition ◽ Word Detection ◽ Index Terms ◽ To Receive ◽ Noisy Condition

We have built a speech recognition application for an Indonesian geography dictionary on the Android operating system, named GAIA. The application uses a smartphone to receive input in the form of a word spoken by the user. Recognition is based on the Hidden Markov Model approach provided by the Pocketsphinx library, and the phonemes used follow Indonesian phoneme rules. An advantage of the application is that it can be used without internet access. In testing, word detection was performed under four conditions to determine the level of accuracy: near silent, near noisy, far silent, and far noisy. The testing and analysis show that GAIA can be built as a speech recognition application on Android for an Indonesian geography dictionary. Average word recognition accuracy reaches 52.87% in the near silent condition, 14.5% in the near noisy condition, 23.2% in the far silent condition, and 2.8% in the far noisy condition. Index Terms: speech recognition, Indonesian geography dictionary, Hidden Markov Model, Pocketsphinx, Android.
