IMPLEMENTASI METODE PATTERN RECOGNITION UNTUK PENGENALAN UCAPAN HURUF HIJAIYYAH

[Id] Pattern recognition memiliki kemampuan untuk mengenali suara dengan melakukan pengenalan pola suara melalui fitur-fitur sinyal suara yang kemudian dilakukan pengenalan pola melalui perbandingan pola suara uji dengan suara referensi. Untuk mendapatkan fitur-fitur sinyal suara, diperlukan metode untuk mengekstraksi sinyal suara sehingga fitur-fitur sinyal suara yang dibutuhkan terpenuhi. MFCC (Mel Frequency Cepstral Coefficients) merupakan alternatif metode untuk melakukan ektraksi sinyal yang menghasilkan koefisien cepstral dari sinyal suara. Koefisien cepstral sinyal suara dari hasil ektraksi tersebut, kemudian dilakukan perbandingan kesesuaian antara suara uji dan suara referensi. DTW (Dynamic Time Warping) salah satu algoritma untuk dapat melakukan perbandingan koefisien tersebut. Dalam kasus pegenalan ucapan huruf hijaiyyah umumnya dilakukan secara talaqqi (belajar intensif) antar seorang guru dengan murid, penilaian yang dilakukan bersifat subjektif berdasarkan kemampuan indera dari seorang guru, untuk itu aplikasi pengucapan huruf hijaiyyah merupakan salah satu alternatif untuk mengenali dan menguji kesesuaian ucapan secara objektif melalui penghitungan matematis dengan melakukan pengenalan pola suara. Dari pengujian yang telah dilakukan, dari 6 orang yang diuji melakukan pengucapan 29 huruf 3 tanda baca dan pengulang sebanyak 5 kali menghasilkan persentase kecocokan suara mencapai di atas 90 %, nilai threshold 1,3 Kata kunci: Speech Recognition, Pattern Recognition, MFCC, DTW, Hijaiyyah [En] Pattern recognition has ability to recognize voice by voice pattern recognition through voice signal features which then carried out voice pattern recognition through comparison of tester voice pattern with a reference voice. To get the sound signal features, it needs a method for extracting sound signal so that required sound signals features are fulfilled. MFCC is an alternative method to perform signal extraction which is produce cepstral coefficients of the sound signal. Cepstral coefficients of sound signal from the extraction then will be compared by the match between tester voice and reference voice. DTW is one of algorithm to do a comparison of the coefficients. In the case of introducing hijaiyyah generally talaqqi (intensively) conducted between a teacher and students, the appraisal is subjective based on the sensory capabilities of the teacher, therefore hijaiyyah pronunciation application is an alternative to identify and test the suitability of speech objectively through mathematical calculations by performing voice pattern recognition. From the testing that has been done, from 6 people tested do pronunciations 29 letters and punctuation repeater 3 to 5 times the yield percentage matches the sound reaches above 90%, a threshold value of 1.3.

Download Full-text

IMPLEMENTASI TEKNIK DYNAMIC TIME WARPING (DTW) PADA APLIKASI SPEECH TO TEXT

JURNAL TEKNIK INFORMATIKA ◽

10.15408/jti.v10i1.6816 ◽

2018 ◽

Vol 10 (1) ◽

pp. 49-58

Author(s):

Candra Dinata ◽

Diyah Puspitaningrum ◽

Ernawati Erna

Keyword(s):

Dynamic Time Warping ◽

Test Word ◽

Simulation Software ◽

Sound Signal ◽

Human Beings ◽

Time Warping ◽

Mel Frequency Cepstral Coefficients ◽

Sound Processing ◽

Cepstral Coefficients ◽

Dynamic Time

ABSTRAK Suara/ucapan adalah salah satu cara kita sebagai manusia untuk berkomunikasi dan mengekspresikan diri. Speech to text (ucapan ke text), merupakan salah satu bidang sains computer yaitu bidang pengolahan suara. Speech to text (STT) adalah penerjemahan kalimat (kata yang diucapkan) ke dalam text. STT merupakan proses pengolahan suatu sinyal suara, mengekstrak fitur dari sinyal suara tersebut yang selanjutkan dibandingkan dengan hasil ekstraksi dari sinyal suara yang lain untuk dapat dikenali persamaannya. Penelitian ini merancang dan membangun suatu program aplikasi Speech to Text yang mampu identifikasi suatu sinyal suara menggunakan perangkat lunak simulasi MATLAB R2016a. Terdapat dua proses umum pada bidang pengolahan suara, yaitu ekstraksi fitur dan pencocokan fitur. Pada sistem ini metode mel-frequency cepstral coefficients digunakan untuk mengekstraksi fitur dan metode dynamic time warping digunakan untuk pencocokan fitur. Metode DTW yang digunakan dapat menghitung jarak atau selisih antara dua data yang dibandingkan. Rata-rata akurasi yang didapat setelah dilakukan percobaan pada pengujian kata adalah 95.85% dan pada pengujian kalimat adalah 94%. ABSTRACT Voice / speech is one of the ways we as human beings to communicate and express themselves. Speech to text (STT), is one of computer science is the field of sound processing. Speech to text (STT) is the translation of the sentence (the spoken word) in the text. STT is a voice signal processing, extracting features from the speech signal and then compared it with the extraction of the other sound signal to recognize the signal similarities. This research design and build an application program Speech to Text that is capable of identifying a sound signal using simulation software MATLAB R2016a. There are two common processes in the field of sound processing, feature extraction and matching features. In this system, the method mel-frequency cepstral coefficients are used to extract features and dynamic time warping method used for matching features. DTW method used can calculate the distance or the difference between the two data being compared. The average accuracy is obtained after experiments on the test word was 95.85% and the testing of the sentence is 94%. How to Cite : Dinata, C. Puspitaningrum, D. Erna, E. (2017). IMPLEMENTASI TEKNIK DYNAMIC TIME WARPING (DTW) PADA APLIKASI SPEECH TO TEXT. Jurnal Teknik Informatika, 10(1), 49-58. doi:10.15408/jti.v10i1.6816 Permalink/DOI: http://dx.doi.org/10.15408/jti.v10i1.6816

Download Full-text

Voice Detection with Noise Reduction Using Dynamic Time Warping and Mel-Frequency Cepstral Coefficients Algorithm Applied to Home Automation

International Journal of Simulation Systems Science & Technology ◽

10.5013/ijssst.a.20.s2.36 ◽

2019 ◽

Author(s):

Anna Liza A Ramos ◽

Michael John Alzona ◽

Ronaldo Legasto ◽

Franz Aerol Salinog ◽

Karlo Lao ◽

...

Keyword(s):

Noise Reduction ◽

Dynamic Time Warping ◽

Home Automation ◽

Time Warping ◽

Mel Frequency Cepstral Coefficients ◽

Voice Detection ◽

Cepstral Coefficients ◽

Dynamic Time

Download Full-text

Cough Recognition Based on Mel Frequency Cepstral Coefficients and Dynamic Time Warping

Mechanical Engineering and Control Systems ◽

10.1142/9789814740616_0071 ◽

2016 ◽

Author(s):

Chunmei ZHU ◽

Baojun LIU ◽

Ping LI

Keyword(s):

Dynamic Time Warping ◽

Time Warping ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients ◽

Dynamic Time

Download Full-text

Voice Recognition using Dynamic Time Warping and Mel-Frequency Cepstral Coefficients Algorithms

International Journal of Computer Applications ◽

10.5120/20312-2362 ◽

2015 ◽

Vol 116 (2) ◽

pp. 34-41 ◽

Cited By ~ 7

Author(s):

Abdelmajid H.Mansour ◽

Gafar Zen Alabdeen Salh ◽

Khalid A. Mohammed

Keyword(s):

Dynamic Time Warping ◽

Voice Recognition ◽

Time Warping ◽

Mel Frequency Cepstral Coefficients ◽

Cepstral Coefficients ◽

Dynamic Time

Download Full-text

Pattern recognition based on dynamic time warping and classification using adaptive rank-order morphological transform

10.14257/astl.2014.58.14 ◽

2014 ◽

Author(s):

Han Li ◽

Hailong Zhu

Keyword(s):

Pattern Recognition ◽

Dynamic Time Warping ◽

Rank Order ◽

Time Warping ◽

Dynamic Time

Download Full-text

Reducing the effect of wrist variation on pattern recognition of Myoelectric Hand Prostheses Control through Dynamic Time Warping

Biomedical Signal Processing and Control ◽

10.1016/j.bspc.2019.101626 ◽

2020 ◽

Vol 55 ◽

pp. 101626 ◽

Cited By ~ 2

Author(s):

Omkar S Powar ◽

Krishnan Chemmangat

Keyword(s):

Pattern Recognition ◽

Dynamic Time Warping ◽

Time Warping ◽

Dynamic Time

Download Full-text

Trading Strategies based on Pattern Recognition in Stock Futures Market using Dynamic Time Warping Algorithm

Journal of Convergence Information Technology ◽

10.4156/jcit.vol7.issue10.22 ◽

2012 ◽

Vol 7 (10) ◽

pp. 185-196 ◽

Cited By ~ 2

Author(s):

Lee ◽

Suk Jun ◽

Jeong ◽

Suk Jae

Keyword(s):

Pattern Recognition ◽

Dynamic Time Warping ◽

Futures Market ◽

Trading Strategies ◽

Time Warping ◽

Dynamic Time

Download Full-text

Dynamic Time Warping Application for Financial Pattern Recognition

SSRN Electronic Journal ◽

10.2139/ssrn.3658339 ◽

2020 ◽

Author(s):

Yangqi Li ◽

Taihua Hu

Keyword(s):

Pattern Recognition ◽

Dynamic Time Warping ◽

Time Warping ◽

Dynamic Time

Download Full-text

Inversion of speech by non-linear transformation of temporary

Health Promotion & Physical Activity ◽

10.5604/01.3001.0010.7714 ◽

2016 ◽

Vol 1 (1) ◽

pp. 139-150

Author(s):

Robert Wielgat ◽

Anita Lorenc

Keyword(s):

Dynamic Time Warping ◽

Mean Square ◽

Time Warping ◽

Mel Frequency Cepstral Coefficients ◽

Precise Method ◽

Electromagnetic Articulography ◽

Acoustic Speech Signal ◽

Preliminary Research ◽

Dynamic Time ◽

Mean Square Errors

Electromagnetic Articulography (EMA) is a precise method for speech articulators assessment which is carried out by sensors placed mainly on the tongue. Various methods are being developed in order to avoid the assessment by EMA sensors. One of them is speech inversion. Here preliminary research on speech inversion based on dynamic time warping (DTW) method has been described. Mel-frequency cepstral coefficients (MFCC) method has been chosen as the acoustic speech signal parametrization method. Root mean square errors (RMSE) of the evaluation have been presented and discussed.

Download Full-text

DTW–RADON-BASED SHAPE DESCRIPTOR FOR PATTERN RECOGNITION

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001413500080 ◽

2013 ◽

Vol 27 (03) ◽

pp. 1350008 ◽

Cited By ~ 27

Author(s):

K. C. SANTOSH ◽

BART LAMIROY ◽

LAURENT WENDLING

Keyword(s):

Pattern Recognition ◽

Dynamic Time Warping ◽

Recognition Performance ◽

Shape Descriptor ◽

Shape Descriptors ◽

Time Warping ◽

Recognition Method ◽

Dynamic Time ◽

Public Datasets ◽

Comprehensive Study

In this paper, we present a pattern recognition method that uses dynamic programming for the alignment of Radon features. The key characteristic of the method is to use dynamic time warping (DTW) to match corresponding pairs of the Radon features for all possible projections. Thanks to DTW, we avoid compressing the feature matrix into a single vector which would otherwise miss information. To reduce the possible number of matchings, we rely on a initial normalization based on the pattern orientation. A comprehensive study is made using major state-of-the-art shape descriptors over several public datasets of shapes such as graphical symbols (both printed and hand-drawn), handwritten characters and footwear prints. In all tests, the method proves its generic behavior by providing better recognition performance. Overall, we validate that our method is robust to deformed shape due to distortion, degradation and occlusion.

Download Full-text