Integration of hidden markov models in the automated speaker recognition system for critical use

Voice is one of the parameters in the identification process of a person. Through the voice, information will be obtained such as gender, age, and even the identity of the speaker. Speaker recognition is a method to narrow down crimes and frauds committed by voice. So that it will minimize the occurrence of faking one's identity. The Method of Mel Frequency Cepstrum Coefficient (MFCC) can be used in the speech recognition system. The process of feature extraction of speech signal using MFCC will produce acoustic speech signal. The classification, Hidden Markov Models (HMM) is used to match unidentified speaker’s voice with the voices in database. In this research, the system is used to verify the speaker, namely 15 text dependent in Indonesian. On testing the speaker with the same as database, the highest accuracy is 99,16%.

Download Full-text

Sistem Pengenalan Pembicara dengan Metode Wavelet-MCFF dan Pengklasifikasi Hidden Markov Models (HMM)

Jurnal Teknologi Informasi dan Ilmu Komputer ◽

10.25126/jtiik.0813284 ◽

2021 ◽

Vol 8 (1) ◽

pp. 119

Author(s):

Syahroni Hidayat ◽

Andi Sofyan Anas ◽

Siti Agrippina Alodia Yusuf ◽

Muhammad Tajuddin

Keyword(s):

Hidden Markov Models ◽

Speaker Recognition ◽

Markov Models ◽

Hidden Markov ◽

Digital Signal ◽

Recognition System ◽

Level 1 ◽

New Feature ◽

Dyadic Decomposition ◽

Mel Frequency Cepstral Coefficient

Penelitian pengolahan sinyal digital yang berfokus pada pengenalan pembicara telah dimulai sejak beberapa dekade yang lalu, dan telah menghasilkan banyak metode-metode pengenalan pembicara. Di antara algoritma pembentukan koefisien ciri yang telah dikembangkan tersebut, ada dua algoritma yang dapat memberikan akurasi yang tinggi jika diterapkan pada sistem, yaitu Mel Frequency Cepstral Coefficient (MFCC) dan Wavelet. Penelitian ini bertujuan untuk menguji dan memilih kanal terbaik dari proses wavelet-MFCC yang dapat dijadikan sebagai koefisien ciri baru untuk diterapkan pada sistem pengenal pembicara. Koefisien ciri baru tersebut kemudian disebut dengan koefisien ciri Wavelet-MFCC. Kofisien ini dibentuk dari merubah kanal hasil dekomposisi wavelet, yaitu kanal aproksimasi (cA), kanal detail (cD), dan penggabungannya (cAcD), menjadi koefisien MFCC. Metode dekomposisi wavelet yang digunakan adalah metode dyadic dengan menerapkan level dekomposisi level 1 dan level 2. Setiap koefisien ciri kemudian menjadi inputan pada sistem pengklasifikasi Hidden Markov Models (HMM). Keluaran dari HMM kemudian dihitung akurasinya dan dianalisis. Dari pengujian yang dilakukan, diperoleh bahwa kanal detail (cD) sebagai ciri dapat memberikan akurasi yang sama dengan menggunakan kanal gabungan (cAcD) dan lebih tinggi dari kanal aproksimasi (cA), dengan akurasi sebesar 95%. Hal ini menunjukkan bahwa, kanal detail pada dekomposisi level 1 menyimpan ciri suara dari setiap pembicara sehingga sudah cukup untuk dijadikan sebagai koefisien ciri. Maka, penggunaan dekomposisi level 1 dan kanal detail cD sebagai ciri Wavelet-MFCC pada sistem pengenalan pembicara dapat meringankan dan mempercepat proses komputasi. AbstractResearch in digital signal that focused on speaker recognition has begun since decades ago, and has resulted many speaker recognition methods. there are two algorithms that can provide high accuracy in recognition system, which are Mel Frequency Cepstral Coefficient (MFCC) and Wavelet. the aims of this study is to examine and chose the best channel from wavelet-MFCC process that can be used as new feature coefficient, then called as Wavelet-MFCC features coefficient. The coefficient is built by converting the wavelet decomposition channels, which are approximation (cA), detail (cD), and its combination (cAcD), into the MFCC coefficient. Wavelet dyadic decomposition with level 1 and level 2 of decomposition is applied. Each feature coefficient acts as an input to the HMM classifier. The accuracy of the HMM output is calculated, then analyzed. The obtained results show that the detail chanel (cD) achieve equal accuracy as the combination chanel (cAcD), and higher accuracy compared to aproximation channel (cA), with accuracy 95%. Thus, it can be conclude that the detail channel on level 1 decomposition contains features of each speaker's. Then, cD is enough to be used as a Wavelet-MFCC feature. Thus, its implementation in the SRS can ease and speed up the computing process.

Download Full-text

Development of the hidden Markov models based Lithuanian speech recognition system

10.1117/12.872119 ◽

2010 ◽

Author(s):

Z. Ringeliene ◽

A. Lipeika

Keyword(s):

Speech Recognition ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Recognition System ◽

Speech Recognition System

Download Full-text

An off-line oriental character recognition system (OOCRS): synergy of distortion modeling, hidden Markov models and vector quantization

Pattern Recognition ◽

10.1016/s0031-3203(01)00090-5 ◽

2002 ◽

Vol 35 (5) ◽

pp. 1007-1023

Author(s):

Khue Hiang Chan

Keyword(s):

Hidden Markov Models ◽

Vector Quantization ◽

Character Recognition ◽

Markov Models ◽

Hidden Markov ◽

Recognition System

Download Full-text

Robust facial expression recognition system based on hidden Markov models

International Journal of Multimedia Information Retrieval ◽

10.1007/s13735-016-0113-8 ◽

2016 ◽

Vol 5 (4) ◽

pp. 229-236 ◽

Cited By ~ 3

Author(s):

Zineb Elgarrai ◽

Othmane El Meslouhi ◽

Mustapha Kardouchi ◽

Hakim Allali

Keyword(s):

Facial Expression ◽

Hidden Markov Models ◽

Facial Expression Recognition ◽

Markov Models ◽

Hidden Markov ◽

Recognition System ◽

Expression Recognition

Download Full-text

Limited‐Vocabulary Estonian Continuous Speech Recognition System using Hidden Markov Models

Informatica ◽

10.15388/informatica.2004.062 ◽

2004 ◽

Vol 15 (3) ◽

pp. 303-314

Author(s):

Tanel Alumäe ◽

Leo Võhandu

Keyword(s):

Speech Recognition ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Recognition System ◽

Speech Recognition System ◽

Continuous Speech ◽

Continuous Speech Recognition

Download Full-text

Infant Cry Recognition System

10.4018/978-1-6684-2408-7.ch029 ◽

2022 ◽

pp. 629-647

Author(s):

Yosra Abdulaziz Mohammed

Keyword(s):

Neural Networks ◽

Artificial Neural Networks ◽

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Recognition System ◽

Acoustic Features ◽

Infant Cry ◽

Identification Rate ◽

The One

Cries of infants can be seen as an indicator of pain. It has been proven that crying caused by pain, hunger, fear, stress, etc., show different cry patterns. The work presented here introduces a comparative study between the performance of two different classification techniques implemented in an automatic classification system for identifying two types of infants' cries, pain, and non-pain. The techniques are namely, Continuous Hidden Markov Models (CHMM) and Artificial Neural Networks (ANN). Two different sets of acoustic features were extracted from the cry samples, those are MFCC and LPCC, the feature vectors generated by each were eventually fed into the classification module for the purpose of training and testing. The results of this work showed that the system based on CDHMM have better performance than that based on ANN. CDHMM gives the best identification rate at 96.1%, which is much higher than 79% of ANN whereby in general the system based on MFCC features performed better than the one that utilizes LPCC features.

Download Full-text