SPEAKER IDENTIFICATION MENGGUNAKAN TRANSFORMASI WAVELET DISKRIT DAN JARINGAN SARAF TIRUAN BACK-PROPAGATION

Anny Tandyo; Martono Martono; Adi Widyatmoko

doi:10.21512/commit.v2i1.482

SPEAKER IDENTIFICATION MENGGUNAKAN TRANSFORMASI WAVELET DISKRIT DAN JARINGAN SARAF TIRUAN BACK-PROPAGATION

CommIT (Communication and Information Technology) Journal ◽

10.21512/commit.v2i1.482 ◽

2008 ◽

Vol 2 (1) ◽

pp. 1

Author(s):

Anny Tandyo ◽

Martono Martono ◽

Adi Widyatmoko

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Speaker Recognition ◽

Speaker Identification ◽

Back Propagation ◽

Identification Accuracy ◽

Identification System ◽

Accuracy Rate ◽

Discrete Transformation ◽

Artificial Neural

Article discussed a speaker identification system. Which was a part of speaker recognition. The system identified asubject based on the voice from a group of pattern had been saved before. This system used a wavelet discrete transformationas a feature extraction method and an artificial neural network of back-propagation as a classification method. The voiceinput was processed by the wavelet discrete transformation in order to obtain signal coefficient of low frequency as adecomposition result which kept voice characteristic of everyone. The coefficient then was classified artificial neural networkof back-propagation. A system trial was conducted by collecting voice samples directly by using 225 microphones in nonsoundproof rooms; contained of 15 subjects (persons) and each of them had 15 voice samples. The 10 samples were used as atraining voice and 5 others as a testing voice. Identification accuracy rate reached 84 percent. The testing was also done onthe subjects who pronounced same words. It can be concluded that, the similar selection of words by different subjects has noinfluence on the accuracy rate produced by system.Keywords: speaker identification, wavelet discrete transformation, artificial neural network, back-propagation.

Download Full-text

COMPARATIVE STUDY OF CONTINUOUS HIDDEN MARKOV MODELS (CHMM) AND ARTIFICIAL NEURAL NETWORK (ANN) ON SPEAKER IDENTIFICATION SYSTEM

International Journal of Uncertainty Fuzziness and Knowledge-Based Systems ◽

10.1142/s0218488501001149 ◽

2001 ◽

Vol 09 (06) ◽

pp. 673-683 ◽

Cited By ~ 2

Author(s):

SAWIT KASURIYA ◽

CHAI WUTIWIWATCHAI ◽

VARIN ACHARIYAKULPORN ◽

CHULARAT TANPRASERT

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Comparative Study ◽

Speaker Identification ◽

Hidden Markov ◽

Identification System ◽

Office Environment ◽

Identification Rate ◽

Artificial Neural ◽

Artificial Neural Network Ann

This paper reports a comparative study between a continuous hidden Markov model (CHMM) and an artificial neural network (ANN) on a text dependent, closed set speaker identification (SID) system with Thai language recording in office and telephone environment. Thai isolated digit "0–9" and their concatenation are used as speaking text. Mel frequency cepstral coefficients (MFCC) are selected as the studied features. Two well-known recognition engines, CHMM and ANN, are conducted and compared. The ANN system (multilayer perceptron network with backpropagation learning algorithm) is applied with a special design of input feeding methods in avoiding the distortion from the normalization process. The general Gaussian density distribution HMM is developed for CHMM system. After optimizing some system's parameters by performing some preliminary experiments, CHMM gives the best identification rate at 90.4%, which is slightly better than 90.1% of ANN on digit "5" in office environment. For telephone environment, ANN gives the best identification rate at 88.84% on digit "0" which is higher than 81.1% of CHMM on digit "3". When using 3-concatenated digit, the identification rate of ANN and CHMM achieves 97.3% and 95.7% respectively for office environment, and 92.1% and 96.3% respectively for telephone environment.

Download Full-text

Development of a Gemstone Type Identification System Based on HSV Space Colour Using an Artificial Neural Network Back Propagation Algorithm

Proceedings of the International Conference on Science and Technology (ICOSAT 2017) ◽

10.2991/icosat-17.2018.24 ◽

2018 ◽

Author(s):

Ismatul Maula ◽

Victor Amrizal ◽

Anif Hanifa Setianingrum ◽

Nashrul Hakiem

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Back Propagation ◽

Identification System ◽

Back Propagation Algorithm ◽

Propagation Algorithm ◽

Artificial Neural

Download Full-text

Development of Quran Reciter Identification System Using MFCC and Neural Network

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v1.i1.pp168-175 ◽

2016 ◽

Vol 1 (1) ◽

pp. 168 ◽

Cited By ~ 1

Author(s):

Tayseer Mohammed Hasan Asda ◽

Teddy Surya Gunawan

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Feature Extraction ◽

Learning Algorithm ◽

Back Propagation ◽

Identification System ◽

Successful Match ◽

Artificial Neural ◽

Artificial Neural Network Ann ◽

Mel Frequency Cepstral Coefficient

Currently, the Quran is recited by so many reciters with different ways and voices. Some people like to listen to this reciter and others like to listen to other reciters. Sometimes we hear a very nice recitation of al-Quran and want to know who the reciter is. Therefore, this paper is about the development of Quran reciter recognition and identification system based on Mel Frequency Cepstral Coefficient (MFCC) feature extraction and artificial neural network (ANN). From every speech, characteristics from the utterances will be extracted through neural network model. In this paper a database of five Quran reciters is created and used in training and testing. The feature vector will be fed into Neural Network back propagation learning algorithm for training and identification processes of different speakers. Consequently, 91.2% of the successful match between targets and input occurred with certain number of hidden layers which shows how efficient are Mel Frequency Cepstral Coefficient (MFCC) feature extraction and artificial neural network (ANN) in identifying the reciter voice perfectly.

Download Full-text

Speaker identification system using empirical mode decomposition and an artificial neural network

Expert Systems with Applications ◽

10.1016/j.eswa.2010.11.013 ◽

2011 ◽

Vol 38 (5) ◽

pp. 6112-6117 ◽

Cited By ~ 31

Author(s):

Jian-Da Wu ◽

Yi-Jang Tsai

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Empirical Mode Decomposition ◽

Speaker Identification ◽

Identification System ◽

Mode Decomposition ◽

Artificial Neural

Download Full-text

Implementation of Back Propagation Artificial Neural Network for Heart Disease Abnormality Diagnosis

Journal of Physics Conference Series ◽

10.1088/1742-6596/1764/1/012165 ◽

2021 ◽

Vol 1764 (1) ◽

pp. 012165

Author(s):

Jaya Kuncara Rosa Susila ◽

Muhammad Afit ◽

Pujo Laksono

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Heart Disease ◽

Back Propagation ◽

Artificial Neural

Download Full-text

Test on Flood Prediction-Model Using Artificial Neural Network for ShiiLiAn Hydrologic Station on MinChiang,China

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.39.555 ◽

2010 ◽

Vol 39 ◽

pp. 555-561 ◽

Cited By ~ 1

Author(s):

Qing Hua Luan ◽

Yao Cheng ◽

Zha Xin Ima

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Flood Control ◽

Back Propagation ◽

Forecast Model ◽

Flood Forecast ◽

Back Propagation Algorithm ◽

Nonlinear Method ◽

Flood Prediction ◽

Artificial Neural

The establishing of a precise simulation model for runoff prediction in river with several tributaries is the difficulty of flood forecast, which is also one of the difficulties in hydrologic research. Due to the theory of Artificial Neural Network, using Back Propagation algorithm, the flood forecast model for ShiLiAn hydrologic station in Minjiang River is constructed and validated in this study. Through test, the result shows that the forecast accuracy is satisfied for all check standards of flood forecast and then proves the feasibility of using nonlinear method for flood forecast. This study provides a new method and reference for flood control and water resources management in the local region.

Download Full-text

Cerebrovascular Accident Attack Classification Using Multilayer Feed Forward Artificial Neural Network with Back Propagation Error

Journal of Computer Science ◽

10.3844/jcssp.2012.18.25 ◽

2012 ◽

Vol 8 (1) ◽

pp. 18-25 ◽

Cited By ~ 1

Author(s):

Olabode

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Cerebrovascular Accident ◽

Back Propagation ◽

Feed Forward ◽

Artificial Neural

Download Full-text

Non-invasive prediction of bloodstain age using the principal component and a back propagation artificial neural network

Laser Physics Letters ◽

10.1088/1612-202x/aa7c48 ◽

2017 ◽

Vol 14 (9) ◽

pp. 095601 ◽

Cited By ~ 3

Author(s):

Huimin Sun ◽

Yaoyong Meng ◽

Pingli Zhang ◽

Yajing Li ◽

Nan Li ◽

...

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Back Propagation ◽

Principal Component ◽

Non Invasive ◽

Artificial Neural

Download Full-text

Rainfall Forecasting Using Various Artificial Neural Network Techniques - A Review

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit2173159 ◽

2021 ◽

pp. 506-526

Author(s):

Nisha Thakur ◽

Sanjeev Karmakar ◽

Sunita Soni

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Moving Average ◽

Weather Prediction ◽

Optimization Technique ◽

Back Propagation ◽

Rainfall Forecasting ◽

Rainfall Prediction ◽

Artificial Neural ◽

Review Reports

The present review reports the work done by the various authors towards rainfall forecasting using the different techniques within Artificial Neural Network concepts. Back-Propagation, Auto-Regressive Moving Average (ARIMA), ANN , K- Nearest Neighbourhood (K-NN), Hybrid model (Wavelet-ANN), Hybrid Wavelet-NARX model, Rainfall-runoff models, (Two-stage optimization technique), Adaptive Basis Function Neural Network (ABFNN), Multilayer perceptron, etc., algorithms/technologies were reviewed. A tabular representation was used to compare the above-mentioned technologies for rainfall predictions. In most of the articles, training and testing, accuracy was found more than 95%. The rainfall prediction done using the ANN techniques was found much superior to the other techniques like Numerical Weather Prediction (NWP) and Statistical Method because of the non-linear and complex physical conditions affecting the occurrence of rainfall.

Download Full-text

Comparison of feature extraction and normalization methods for speaker recognition using grid-audiovisual database

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v18.i2.pp782-789 ◽

2020 ◽

Vol 18 (2) ◽

pp. 782

Author(s):

Musab T. S. Al-Kaltakchi ◽

Haithem Abd Al-Raheem Taha ◽

Mohanad Abd Shehab ◽

Mohamed A.M. Abdullah

Keyword(s):

Feature Extraction ◽

Speaker Recognition ◽

Speaker Identification ◽

Gaussian Mixture ◽

Identification Accuracy ◽

Identification System ◽

Good Representation ◽

Mel Frequency Cepstral Coefficients ◽

Normalization Methods ◽

Cepstral Coefficients

<p><span lang="EN-GB">In this paper, different feature extraction and feature normalization methods are investigated for speaker recognition. With a view to give a good representation of acoustic speech signals, Power Normalized Cepstral Coefficients (PNCCs) and Mel Frequency Cepstral Coefficients (MFCCs) are employed for feature extraction. Then, to mitigate the effect of linear channel, Cepstral Mean-Variance Normalization (CMVN) and feature warping are utilized. The current paper investigates Text-independent speaker identification system by using 16 coefficients from both the MFCCs and PNCCs features. Eight different speakers are selected from the GRID-Audiovisual database with two females and six males. The speakers are modeled using the coupling between the Universal Background Model and Gaussian Mixture Models (GMM-UBM) in order to get a fast scoring technique and better performance. The system shows 100% in terms of speaker identification accuracy. The results illustrated that PNCCs features have better performance compared to the MFCCs features to identify females compared to male speakers. Furthermore, feature wrapping reported better performance compared to the CMVN method. </span></p>

Download Full-text