scholarly journals SPEAKER IDENTIFICATION MENGGUNAKAN TRANSFORMASI WAVELET DISKRIT DAN JARINGAN SARAF TIRUAN BACK-PROPAGATION

Author(s):  
Anny Tandyo ◽  
Martono Martono ◽  
Adi Widyatmoko

Article discussed a speaker identification system. Which was a part of speaker recognition. The system identified asubject based on the voice from a group of pattern had been saved before. This system used a wavelet discrete transformationas a feature extraction method and an artificial neural network of back-propagation as a classification method. The voiceinput was processed by the wavelet discrete transformation in order to obtain signal coefficient of low frequency as adecomposition result which kept voice characteristic of everyone. The coefficient then was classified artificial neural networkof back-propagation. A system trial was conducted by collecting voice samples directly by using 225 microphones in nonsoundproof rooms; contained of 15 subjects (persons) and each of them had 15 voice samples. The 10 samples were used as atraining voice and 5 others as a testing voice. Identification accuracy rate reached 84 percent. The testing was also done onthe subjects who pronounced same words. It can be concluded that, the similar selection of words by different subjects has noinfluence on the accuracy rate produced by system.Keywords: speaker identification, wavelet discrete transformation, artificial neural network, back-propagation.

Author(s):  
SAWIT KASURIYA ◽  
CHAI WUTIWIWATCHAI ◽  
VARIN ACHARIYAKULPORN ◽  
CHULARAT TANPRASERT

This paper reports a comparative study between a continuous hidden Markov model (CHMM) and an artificial neural network (ANN) on a text dependent, closed set speaker identification (SID) system with Thai language recording in office and telephone environment. Thai isolated digit "0–9" and their concatenation are used as speaking text. Mel frequency cepstral coefficients (MFCC) are selected as the studied features. Two well-known recognition engines, CHMM and ANN, are conducted and compared. The ANN system (multilayer perceptron network with backpropagation learning algorithm) is applied with a special design of input feeding methods in avoiding the distortion from the normalization process. The general Gaussian density distribution HMM is developed for CHMM system. After optimizing some system's parameters by performing some preliminary experiments, CHMM gives the best identification rate at 90.4%, which is slightly better than 90.1% of ANN on digit "5" in office environment. For telephone environment, ANN gives the best identification rate at 88.84% on digit "0" which is higher than 81.1% of CHMM on digit "3". When using 3-concatenated digit, the identification rate of ANN and CHMM achieves 97.3% and 95.7% respectively for office environment, and 92.1% and 96.3% respectively for telephone environment.


Author(s):  
Tayseer Mohammed Hasan Asda ◽  
Teddy Surya Gunawan

Currently, the Quran is recited by so many reciters with different ways and voices.  Some people like to listen to this reciter and others like to listen to other reciters. Sometimes we hear a very nice recitation of al-Quran and want to know who the reciter is. Therefore, this paper is about  the development of Quran reciter recognition and identification system based on Mel Frequency Cepstral Coefficient (MFCC) feature extraction and artificial neural network (ANN). From every speech, characteristics from the utterances will be extracted through neural network model. In this paper a database of five Quran reciters is created and used in training and testing. The feature vector will be fed into Neural Network back propagation learning algorithm for training and identification processes of different speakers. Consequently,  91.2%  of the successful match between targets and input occurred with certain number of hidden layers  which shows how efficient are Mel Frequency Cepstral Coefficient (MFCC) feature extraction  and artificial neural network (ANN) in identifying the reciter voice perfectly.


2010 ◽  
Vol 39 ◽  
pp. 555-561 ◽  
Author(s):  
Qing Hua Luan ◽  
Yao Cheng ◽  
Zha Xin Ima

The establishing of a precise simulation model for runoff prediction in river with several tributaries is the difficulty of flood forecast, which is also one of the difficulties in hydrologic research. Due to the theory of Artificial Neural Network, using Back Propagation algorithm, the flood forecast model for ShiLiAn hydrologic station in Minjiang River is constructed and validated in this study. Through test, the result shows that the forecast accuracy is satisfied for all check standards of flood forecast and then proves the feasibility of using nonlinear method for flood forecast. This study provides a new method and reference for flood control and water resources management in the local region.


2017 ◽  
Vol 14 (9) ◽  
pp. 095601 ◽  
Author(s):  
Huimin Sun ◽  
Yaoyong Meng ◽  
Pingli Zhang ◽  
Yajing Li ◽  
Nan Li ◽  
...  

Author(s):  
Nisha Thakur ◽  
Sanjeev Karmakar ◽  
Sunita Soni

The present review reports the work done by the various authors towards rainfall forecasting using the different techniques within Artificial Neural Network concepts. Back-Propagation, Auto-Regressive Moving Average (ARIMA), ANN , K- Nearest Neighbourhood (K-NN), Hybrid model (Wavelet-ANN), Hybrid Wavelet-NARX model, Rainfall-runoff models, (Two-stage optimization technique), Adaptive Basis Function Neural Network (ABFNN), Multilayer perceptron, etc., algorithms/technologies were reviewed. A tabular representation was used to compare the above-mentioned technologies for rainfall predictions. In most of the articles, training and testing, accuracy was found more than 95%. The rainfall prediction done using the ANN techniques was found much superior to the other techniques like Numerical Weather Prediction (NWP) and Statistical Method because of the non-linear and complex physical conditions affecting the occurrence of rainfall.


Author(s):  
Musab T. S. Al-Kaltakchi ◽  
Haithem Abd Al-Raheem Taha ◽  
Mohanad Abd Shehab ◽  
Mohamed A.M. Abdullah

<p><span lang="EN-GB">In this paper, different feature extraction and feature normalization methods are investigated for speaker recognition. With a view to give a good representation of acoustic speech signals, Power Normalized Cepstral Coefficients (PNCCs) and Mel Frequency Cepstral Coefficients (MFCCs) are employed for feature extraction. Then, to mitigate the effect of linear channel, Cepstral Mean-Variance Normalization (CMVN) and feature warping are utilized. The current paper investigates Text-independent speaker identification system by using 16 coefficients from both the MFCCs and PNCCs features. Eight different speakers are selected from the GRID-Audiovisual database with two females and six males. The speakers are modeled using the coupling between the Universal Background Model and Gaussian Mixture Models (GMM-UBM) in order to get a fast scoring technique and better performance. The system shows 100% in terms of speaker identification accuracy. The results illustrated that PNCCs features have better performance compared to the MFCCs features to identify females compared to male speakers. Furthermore, feature wrapping reported better performance compared to the CMVN method. </span></p>


Sign in / Sign up

Export Citation Format

Share Document