A Novel Approach to Increase the Efficiency of a Multi-lingual Real-time Speaker Identification System

International Journal of Systems Applications, Engineering & Development ◽

10.46300/91015.2020.14.21 ◽

2020 ◽

Vol 14 ◽

Keyword(s):

Real Time ◽

Speech Enhancement ◽

Speaker Recognition ◽

Speaker Identification ◽

Cost Effective ◽

Recognition System ◽

Identification System ◽

Novel Approach ◽

Activity Method ◽

Voice Activity

Nowadays, the real-time speaker recognition system is very popular due to its cost-effective nature. However, it is a very challenging one to produce a more efficient speaker identification system. In our work, we work on a multi-lingual real-time speaker identification system. We work in a novel way to enhance the efficiency of the said system. We take some real speech signals and use different speech enhancement methods and our proposed voice activity method (VAD) to enhance the efficiency of said system. By doing so, we increase the accuracy of the said system relatively by 2% as compared to existing methods.

Download Full-text

A CNN based Speaker Recognition System using an Alternate Bone Microphone

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.b7647.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 4224-4227

Keyword(s):

Speaker Recognition ◽

Input Data ◽

Speaker Identification ◽

Recognition System ◽

The Other ◽

Identification System ◽

Video Recorder ◽

State Of Art

State-of-art speaker recognition system uses acoustic microphone speech to identify/verify a speaker. The multimodal speaker recognition system includes modality of input data recorded using sources like acoustics mic,array mic ,throat mic, bone mic and video recorder. In this paper we implemented a multi-modal speaker identification system with three modality of speech as input, recorded from different microphones like air mic, throat mic and bone mic . we propose and claim an alternate way of recording the bone speech using a throat microphone and the results of a implemented speaker recognition using CNN and spectrogram is presented. The obtained results supports our claim to use the throat microphone as suitable mic to record the bone conducted speech and the accuracy of the speaker recognition system with signal speech recorded from air microphone get improved about 10% after including the other modality of speech like throat and bone speech along with the air conducted speech.

Download Full-text

A Novel Approach To Enhance The Efficiency Of Real-Time Speaker Identification System

2018 International Conference on Recent Innovations in Electrical, Electronics & Communication Engineering (ICRIEECE) ◽

10.1109/icrieece44171.2018.9009340 ◽

2018 ◽

Author(s):

Subhashree Priyadarshini ◽

Susanta Kumar Sarangi ◽

K. C Bhuyan

Keyword(s):

Real Time ◽

Speaker Identification ◽

Identification System ◽

Novel Approach

Download Full-text

New Feature Vectors using GFCC for Speaker Identification

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i8.146 ◽

2018 ◽

Vol 6 (8) ◽

pp. 243

Author(s):

A. Nagesh

Keyword(s):

Speaker Recognition ◽

Speaker Identification ◽

Signal To Noise Ratio ◽

Main Idea ◽

Extraction Methods ◽

Identification System ◽

Identification Performance ◽

Feature Vectors ◽

Overall Performance ◽

New Feature

The feature vectors of speaker identification system plays a crucial role in the overall performance of the system. There are many new feature vectors extraction methods based on MFCC, but ultimately we want to maximize the performance of SID system. The objective of this paper to derive Gammatone Frequency Cepstral Coefficients (GFCC) based a new set of feature vectors using Gaussian Mixer model (GMM) for speaker identification. The MFCC are the default feature vectors for speaker recognition, but they are not very robust at the presence of additive noise. The GFCC features in recent studies have shown very good robustness against noise and acoustic change. The main idea is GFCC features based on GMM feature extraction is to improve the overall speaker identification performance in low signal to noise ratio (SNR) conditions.

Download Full-text

Performance Evaluation of Mel and Bark Scale based Features for Text-Independent Speaker Identification

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.k1999.0981119 ◽

2019 ◽

Vol 8 (11) ◽

pp. 3734-3738

Keyword(s):

Speaker Recognition ◽

Filter Bank ◽

Work Performance ◽

Speaker Identification ◽

Recognition Rate ◽

Identification System ◽

Bank Structure ◽

Speech Features ◽

Recognition Systems ◽

Human Ear

The performance of Mel scale and Bark scale is evaluated for text-independent speaker identification system. Mel scale and Bark scale are designed according to human auditory system. The filter bank structure is defined using Mel and Bark scales for speech and speaker recognition systems to extract speaker specific speech features. In this work, performance of Mel scale and Bark scale is evaluated for text-independent speaker identification system. It is found that Bark scale centre frequencies are more effective than Mel scale centre frequencies in case of Indian dialect speaker databases. Mel scale is defined as per interpretation of pitch by human ear and Bark scale is based on critical band selectivity at which loudness becomes significantly different. The recognition rate achieved using Bark scale filter bank is 96% for AISSMSIOIT database and 95% for Marathi database.

Download Full-text

Comparison of feature extraction and normalization methods for speaker recognition using grid-audiovisual database

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v18.i2.pp782-789 ◽

2020 ◽

Vol 18 (2) ◽

pp. 782

Author(s):

Musab T. S. Al-Kaltakchi ◽

Haithem Abd Al-Raheem Taha ◽

Mohanad Abd Shehab ◽

Mohamed A.M. Abdullah

Keyword(s):

Feature Extraction ◽

Speaker Recognition ◽

Speaker Identification ◽

Gaussian Mixture ◽

Identification Accuracy ◽

Identification System ◽

Good Representation ◽

Mel Frequency Cepstral Coefficients ◽

Normalization Methods ◽

Cepstral Coefficients

<p><span lang="EN-GB">In this paper, different feature extraction and feature normalization methods are investigated for speaker recognition. With a view to give a good representation of acoustic speech signals, Power Normalized Cepstral Coefficients (PNCCs) and Mel Frequency Cepstral Coefficients (MFCCs) are employed for feature extraction. Then, to mitigate the effect of linear channel, Cepstral Mean-Variance Normalization (CMVN) and feature warping are utilized. The current paper investigates Text-independent speaker identification system by using 16 coefficients from both the MFCCs and PNCCs features. Eight different speakers are selected from the GRID-Audiovisual database with two females and six males. The speakers are modeled using the coupling between the Universal Background Model and Gaussian Mixture Models (GMM-UBM) in order to get a fast scoring technique and better performance. The system shows 100% in terms of speaker identification accuracy. The results illustrated that PNCCs features have better performance compared to the MFCCs features to identify females compared to male speakers. Furthermore, feature wrapping reported better performance compared to the CMVN method. </span></p>

Download Full-text

Improved text-independent speaker identification system for real time applications

2016 Fourth International Japan-Egypt Conference on Electronics, Communications and Computers (JEC-ECC) ◽

10.1109/jec-ecc.2016.7518967 ◽

2016 ◽

Cited By ~ 5

Author(s):

Nagwa M. AboElenein ◽

Khalid M. Amin ◽

Mina. Ibrahim ◽

Mohiy M. Hadhoud

Keyword(s):

Real Time ◽

Speaker Identification ◽

Identification System ◽

Real Time Applications

Download Full-text

Real Time Speaker Recognition System for Hindi Words

International Journal of Information Engineering and Electronic Business ◽

10.5815/ijieeb.2014.02.04 ◽

2014 ◽

Vol 6 (2) ◽

pp. 35-40 ◽

Cited By ~ 1

Author(s):

Geeta Nijhawan ◽

M.K Soni

Keyword(s):

Real Time ◽

Speaker Recognition ◽

Recognition System

Download Full-text

SPEAKER IDENTIFICATION MENGGUNAKAN TRANSFORMASI WAVELET DISKRIT DAN JARINGAN SARAF TIRUAN BACK-PROPAGATION

CommIT (Communication and Information Technology) Journal ◽

10.21512/commit.v2i1.482 ◽

2008 ◽

Vol 2 (1) ◽

pp. 1

Author(s):

Anny Tandyo ◽

Martono Martono ◽

Adi Widyatmoko

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Speaker Recognition ◽

Speaker Identification ◽

Back Propagation ◽

Identification Accuracy ◽

Identification System ◽

Accuracy Rate ◽

Discrete Transformation ◽

Artificial Neural

Article discussed a speaker identification system. Which was a part of speaker recognition. The system identified asubject based on the voice from a group of pattern had been saved before. This system used a wavelet discrete transformationas a feature extraction method and an artificial neural network of back-propagation as a classification method. The voiceinput was processed by the wavelet discrete transformation in order to obtain signal coefficient of low frequency as adecomposition result which kept voice characteristic of everyone. The coefficient then was classified artificial neural networkof back-propagation. A system trial was conducted by collecting voice samples directly by using 225 microphones in nonsoundproof rooms; contained of 15 subjects (persons) and each of them had 15 voice samples. The 10 samples were used as atraining voice and 5 others as a testing voice. Identification accuracy rate reached 84 percent. The testing was also done onthe subjects who pronounced same words. It can be concluded that, the similar selection of words by different subjects has noinfluence on the accuracy rate produced by system.Keywords: speaker identification, wavelet discrete transformation, artificial neural network, back-propagation.

Download Full-text

Real Time Speaker Recognition System using MFCC and Vector Quantization Technique

International Journal of Computer Applications ◽

10.5120/20520-2361 ◽

2015 ◽

Vol 117 (1) ◽

pp. 25-31 ◽

Cited By ~ 7

Author(s):

Roma Bharti ◽

Priyanka Bansal

Keyword(s):

Real Time ◽

Vector Quantization ◽

Speaker Recognition ◽

Recognition System

Download Full-text

A Novel Approach to Vehicle Number Identification using Raspberry pi 3

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit1952346 ◽

2019 ◽

pp. 64-67

Author(s):

N. Varshini ◽

Sumedha Kasarla ◽

Shaik Subhani

Keyword(s):

Real Time ◽

Traffic Signals ◽

Raspberry Pi ◽

Identification System ◽

License Plate ◽

Text Format ◽

Novel Approach ◽

Image Conversion ◽

Image Capturing ◽

Conversion Technology

Vehicle Number Identification using Raspberry pi 3 is an image conversion technology which captures the license plate of a vehicle. The main aim is to make an effective and accurate license number plate identification system. This system is carried out and performed in the areas where traffic signals are present and the camera is placed on the signal which is connected to raspberry pi and it sends signals to the server and it can also be used in apartments or residencies for capturing all the vehicle numbers entering the building. This system at first detects the vehicle license plate and then captures it .It then converts the image into the text. The text of the license plate is displayed on the screen using the image conversion. Open CV and OCR are the two software's used for image capturing and conversion of that into text format respectively. The resulting data is then displayed on the screen and saved into a folder. The whole system is developed on Raspberry Pi desktop and its performance is used in real-time. It is observed from this experiment that the system mainly detects and captures the vehicle license plate, converts the image into text and displays it on the screen successfully.

Download Full-text