scholarly journals Likelihood Ratio Based Score Fusion for Audio-Visual Speaker Identification in Challenging Environment

2010 ◽  
Vol 6 (7) ◽  
pp. 6-11 ◽  
Author(s):  
Rabiul Islam ◽  
Fayzur Rahman
2014 ◽  
Vol 2014 ◽  
pp. 1-13
Author(s):  
Md. Rabiul Islam ◽  
Md. Abdus Sobhan

This paper deals with a new and improved approach of Back-propagation learning neural network based likelihood ratio score fusion technique for audio-visual speaker Identification in various noisy environments. Different signal preprocessing and noise removing techniques have been used to process the speech utterance and LPC, LPCC, RCC, MFCC, ΔMFCC and ΔΔMFCC methods have been applied to extract the features from the audio signal. Active Shape Model has been used to extract the appearance and shape based facial features. To enhance the performance of the proposed system, appearance and shape based facial features are concatenated and Principal Component Analysis method has been used to reduce the dimension of the facial feature vector. The audio and visual feature vectors are then fed to Hidden Markov Model separately to find out the log-likelihood of each modality. The reliability of each modality has been calculated using reliability measurement method. Finally, these integrated likelihood ratios are fed to Back-propagation learning neural network algorithm to discover the final speaker identification result. For measuring the performance of the proposed system, three different databases, that is, NOIZEUS speech database, ORL face database and VALID audio-visual multimodal database have been used for audio-only, visual-only, and audio-visual speaker identification. To identify the accuracy of the proposed system with existing techniques under various noisy environment, different types of artificial noise have been added at various rates with audio and visual signal and performance being compared with different variations of audio and visual features.


IEEE Access ◽  
2017 ◽  
Vol 5 ◽  
pp. 13010-13019 ◽  
Author(s):  
Raheel Zafar ◽  
Sarat C. Dass ◽  
Aamir Saeed Malik ◽  
Nidal Kamel ◽  
M. Javvad Ur Rehman ◽  
...  

2008 ◽  
Vol 30 (2) ◽  
pp. 342-347 ◽  
Author(s):  
K. Nandakumar ◽  
Yi Chen ◽  
S.C. Dass ◽  
A.K. Jain

2011 ◽  
Vol 38 (1) ◽  
pp. 58-63 ◽  
Author(s):  
Loris Nanni ◽  
Alessandra Lumini ◽  
Sheryl Brahnam

2014 ◽  
Vol 2014 ◽  
pp. 1-11 ◽  
Author(s):  
Md. Rabiul Islam

The aim of this work is to propose a new feature and score fusion based iris recognition approach where voting method on Multiple Classifier Selection technique has been applied. Four Discrete Hidden Markov Model classifiers output, that is, left iris based unimodal system, right iris based unimodal system, left-right iris feature fusion based multimodal system, and left-right iris likelihood ratio score fusion based multimodal system, is combined using voting method to achieve the final recognition result. CASIA-IrisV4 database has been used to measure the performance of the proposed system with various dimensions. Experimental results show the versatility of the proposed system of four different classifiers with various dimensions. Finally, recognition accuracy of the proposed system has been compared with existingNhamming distance score fusion approach proposed by Ma et al., log-likelihood ratio score fusion approach proposed by Schmid et al., and single level feature fusion approach proposed by Hollingsworth et al.


Sign in / Sign up

Export Citation Format

Share Document