Likelihood Ratio Based Score Fusion for Audio-Visual Speaker Identification in Challenging Environment

This paper presents an improved likelihood ratio score fusion technique, based on a Back-propagation learning neural network, for audio-visual speaker identification in various noisy environments. Different signal preprocessing and noise removal techniques have been used to process the speech utterance, and LPC, LPCC, RCC, MFCC, ΔMFCC and ΔΔMFCC methods have been applied to extract features from the audio signal. An Active Shape Model has been used to extract the appearance and shape based facial features. To enhance the performance of the proposed system, the appearance and shape based facial features are concatenated, and Principal Component Analysis has been used to reduce the dimension of the facial feature vector. The audio and visual feature vectors are then fed separately to a Hidden Markov Model to obtain the log-likelihood of each modality. The reliability of each modality has been calculated using a reliability measurement method. Finally, these integrated likelihood ratios are fed to a Back-propagation learning neural network to produce the final speaker identification result. To measure the performance of the proposed system, three databases, namely the NOIZEUS speech database, the ORL face database and the VALID audio-visual multimodal database, have been used for audio-only, visual-only, and audio-visual speaker identification. To compare the accuracy of the proposed system with existing techniques under various noisy environments, different types of artificial noise have been added at various rates to the audio and visual signals, and performance has been compared across different variations of audio and visual features.
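The fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the reliability measure or the network architecture, so everything below is an assumption: `reliability` uses the mean gap between the best log-likelihood and the others, and `fuse` replaces the Back-propagation neural network with a plain reliability-weighted sum. The function names and the per-speaker scores are hypothetical.

```python
def reliability(loglik):
    """Mean gap between the best log-likelihood and all others (assumed measure)."""
    best = max(loglik)
    return sum(best - x for x in loglik) / (len(loglik) - 1)


def fuse(audio_ll, visual_ll):
    """Weight each modality by its normalised reliability and sum the scores.

    This weighted sum stands in for the neural network of the paper.
    Returns the index of the best-scoring speaker and the audio weight.
    """
    r_a, r_v = reliability(audio_ll), reliability(visual_ll)
    total = r_a + r_v
    # If neither modality discriminates at all, fall back to equal weights.
    w_a = 0.5 if total == 0 else r_a / total
    fused = [w_a * a + (1.0 - w_a) * v for a, v in zip(audio_ll, visual_ll)]
    best_speaker = max(range(len(fused)), key=fused.__getitem__)
    return best_speaker, w_a


# Hypothetical per-speaker log-likelihoods for three enrolled speakers.
audio = [-120.0, -118.0, -119.0]   # noisy audio: scores are nearly tied
visual = [-300.0, -340.0, -260.0]  # clean video: speaker 2 stands out
speaker, audio_weight = fuse(audio, visual)
```

With these made-up scores the audio stream receives a small weight because its log-likelihoods barely separate the speakers, so the visual stream dominates the decision.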


Prediction of Human Brain Activity Using Likelihood Ratio Based Score Fusion

IEEE Access ◽

10.1109/access.2017.2698068 ◽

2017 ◽

Vol 5 ◽

pp. 13010-13019 ◽

Cited By ~ 4

Author(s):

Raheel Zafar ◽

Sarat C. Dass ◽

Aamir Saeed Malik ◽

Nidal Kamel ◽

M. Javvad Ur Rehman ◽

...

Keyword(s):

Human Brain ◽

Likelihood Ratio ◽

Brain Activity ◽

Score Fusion


Likelihood Ratio-Based Biometric Score Fusion

IEEE Transactions on Pattern Analysis and Machine Intelligence ◽

10.1109/tpami.2007.70796 ◽

2008 ◽

Vol 30 (2) ◽

pp. 342-347 ◽

Cited By ~ 325

Author(s):

K. Nandakumar ◽

Yi Chen ◽

S.C. Dass ◽

A.K. Jain

Keyword(s):

Likelihood Ratio ◽

Score Fusion


Blind Extraction of Moving Audio Source in a Challenging Environment Supported by Speaker Identification Via X-Vectors

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414331 ◽

2021 ◽

Author(s):

Jiri Malek ◽

Jakub Jansky ◽

Tomas Kounovsky ◽

Zbynek Koldovsky ◽

Jindrich Zdansky

Keyword(s):

Speaker Identification ◽

Challenging Environment ◽

Blind Extraction


Speaker identification using pairwise log-likelihood ratio measures

2012 9th International Conference on Fuzzy Systems and Knowledge Discovery ◽

10.1109/fskd.2012.6234345 ◽

2012 ◽

Author(s):

Yi-Hsiang Chao

Keyword(s):

Likelihood Ratio ◽

Speaker Identification ◽

Log Likelihood ◽

Log Likelihood Ratio


Likelihood ratio based features for a trained biometric score fusion

Expert Systems with Applications ◽

10.1016/j.eswa.2010.06.006 ◽

2011 ◽

Vol 38 (1) ◽

pp. 58-63 ◽

Cited By ~ 18

Author(s):

Loris Nanni ◽

Alessandra Lumini ◽

Sheryl Brahnam

Keyword(s):

Likelihood Ratio ◽

Score Fusion


Strength of forensic speaker identification evidence: multispeaker formant- and cepstrum-based segmental discrimination with a Bayesian likelihood ratio as threshold

International Journal of Speech Language and the Law ◽

10.1558/sll.2003.10.2.179 ◽

2003 ◽

Vol 10 (2) ◽

pp. 179-202 ◽

Cited By ~ 13

Author(s):

Phil Rose ◽

Takashi Osanai ◽

Yuko Kinoshita

Keyword(s):

Likelihood Ratio ◽

Speaker Identification ◽

Forensic Speaker Identification


Modality Selection Attacks and Modality Restriction in Likelihood-Ratio Based Biometric Score Fusion

IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences ◽

10.1587/transfun.e100.a.3023 ◽

2017 ◽

Vol E100.A (12) ◽

pp. 3023-3037

Author(s):

Takao MURAKAMI ◽

Yosuke KAGA ◽

Kenta TAKAHASHI

Keyword(s):

Likelihood Ratio ◽

Score Fusion


Feature and Score Fusion Based Multiple Classifier Selection for Iris Recognition

Computational Intelligence and Neuroscience ◽

10.1155/2014/380585 ◽

2014 ◽

Vol 2014 ◽

pp. 1-11 ◽

Cited By ~ 8

Author(s):

Md. Rabiul Islam

Keyword(s):

Likelihood Ratio ◽

Iris Recognition ◽

Feature Fusion ◽

Ratio Score ◽

Multimodal System ◽

Classifier Selection ◽

Multiple Classifier ◽

Score Fusion ◽

Fusion Approach ◽

Voting Method

The aim of this work is to propose a new feature and score fusion based iris recognition approach in which a voting method is applied to a Multiple Classifier Selection technique. The outputs of four Discrete Hidden Markov Model classifiers, that is, a left iris based unimodal system, a right iris based unimodal system, a left-right iris feature fusion based multimodal system, and a left-right iris likelihood ratio score fusion based multimodal system, are combined using the voting method to achieve the final recognition result. The CASIA-IrisV4 database has been used to measure the performance of the proposed system with various dimensions. Experimental results show the versatility of the proposed system of four different classifiers with various dimensions. Finally, the recognition accuracy of the proposed system has been compared with the existing Hamming distance score fusion approach proposed by Ma et al., the log-likelihood ratio score fusion approach proposed by Schmid et al., and the single level feature fusion approach proposed by Hollingsworth et al.
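Combining the four classifier outputs by voting can be sketched as follows. The abstract does not state which voting rule or tie-breaking rule is used, so this sketch assumes plain majority voting with ties resolved in favour of the classifier listed first. The function name and the subject labels are hypothetical.

```python
from collections import Counter


def majority_vote(decisions):
    """Return the label chosen by the most classifiers.

    Ties go to the earliest classifier in the list (an assumed rule,
    not taken from the paper).
    """
    counts = Counter(decisions)
    top = max(counts.values())
    for label in decisions:
        if counts[label] == top:
            return label


# Hypothetical decisions from the four systems, in the order: left iris,
# right iris, feature fusion, likelihood ratio score fusion.
votes = ["subject_07", "subject_12", "subject_07", "subject_07"]
winner = majority_vote(votes)
```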


Hybrid Feature and Decision Fusion Based Audio-Visual Speaker Identification in Challenging Environment

International Journal of Computer Applications ◽

10.5120/1384-1864 ◽

2010 ◽

Vol 9 (5) ◽

pp. 9-15 ◽

Cited By ~ 1

Author(s):

Rabiul Islam ◽

Fayzur Rahman

Keyword(s):

Speaker Identification ◽

Decision Fusion ◽

Challenging Environment
