Voice activity detection for speaker verification systems

Mapping Intimacies ◽

10.1117/12.784839 ◽

2007 ◽

Author(s):

Filip Borowski

Keyword(s):

Speaker Verification ◽

Voice Activity Detection ◽

Activity Detection ◽

Verification Systems ◽

Voice Activity Detection for Speaker Verification Systems

Rough Sets and Knowledge Technology - Lecture Notes in Computer Science ◽

10.1007/978-3-540-72458-2_22 ◽

2007 ◽

pp. 181-186 ◽

Author(s):

Jaroslaw Baszun

Keyword(s):

Speaker Verification ◽

Voice Activity Detection ◽

Activity Detection ◽

Verification Systems ◽

Speaker Verification Under Degraded Conditions Using Empirical Mode Decomposition Based Voice Activity Detection Algorithm

Journal of Intelligent Systems ◽

10.1515/jisys-2013-0085 ◽

2014 ◽

Vol 23 (4) ◽

pp. 359-378

Author(s):

M. S. Rudramurthy ◽

V. Kamakshi Prasad ◽

R. Kumaraswamy

Keyword(s):

Speaker Recognition ◽

Speaker Verification ◽

Signal To Noise Ratio ◽

Gaussian Mixture ◽

Detection Algorithm ◽

Voice Activity Detection ◽

Activity Detection ◽

Front End ◽

Different Types ◽

Abstract: The performance of most of the state-of-the-art speaker recognition (SR) systems deteriorates under degraded conditions, owing to mismatch between the training and testing sessions. This study focuses on the front end of the speaker verification (SV) system to reduce the mismatch between training and testing. An adaptive voice activity detection (VAD) algorithm using zero-frequency filter assisted peaking resonator (ZFFPR) was integrated into the front end of the SV system. The performance of this proposed SV system was studied under degraded conditions with 50 selected speakers from the NIST 2003 database. The degraded condition was simulated by adding different types of noises to the original speech utterances. The different types of noises were chosen from the NOISEX-92 database to simulate degraded conditions at signal-to-noise ratio levels from 0 to 20 dB. In this study, widely used 39-dimension Mel frequency cepstral coefficient (MFCC; i.e., 13-dimension MFCCs augmented with 13-dimension velocity and 13-dimension acceleration coefficients) features were used, and Gaussian mixture model–universal background model was used for speaker modeling. The proposed system’s performance was studied against the energy-based VAD used as the front end of the SV system. The proposed SV system showed some encouraging results when EMD-based VAD was used at its front end.
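The two generic steps this abstract mentions, mixing noise into clean speech at a chosen signal-to-noise ratio and using an energy-based VAD as a baseline front end, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`mix_at_snr`, `energy_vad`), the 160-sample frame length, and the 30 dB threshold below the loudest frame are all assumptions chosen for illustration, and the EMD-based and ZFFPR-based detectors are not reproduced here.

```python
import math


def power(samples):
    """Mean squared amplitude of a sequence of samples."""
    return sum(v * v for v in samples) / len(samples)


def mix_at_snr(speech, noise, snr_db):
    """Add `noise` to `speech` after scaling it to the requested SNR in dB.

    Hypothetical helper: SNR is defined here as
    10 * log10(speech power / scaled noise power) over the whole utterance.
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    noise_power = power(noise)
    if noise_power == 0:
        raise ValueError("noise must not be all zeros")
    target_noise_power = power(speech) / (10 ** (snr_db / 10))
    gain = math.sqrt(target_noise_power / noise_power)
    return [s + gain * n for s, n in zip(speech, noise)]


def energy_vad(samples, frame_len=160, threshold_db=-30.0):
    """Return one boolean per non-overlapping frame: True means 'speech'.

    Assumed rule: a frame is kept when its log energy is within
    `threshold_db` of the loudest frame in the utterance.
    """
    frames = [samples[i:i + frame_len]
              for i in range(0, len(samples) - frame_len + 1, frame_len)]
    if not frames:
        return []
    # A small floor avoids log10(0) on digitally silent frames.
    energies_db = [10 * math.log10(power(f) + 1e-12) for f in frames]
    peak_db = max(energies_db)
    return [e >= peak_db + threshold_db for e in energies_db]
```

A fixed relative threshold like this is simple to reason about but tends to admit noisy frames as the SNR drops, which is the kind of weakness that motivates adaptive detectors such as the one studied in the paper.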

Robust voice activity detection for narrow-bandwidth speaker verification under adverse environments

10.21437/interspeech.2007-169 ◽

2007 ◽

Author(s):

Tuan Van Pham ◽

Michael Neffe ◽

Gernot Kubin

Keyword(s):

Speaker Verification ◽

Voice Activity Detection ◽

Activity Detection ◽

Narrow Bandwidth ◽

Voice Activity ◽

Adverse Environments

The role of Voice Activity Detection in forensic speaker verification

2011 17th International Conference on Digital Signal Processing (DSP) ◽

10.1109/icdsp.2011.6004980 ◽

2011 ◽

Author(s):

Francesco Beritelli ◽

Andrea Spadaccini

Keyword(s):

Speaker Verification ◽

Voice Activity Detection ◽

Activity Detection ◽

Self-Adaptive Soft Voice Activity Detection Using Deep Neural Networks for Robust Speaker Verification

2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) ◽

10.1109/asru46091.2019.9003935 ◽

2019 ◽

Author(s):

Youngmoon Jung ◽

Yeunju Choi ◽

Hoirin Kim

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Speaker Verification ◽

Voice Activity Detection ◽

Activity Detection ◽

Voice Activity ◽

Influence of Noise and Voice Activity Detection on Speaker Verification

Computer Networks - Communications in Computer and Information Science ◽

10.1007/978-3-319-39207-3_18 ◽

2016 ◽

pp. 207-215

Author(s):

Adam Dustor

Keyword(s):

Speaker Verification ◽

Voice Activity Detection ◽

Activity Detection ◽

Influence Of Noise ◽

Voice activity detection method based on inter-frame correlation

Journal of Computer Applications ◽

10.3724/sp.j.1087.2011.01447 ◽

2011 ◽

Vol 31 (5) ◽

pp. 1447-1449

Author(s):

Yu LI ◽

Lei-yong GUO ◽

Hong-zhou TAN

Keyword(s):

Detection Method ◽

Voice Activity Detection ◽

Activity Detection ◽

Voice Activity ◽

Robust speaker recognition based on level-building voice activity detection

Journal of Shenzhen University Science and Engineering ◽

10.3724/sp.j.1249.2012.04328 ◽

2012 ◽

Vol 29 (4) ◽

pp. 328-334

Author(s):

Yan-lu XIE ◽

Jing-song ZHANG ◽

Ming-hui LIU ◽

Zhong-wei HUANG

Keyword(s):

Speaker Recognition ◽

Voice Activity Detection ◽

Activity Detection ◽

Robust Speaker Recognition ◽

Level Building ◽

Joint Learning Using Denoising Variational Autoencoders for Voice Activity Detection

10.21437/interspeech.2018-1151 ◽

2018 ◽

Author(s):

Youngmoon Jung ◽

Younggwan Kim ◽

Yeunju Choi ◽

Hoirin Kim

Keyword(s):

Voice Activity Detection ◽

Activity Detection ◽

Joint Learning ◽

Optimizing Voice Activity Detection for Noisy Conditions

10.21437/interspeech.2019-1776 ◽

2019 ◽

Author(s):

Ruixi Lin ◽

Charles Costello ◽

Charles Jankowski ◽

Vishwas Mruthyunjaya

Keyword(s):

Voice Activity Detection ◽

Activity Detection ◽

Noisy Conditions ◽
