Discriminative Training for Multiple Observation Likelihood Ratio Based Voice Activity Detection

IEEE Signal Processing Letters ◽

10.1109/lsp.2010.2066561 ◽

2010 ◽

Vol 17 (11) ◽

pp. 897-900 ◽

Author(s):

Tao Yu ◽

John H. L. Hansen

Keyword(s):

Likelihood Ratio ◽

Voice Activity Detection ◽

Discriminative Training ◽

Activity Detection ◽

Improved voice activity detection based on a smoothed statistical likelihood ratio

2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221) ◽

10.1109/icassp.2001.941020 ◽

2002 ◽

Author(s):

Y.D. Cho ◽

K. Al-Naimi ◽

A. Kondoz

Keyword(s):

Likelihood Ratio ◽

Voice Activity Detection ◽

Activity Detection ◽

Statistical voice activity detection based on integrated bispectrum likelihood ratio tests for robust speech recognition

The Journal of the Acoustical Society of America ◽

10.1121/1.2714915 ◽

2007 ◽

Vol 121 (5) ◽

pp. 2946-2958 ◽

Author(s):

J. Ramírez ◽

J. M. Górriz ◽

J. C. Segura

Keyword(s):

Speech Recognition ◽

Likelihood Ratio ◽

Voice Activity Detection ◽

Likelihood Ratio Tests ◽

Robust Speech Recognition ◽

Activity Detection ◽

Likelihood ratio sign test for voice activity detection

IET Signal Processing ◽

10.1049/iet-spr.2011.0109 ◽

2012 ◽

Vol 6 (4) ◽

pp. 306 ◽

Author(s):

S. Deng ◽

J. Han

Keyword(s):

Likelihood Ratio ◽

Voice Activity Detection ◽

Sign Test ◽

Activity Detection ◽

A Bayesian approach to voice activity detection using multiple statistical models and discriminative training

10.21437/interspeech.2010-775 ◽

2010 ◽

Author(s):

Tao Yu ◽

John H. L. Hansen

Keyword(s):

Bayesian Approach ◽

Statistical Models ◽

Voice Activity Detection ◽

Discriminative Training ◽

Activity Detection ◽

Mel-Scaled Autoregressive (Mel-AR) Model based Voice Activity Detection using Likelihood Ratio Measure

International Journal of Computer Applications ◽

10.5120/ijca2019918600 ◽

2019 ◽

Vol 182 (45) ◽

pp. 1-4

Author(s):

M. Babul

Keyword(s):

Likelihood Ratio ◽

Voice Activity Detection ◽

AR Model ◽

Activity Detection ◽

Model Based ◽

Voice Activity ◽

Robust statistical voice activity detection using a likelihood ratio sign test

10.21437/interspeech.2010-778 ◽

2010 ◽

Author(s):

Shiwen Deng ◽

Jiqing Han

Keyword(s):

Likelihood Ratio ◽

Voice Activity Detection ◽

Sign Test ◽

Activity Detection ◽

Voice activity detection using harmonic frequency components in likelihood ratio test

2010 IEEE International Conference on Acoustics, Speech and Signal Processing ◽

10.1109/icassp.2010.5495611 ◽

2010 ◽

Author(s):

Lee Ngee Tan ◽

Bengt J. Borgstrom ◽

Abeer Alwan

Keyword(s):

Likelihood Ratio ◽

Likelihood Ratio Test ◽

Voice Activity Detection ◽

Activity Detection ◽

Harmonic Frequency ◽

Frequency Components ◽

Voice Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test

2010 20th International Conference on Pattern Recognition ◽

10.1109/icpr.2010.30 ◽

2010 ◽

Author(s):

Shiwen Deng ◽

Jiqing Han

Keyword(s):

Likelihood Ratio ◽

Likelihood Ratio Test ◽

Atomic Decomposition ◽

Voice Activity Detection ◽

Activity Detection ◽

Complex Exponential ◽

Auditory Device Voice Activity Detection Based on Statistical Likelihood-Ratio Order Statistics

Applied Sciences ◽

10.3390/app10155026 ◽

2020 ◽

Vol 10 (15) ◽

pp. 5026

Author(s):

Seon Man Kim

Keyword(s):

Statistical Model ◽

Order Statistics ◽

Likelihood Ratio ◽

Voice Activity Detection ◽

Activity Detection ◽

Noisy Environments ◽

Likelihood Ratio Order ◽

Voice Activity ◽

False Rejection

This paper proposes a technique for improving statistical-model-based voice activity detection (VAD) in noisy environments, intended for use in a hearing aid. The proposed method is implemented for a uniform polyphase discrete Fourier transform filter bank that satisfies an auditory device latency of 8 ms. The proposed VAD technique provides an online unified framework to overcome the frequent false rejections of the statistical-model-based likelihood-ratio test (LRT) in noisy environments. The method is based on the observation that the sparseness of speech and background noise causes high false-rejection error rates in statistical LRT-based VAD; the false-rejection rate increases as the sparseness increases. We demonstrate that the false-rejection error rate can be reduced by incorporating likelihood-ratio order statistics into a conventional LRT VAD. We confirm experimentally that the proposed method reduces the average detection error rate by a relative 15.8% compared to a conventional VAD, with only minimal change in the false-acceptance probability, for three different noise conditions whose signal-to-noise ratios range from 0 to 20 dB.
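
The abstract does not say how the order statistics are combined with the LRT, so the following is only an illustrative sketch, not the paper's method. It assumes the commonly used Gaussian statistical model, in which the log likelihood ratio of frequency bin k is gamma_k * xi_k / (1 + xi_k) - log(1 + xi_k), with gamma_k the a posteriori SNR and xi_k the a priori SNR. The function names, the `keep_fraction` parameter, the zero threshold, and the toy frame are all hypothetical. The sketch shows how a frame with speech energy in a single bin can be rejected when the log-LRs of all bins are averaged, yet accepted when only the largest log-LRs (an order-statistic selection) are averaged.

```python
import math


def bin_log_lrs(noisy_power, noise_power, prior_snr):
    """Per-bin log likelihood ratios under the Gaussian statistical model."""
    log_lrs = []
    for y, lam_n, xi in zip(noisy_power, noise_power, prior_snr):
        gamma = y / lam_n  # a posteriori SNR of this bin
        log_lrs.append(gamma * xi / (1.0 + xi) - math.log1p(xi))
    return log_lrs


def conventional_lrt(log_lrs, threshold=0.0):
    """Conventional decision: mean log-LR over all bins against a threshold."""
    return sum(log_lrs) / len(log_lrs) > threshold


def order_statistic_lrt(log_lrs, keep_fraction=0.25, threshold=0.0):
    """Hypothetical variant: average only the largest log-LRs of the frame."""
    k = max(1, int(round(keep_fraction * len(log_lrs))))
    top = sorted(log_lrs, reverse=True)[:k]
    return sum(top) / k > threshold


# Toy sparse frame (assumed values): speech energy in one of eight bins.
noisy_power = [6.0, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5]
noise_power = [1.0] * 8
prior_snr = [1.0] * 8  # fixed a priori SNR, an assumption for illustration

frame_log_lrs = bin_log_lrs(noisy_power, noise_power, prior_snr)
all_bins_decision = conventional_lrt(frame_log_lrs)
top_bins_decision = order_statistic_lrt(frame_log_lrs)
```

In this toy frame the seven noise-only bins pull the full average below the threshold, which is the false-rejection behaviour the abstract attributes to sparseness, while the selection over the largest values still flags the frame as speech. A practical detector would also need a priori SNR estimation and threshold tuning, both omitted here.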

Polishing the Classical Likelihood Ratio Test by Supervised Learning for Voice Activity Detection

10.21437/interspeech.2020-1177 ◽

2020 ◽

Author(s):

Tianjiao Xu ◽

Hui Zhang ◽

Xueliang Zhang

Keyword(s):

Supervised Learning ◽

Likelihood Ratio ◽

Likelihood Ratio Test ◽

Voice Activity Detection ◽

Activity Detection ◽
