LTSD and GDMD features for Telephone Speech Endpoint Detection

Abstract In this paper, a brief summary of the author’s research in the field of the contour-based telephone speech Endpoint Detection (ED) is presented. This research includes: development of new robust features for ED – the Mean-Delta feature and the Group Delay Mean-Delta feature and estimation of the effect of the analyzed ED features and two additional features in the Dynamic Time Warping fixed-text speaker verification task with short noisy telephone phrases in Bulgarian language.

Download Full-text

Telephone Speech Endpoint Detection using Mean-Delta Feature

Cybernetics and Information Technologies ◽

10.2478/cait-2014-0025 ◽

2014 ◽

Vol 14 (2) ◽

pp. 127-139 ◽

Cited By ~ 3

Author(s):

Atanas Ouzounov

Keyword(s):

Dynamic Time Warping ◽

Speaker Verification ◽

Verification Task ◽

Endpoint Detection ◽

Time Warping ◽

Energy Entropy ◽

Telephone Speech ◽

Dynamic Time ◽

Teager Energy ◽

Speech Endpoint Detection

Abstract In the study the efficiency of three features for trajectory-based endpoint detection is experimentally evaluated in the fixed-text Dynamic Time Warping (DTW) - a based speaker verification task with short phrases of telephone speech. The employed features are Modified Teager Energy (MTE), Energy-Entropy (EE) feature and Mean-Delta (MD) feature. The utterance boundaries in the endpoint detector are provided by means of state automaton and a set of thresholds based only on trajectory characteristics. The training and testing have been done with noisy telephone speech (short phrases in Bulgarian language with length of about 2 s) selected from BG-SRDat corpus. The results of the experiments have shown that the MD feature demonstrates the best performance in the endpoint detection tests in terms of the verification rate.

Download Full-text

A Low-Power Text-Dependent Speaker Verification System with Narrow-Band Feature Pre-Selection and Weighted Dynamic Time Warping

10.21437/odyssey.2016-1 ◽

2016 ◽

Author(s):

Qing He ◽

Gregory Wornell ◽

Wei Ma

Keyword(s):

Low Power ◽

Dynamic Time Warping ◽

Narrow Band ◽

Speaker Verification ◽

Time Warping ◽

Verification System ◽

Dynamic Time ◽

Text Dependent Speaker Verification

Download Full-text

Addressing Text-Dependent Speaker Verification Using Singing Speech

Applied Sciences ◽

10.3390/app9132636 ◽

2019 ◽

Vol 9 (13) ◽

pp. 2636 ◽

Cited By ~ 2

Author(s):

Yan Shi ◽

Juanjuan Zhou ◽

Yanhua Long ◽

Yijie Li ◽

Hongwei Mao

Keyword(s):

Dynamic Time Warping ◽

State Of The Art ◽

Speaker Verification ◽

Gaussian Mixture ◽

Time Warping ◽

Normal Reading ◽

Feature Spaces ◽

Dynamic Time ◽

The One ◽

Normal Speech

The automatic speaker verification (ASV) has achieved significant progress in recent years. However, it is still very challenging to generalize the ASV technologies to new, unknown and spoofing conditions. Most previous studies focused on extracting the speaker information from natural speech. This paper attempts to address the speaker verification from another perspective. The speaker identity information was exploited from singing speech. We first designed and released a new corpus for speaker verification based on singing and normal reading speech. Then, the speaker discrimination was compared and analyzed between natural and singing speech in different feature spaces. Furthermore, the conventional Gaussian mixture model, the dynamic time warping and the state-of-the-art deep neural network were investigated. They were used to build text-dependent ASV systems with different training-test conditions. Experimental results show that the voiceprint information in the singing speech was more distinguishable than the one in the normal speech. More than relative 20% reduction of equal error rate was obtained on both the gender-dependent and independent 1 s-1 s evaluation tasks.

Download Full-text

Building Sequence Kernels for Speaker Verification and Word Recognition

Intelligent Information Technologies ◽

10.4018/978-1-59904-941-0.ch033 ◽

2011 ◽

pp. 575-589

Author(s):

Vincent Wan

Keyword(s):

Speech Recognition ◽

Speech Processing ◽

Kernel Methods ◽

Speaker Recognition ◽

Dynamic Time Warping ◽

Speaker Verification ◽

Dimensional Space ◽

Time Warping ◽

Recognition Systems ◽

Dynamic Time

This chapter describes the adaptation and application of kernel methods for speech processing. It is divided into two sections dealing with speaker verification and isolated-word speech recognition applications. Significant advances in kernel methods have been realised in the field of speaker verification, particularly relating to the direct scoring of variable-length speech utterances by sequence kernel SVMs. The improvements are so substantial that most state-of-the-art speaker recognition systems now incorporate SVMs. We describe the architecture of some of these sequence kernels. Speech recognition presents additional challenges to kernel methods and their application in this area is not as straightforward as for speaker verification. We describe a sequence kernel that uses dynamic time warping to capture temporal information within the kernel directly. The formulation also extends the standard dynamic time-warping algorithm by enabling the dynamic alignment to be computed in a high-dimensional space induced by a kernel function. This kernel is shown to work well in an application for recognising low-intelligibility speech of severely dysarthric individuals.

Download Full-text

Building Sequence Kernels for Speaker Verification and Word Recognition

Kernel Methods in Bioengineering, Signal and Image Processing ◽

10.4018/978-1-59904-042-4.ch010 ◽

2011 ◽

pp. 246-262

Author(s):

Vincent Wan

Keyword(s):

Speech Recognition ◽

Speech Processing ◽

Kernel Methods ◽

Speaker Recognition ◽

Dynamic Time Warping ◽

Speaker Verification ◽

Dimensional Space ◽

Time Warping ◽

Recognition Systems ◽

Dynamic Time

This chapter describes the adaptation and application of kernel methods for speech processing. It is divided into two sections dealing with speaker verification and isolated-word speech recognition applications. Significant advances in kernel methods have been realised in the field of speaker verification, particularly relating to the direct scoring of variable-length speech utterances by sequence kernel SVMs. The improvements are so substantial that most state-of-the-art speaker recognition systems now incorporate SVMs. We describe the architecture of some of these sequence kernels. Speech recognition presents additional challenges to kernel methods and their application in this area is not as straightforward as for speaker verification. We describe a sequence kernel that uses dynamic time warping to capture temporal information within the kernel directly. The formulation also extends the standard dynamic time-warping algorithm by enabling the dynamic alignment to be computed in a high-dimensional space induced by a kernel function. This kernel is shown to work well in an application for recognising low-intelligibility speech of severely dysarthric individuals.

Download Full-text

Design of speaker verification systems with the use of an algorithm of Dynamic Time Warping (DTW)

Pattern Recognition and Image Analysis ◽

10.1134/s1054661807040050 ◽

2007 ◽

Vol 17 (4) ◽

pp. 470-479

Author(s):

V. V. Geppener ◽

K. K. Simonchik ◽

A. S. Haidar

Keyword(s):

Dynamic Time Warping ◽

Speaker Verification ◽

Time Warping ◽

Dynamic Time ◽

Verification Systems

Download Full-text

Text-Independent Speaker Verification Based on Deep Neural Networks and Segmental Dynamic Time Warping

2018 IEEE Spoken Language Technology Workshop (SLT) ◽

10.1109/slt.2018.8639574 ◽

2018 ◽

Cited By ~ 1

Author(s):

Mohamed Adel ◽

Mohamed Afify ◽

Akram Gaballah ◽

Magda Fayek

Keyword(s):

Neural Networks ◽

Dynamic Time Warping ◽

Deep Neural Networks ◽

Speaker Verification ◽

Time Warping ◽

Segmental Dynamic ◽

Dynamic Time ◽

Text Independent Speaker Verification

Download Full-text

HiLAM-state discriminative multi-task deep neural network in dynamic time warping framework for text-dependent speaker verification

Speech Communication ◽

10.1016/j.specom.2020.03.007 ◽

2020 ◽

Vol 121 ◽

pp. 29-43

Author(s):

Mohammad Azharuddin Laskar ◽

Rabul Hussain Laskar

Keyword(s):

Neural Network ◽

Dynamic Time Warping ◽

Deep Neural Network ◽

Speaker Verification ◽

Time Warping ◽

Dynamic Time ◽

Text Dependent Speaker Verification

Download Full-text

Long-term p21 and p53 trends regulate the frequency of mitosis events and cell cycle arrest

10.1101/2021.08.17.456721 ◽

2021 ◽

Author(s):

Anh Phong Tran ◽

Christopher J. Tralie ◽

Caroline Moosmüller ◽

Zehor Belkhatir ◽

José Reyes ◽

...

Keyword(s):

Cell Cycle ◽

Dynamic Time Warping ◽

Time Warping ◽

Biological Mechanisms ◽

Term Trend ◽

X Ray ◽

Cycle Arrest ◽

Dynamic Time ◽

Long Term Trend

Radiation exposure of healthy cells can halt cell cycle temporarily or permanently. In this work, two single cell datasets that monitored the time evolution of p21 and p53, one subjected to gamma irradiation and the other to x-ray irradiation, are analyzed to uncover the dynamics of this process. New insights into the biological mechanisms were found by decomposing the p53 and p21 signals into transient and oscillatory components. Through the use of dynamic time warping on the oscillatory components of the two signals, we found that p21 signaling lags behind its lead signal, p53, by about 3.5 hours with oscillation periods of around 6 hours. Additionally, through various quantification methods, we showed how p21 levels, and to a lesser extent p53 levels, dictate whether the cells are arrested in their cell cycle and how fast these cells divide depending on their long-term trend in these signals.

Download Full-text