speech endpoint detection
Recently Published Documents


TOTAL DOCUMENTS

64
(FIVE YEARS 1)

H-INDEX

6
(FIVE YEARS 0)

2021 ◽  
Vol 2010 (1) ◽  
pp. 012178
Author(s):  
Hongli Wang ◽  
Qingning Zeng ◽  
Xuejun Zhao


2020 ◽  
Vol 2020 ◽  
pp. 1-19
Author(s):  
Nan Jiang ◽  
Ting Liu

This paper studies the segmentation and clustering of speaker speech. In order to improve the accuracy of speech endpoint detection, the traditional double-threshold short-time average zero-crossing rate is replaced by a better spectrum centroid feature, and the local maxima of the statistical feature sequence histogram are used to select the threshold, and a new speech endpoint detection algorithm is proposed. Compared with the traditional double-threshold algorithm, it effectively improves the detection accuracy and antinoise in low SNR. The k-means algorithm of conventional clustering needs to give the number of clusters in advance and is greatly affected by the choice of initial cluster centers. At the same time, the self-organizing neural network algorithm converges slowly and cannot provide accurate clustering information. An improved k-means speaker clustering algorithm based on self-organizing neural network is proposed. The number of clusters is predicted by the winning situation of the competitive neurons in the trained network, and the weights of the neurons are used as the initial cluster centers of the k-means algorithm. The experimental results of multiperson mixed speech segmentation show that the proposed algorithm can effectively improve the accuracy of speech clustering and make up for the shortcomings of the k-means algorithm and self-organizing neural network algorithm.



2020 ◽  
Vol 1617 ◽  
pp. 012070
Author(s):  
Zhenye Gan ◽  
Miaomiao Hou ◽  
Hexiang Hou ◽  
Hongwu Yang


2020 ◽  
Vol 160 ◽  
pp. 107133 ◽  
Author(s):  
Tao Zhang ◽  
Yangyang Shao ◽  
Yaqin Wu ◽  
Yanzhang Geng ◽  
Long Fan


2019 ◽  
Vol 31 (1) ◽  
pp. 70-77
Author(s):  
Yongping Dan ◽  
Yaming Song ◽  
Dongyun Wang ◽  
Fenghui Zhang ◽  
Wei Liu ◽  
...  

A snoring recognition algorithm based on machine learning is proposed to effectively and precisely recognize snoring. To obtain a dataset, the speech endpoint detection algorithm and Mel frequency cepstrum coefficient feature extraction algorithm are applied to process speech signal samples. The dataset is classified into snoring and nonsnoring data (other speech signals) using support vector machines. Experimental results show that the algorithm recognizes snoring signals with a high accuracy rate of 97% and positively impacts subsequent research and related engineering applications.



2017 ◽  
Vol 17 (4) ◽  
pp. 114-133
Author(s):  
Atanas Ouzounov

AbstractThis paper proposes a new contour-based speech endpoint detector which combines the log-Group Delay Mean-Delta (log-GDMD) feature, an adaptive twothreshold scheme and an eight-state automaton. The adaptive thresholds scheme uses two pairs of thresholds - for the starting and for the ending points, respectively. Each pair of thresholds is calculated by using the contour characteristics in the corresponded region of the utterance. The experimental results have shown that the proposed detector demonstrates better performance compared to the Long-Term Spectral Divergence (LTSD) one in terms of endpoint accuracy. Additional fixed-text speaker verification tests with short phrases of telephone speech based on the Dynamic Time Warping (DTW) and left-to-right Hidden Markov Model (HMM) frameworks confirm the improvements of the verification rate due to the better endpoint accuracy.



2017 ◽  
Vol 20 (3) ◽  
pp. 651-658 ◽  
Author(s):  
Linhui Sun ◽  
Min Su ◽  
Zhenzhen Yang


Sign in / Sign up

Export Citation Format

Share Document