scholarly journals Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks

Author(s):  
A. Kastanos ◽  
A. Ragni ◽  
M. J. F. Gales
2016 ◽  
Vol 41 (4) ◽  
pp. 669-682 ◽  
Author(s):  
Gábor Gosztolya ◽  
András Beke ◽  
Tilda Neuberger ◽  
László Tóth

Abstract Laughter is one of the most important paralinguistic events, and it has specific roles in human conversation. The automatic detection of laughter occurrences in human speech can aid automatic speech recognition systems as well as some paralinguistic tasks such as emotion detection. In this study we apply Deep Neural Networks (DNN) for laughter detection, as this technology is nowadays considered state-of-the-art in similar tasks like phoneme identification. We carry out our experiments using two corpora containing spontaneous speech in two languages (Hungarian and English). Also, as we find it reasonable that not all frequency regions are required for efficient laughter detection, we will perform feature selection to find the sufficient feature subset.


Sign in / Sign up

Export Citation Format

Share Document