An analysis-by-synthesis study of Mandarin speech prosody

Author(s):  
Na Zhi ◽  
Daniel Hirst ◽  
Pier Marco Bertinetto ◽  
Aijun Li ◽  
Yuan Jia
2021 ◽  
Vol 11 (5) ◽  
pp. 1990
Author(s):  
Vinod Devaraj ◽  
Philipp Aichinger

The characterization of voice quality is important for the diagnosis of a voice disorder. Vocal fry is a voice quality which is traditionally characterized by a low frequency and a long closed phase of the glottis. However, we also observed amplitude modulated vocal fry glottal area waveforms (GAWs) without long closed phases (positive group) which we modelled using an analysis-by-synthesis approach. Natural and synthetic GAWs are modelled. The negative group consists of euphonic, i.e., normophonic GAWs. The analysis-by-synthesis approach fits two modelled GAWs for each of the input GAW. One modelled GAW is modulated to replicate the amplitude and frequency modulations of the input GAW and the other modelled GAW is unmodulated. The modelling errors of the two modelled GAWs are determined to classify the GAWs into the positive and the negative groups using a simple support vector machine (SVM) classifier with a linear kernel. The modelling errors of all vocal fry GAWs obtained using the modulating model are smaller than the modelling errors obtained using the unmodulated model. Using the two modelling errors as predictors for classification, no false positives or false negatives are obtained. To further distinguish the subtypes of amplitude modulated vocal fry GAWs, the entropy of the modulator’s power spectral density and the modulator-to-carrier frequency ratio are obtained.


2013 ◽  
Vol 61 (22) ◽  
pp. 5789-5800 ◽  
Author(s):  
Amirpasha Shirazinia ◽  
Saikat Chatterjee ◽  
Mikael Skoglund

1996 ◽  
Vol 4 (3) ◽  
pp. 243-247 ◽  
Author(s):  
S. Cucchi ◽  
M. Fratti ◽  
M. Ronchi

2018 ◽  
Vol 104 ◽  
pp. 95-105
Author(s):  
Jonathan S. Brumberg ◽  
Jill C. Thorson ◽  
Rupal Patel
Keyword(s):  

2015 ◽  
Vol 111 ◽  
pp. 14-25 ◽  
Author(s):  
M.D. Pell ◽  
K. Rothermich ◽  
P. Liu ◽  
S. Paulmann ◽  
S. Sethi ◽  
...  
Keyword(s):  

2012 ◽  
Vol 54 (2) ◽  
pp. 37-54 ◽  
Author(s):  
Maciej Karpiński

ABSTRACT Maciej Karpiński. The Boundaries of Language: Dealing with Paralinguistic Features. Lingua Posnaniensis, vol. LIV (2)/2012. The Poznań Society for the Advancement of the Arts and Sciences. PL ISSN 0079-4740, ISBN 978-83-7654-252-2, pp. 37-54. The paralinguistic component of communication attracted a great deal of attention from contemporary linguists in the 1960s. The seminal works written then by Trager, Crystal and others had a powerful influence on the concept of paralanguage that lasted for many years. But, with the focus shifting towards the socio-psychological context of communication in the 1970s, the development of spoken corpora and databases and the significant progress in speech technology in the 1980s and 1990s, the need has arisen for a more comprehensive, coherent and formalised - but also flexible - approach to paralinguistic features. This study advances some preliminary proposals for a revised treatment of paralanguage that would meet some of these requirements and provide a conceptual basis for a new system of annotation for paralinguistic features. A range of views on paralinguistic features, which come mostly from the fields of speech prosody and gesture analysis, are briefly discussed. A number of assumptions and postulates are formulated to allow for a more consistent approach to paralinguistic features. The study suggests that there should be more reliance on continua than on binary categorisations of features, that multi-functionality and multimodality should be fully acknowledged and that clear distinctions should be made among the levels of description, and between the properties of speakers and the speech signal itself.


Sign in / Sign up

Export Citation Format

Share Document