Contribution of vocal tract and glottal source spectral cues in the generation of happy and aggressive [a] vowels

Author(s):  
Marc Freixes ◽  
Francesc Alías ◽  
Joan Claudi Socoró
2005 ◽  
Vol 100 (2) ◽  
pp. 421-431 ◽  
Author(s):  
Manish K. Rami ◽  
Joseph Kalinowski ◽  
Michael P. Rastatter ◽  
Donald Holbert ◽  
Mark Allen

This study investigated use of choral reading with filtered components of speech and whispered speech on the frequency of stuttering. Three passages read by a normal adult male were lowpass filtered with kneepoint frequencies at 100 Hz (approximate glottal source), 500 Hz (source and first formant), and 1 kHz (source and the first two formants). Along with a whispered passage, a normal passage, and a control condition, these stimuli were used in a repeated-measures design with 12 adult stutterers as they read passages while listening to one of the stimuli. Frequencies of stuttering in each condition were analyzed. The choral speech, the 500-Hz, the 1-kHz, and the whispered speech conditions all decreased the frequency of stuttering while the 100-Hz stimuli did not. It is suggested that articulatory events, chiefly the encoded speech output from the vocal tract, create effective cues and may induce fluent speech in people who stutter.


2017 ◽  
Vol 28 (11) ◽  
pp. 115902 ◽  
Author(s):  
Antti Hannukainen ◽  
Juha Kuortti ◽  
Jarmo Malinen ◽  
Antti Ojalammi

Author(s):  
Yongwei Li ◽  
Jianhua Tao ◽  
Donna Erickson ◽  
Bin Liu ◽  
Masato Akagi

2019 ◽  
Vol 9 (21) ◽  
pp. 4535 ◽  
Author(s):  
Marc Freixes ◽  
Marc Arnela ◽  
Joan Socoró ◽  
Francesc Alías ◽  
Oriol Guasch

Articulatory speech synthesis has long been based on one-dimensional (1D) approaches. They assume plane wave propagation within the vocal tract and disregard higher order modes that typically appear above 5 kHz. However, such modes may be relevant in obtaining a more natural voice, especially for phonation types with significant high frequency energy (HFE) content. This work studies the contribution of the glottal source at high frequencies in the 3D numerical synthesis of vowels. The spoken vocal range is explored using an LF (Liljencrants–Fant) model enhanced with aspiration noise and controlled by the R d glottal shape parameter. The vowels [ɑ], [i], and [u] are generated with a finite element method (FEM) using realistic 3D vocal tract geometries obtained from magnetic resonance imaging (MRI), as well as simplified straight vocal tracts of a circular cross-sectional area. The symmetry of the latter prevents the onset of higher order modes. Thus, the comparison between realistic and simplified geometries enables us to analyse the influence of such modes. The simulations indicate that higher order modes may be perceptually relevant, particularly for tense phonations (lower R d values) and/or high fundamental frequency values, F 0 s. Conversely, vowels with a lax phonation and/or low F0s may result in inaudible HFE levels, especially if aspiration noise is not considered in the glottal source model.


Sign in / Sign up

Export Citation Format

Share Document