Analysis of the voice source in different phonation types: simultaneous high-sped imaging of the vocal fold vibration and glottal inverse filtering

Author(s):  
Hannu Pulakka ◽  
Paavo Alku ◽  
Svante Granqvist ◽  
Stellan Hertegard ◽  
Hans Larsson ◽  
...  
Author(s):  
Johan Sundberg

The function of the voice organ is basically the same in classical singing as in speech. However, loud orchestral accompaniment has necessitated the use of the voice in an economical way. As a consequence, the vowel sounds tend to deviate considerably from those in speech. Male voices cluster formant three, four, and five, so that a marked peak is produced in spectrum envelope near 3,000 Hz. This helps them to get heard through a loud orchestral accompaniment. They seem to achieve this effect by widening the lower pharynx, which makes the vowels more centralized than in speech. Singers often sing at fundamental frequencies higher than the normal first formant frequency of the vowel in the lyrics. In such cases they raise the first formant frequency so that it gets somewhat higher than the fundamental frequency. This is achieved by reducing the degree of vocal tract constriction or by widening the lip and jaw openings, constricting the vocal tract in the pharyngeal end and widening it in the mouth. These deviations from speech cause difficulties in vowel identification, particularly at high fundamental frequencies. Actually, vowel identification is almost impossible above 700 Hz (pitch F5). Another great difference between vocal sound produced in speech and the classical singing tradition concerns female voices, which need to reduce the timbral differences between voice registers. Females normally speak in modal or chest register, and the transition to falsetto tends to happen somewhere above 350 Hz. The great timbral differences between these registers are avoided by establishing control over the register function, that is, over the vocal fold vibration characteristics, so that seamless transitions are achieved. In many other respects, there are more or less close similarities between speech and singing. Thus, marking phrase structure, emphasizing important events, and emotional coloring are common principles, which may make vocal artists deviate considerably from the score’s nominal description of fundamental frequency and syllable duration.


2016 ◽  
Vol 59 (5) ◽  
pp. 994-1001 ◽  
Author(s):  
Bruce R. Gerratt ◽  
Jody Kreiman ◽  
Marc Garellek

Purpose The question of what type of utterance—a sustained vowel or continuous speech—is best for voice quality analysis has been extensively studied but with equivocal results. This study examines whether previously reported differences derive from the articulatory and prosodic factors occurring in continuous speech versus sustained phonation. Method Speakers with voice disorders sustained vowels and read sentences. Vowel samples were excerpted from the steadiest portion of each vowel in the sentences. In addition to sustained and excerpted vowels, a 3rd set of stimuli was created by shortening sustained vowel productions to match the duration of vowels excerpted from continuous speech. Acoustic measures were made on the stimuli, and listeners judged the severity of vocal quality deviation. Results Sustained vowels and those extracted from continuous speech contain essentially the same acoustic and perceptual information about vocal quality deviation. Conclusions Perceived and/or measured differences between continuous speech and sustained vowels derive largely from voice source variability across segmental and prosodic contexts and not from variations in vocal fold vibration in the quasisteady portion of the vowels. Approaches to voice quality assessment by using continuous speech samples average across utterances and may not adequately quantify the variability they are intended to assess.


1998 ◽  
Vol 41 (5) ◽  
pp. 990-1002 ◽  
Author(s):  
Paavo Alku ◽  
Erkki Vilkman ◽  
Anne-Maria Laukkanen

A new method is presented for the parameterization of glottal volume velocity waveforms that have been estimated by inverse filtering acoustic speech pressure signals. The new technique, Parameter for Spectral and Amplitude Features of the Glottal Flow (PSA), combines two features of voice production, the AC value and the spectral decay of the glottal flow, both of which contribute to changes in vocal loudness. PSA yields a single parameter that characterizes the glottal flow in different loudness conditions. By analyzing voices of 8 speakers it was shown that the new parameter correlates strongly with the sound pressure level of speech.


2004 ◽  
Vol 29 (4) ◽  
pp. 147-153 ◽  
Author(s):  
D. Zangger Borch ◽  
J. Sundberg ◽  
P.-Å. Lindestad ◽  
M. Thalén

1983 ◽  
Vol 26 (3) ◽  
pp. 329-340 ◽  
Author(s):  
J. Sundberg

The chest wall vibrations generated by phonation were examined with respect to origin, amplitude, and spectrum in seven singers. The vibrations were picked up in male and female singer by means of small accelerometers, which were fastened to the skin on the center of the sternum, and, for the purpose of comparison, on the thyroid cartilage and on the trachea a few cm below the larynx. Among the factors which influence the chest wall vibrations, the subglottal pressure oscillations seem to represent the most important excitation, while the mechanical shocks from the vocal fold vibrations seem secondary. Supra- and subglottal resonances seem moderately influential. The results of the investigation suggest that it is the voice source, particularly the amplitude of its fundamental, which is reflected in the chest wall vibrations. The amplitude of these vibrations seem to lie above the threshold of the Pacinian receptors for vibratory sensation as long as the fundamental frequency is below approximately 300 Hz. The amplitude of the voice source fundamental and hence the sensation of chest vibrations vary considerably as phonation is changed along the phonatory dimension which ranges from "pressed/tense/strained" through "flow" to "breaty" phonation. For this reason it seems likely that the sensation of chest vibrations can serve as a useful nonauditory and hence room-independent signal for the voluntary control of phonation.


2021 ◽  
Vol 11 (3) ◽  
pp. 1221
Author(s):  
Dariush Bodaghi ◽  
Qian Xue ◽  
Xudong Zheng ◽  
Scott Thomson

An in-house 3D fluid–structure–acoustic interaction numerical solver was employed to investigate the effect of subglottic stenosis (SGS) on dynamics of glottal flow, vocal fold vibration and acoustics during voice production. The investigation focused on two SGS properties, including severity defined as the percentage of area reduction and location. The results show that SGS affects voice production only when its severity is beyond a threshold, which is at 75% for the glottal flow rate and acoustics, and at 90% for the vocal fold vibrations. Beyond the threshold, the flow rate, vocal fold vibration amplitude and vocal efficiency decrease rapidly with SGS severity, while the skewness quotient, vibration frequency, signal-to-noise ratio and vocal intensity decrease slightly, and the open quotient increases slightly. Changing the location of SGS shows no effect on the dynamics. Further analysis reveals that the effect of SGS on the dynamics is primarily due to its effect on the flow resistance in the entire airway, which is found to be related to the area ratio of glottis to SGS. Below the SGS severity of 75%, which corresponds to an area ratio of glottis to SGS of 0.1, changing the SGS severity only causes very small changes in the area ratio; therefore, its effect on the flow resistance and dynamics is very small. Beyond the SGS severity of 75%, increasing the SGS severity, leads to rapid increases of the area ratio, resulting in rapid changes in the flow resistance and dynamics.


1997 ◽  
Vol 106 (7) ◽  
pp. 594-598 ◽  
Author(s):  
Sina Nasri ◽  
Joel A. Sercarz ◽  
Pouneh Beizai ◽  
Young-Mo Kim ◽  
Ming Ye ◽  
...  

The neuroanatomy of the larynx was explored in seven dogs to assess whether there is motor innervation to the thyroarytenoid (TA) muscle from the external division of the superior laryngeal nerve (ExSLN). In 3 animals, such innervation was identified. Electrical stimulation of microelectrodes applied to the ExSLN resulted in contraction of the TA muscle, indicating that this nerve is motor in function. This was confirmed by electromyographic recordings from the TA muscle. Videolaryngostroboscopy revealed improvement in vocal fold vibration following stimulation of the ExSLN compared to without it. Previously, the TA muscle was thought to be innervated solely by the recurrent laryngeal nerve. This additional pathway from the ExSLN to the TA muscle may have important clinical implications in the treatment of neurologic laryngeal disorders such as adductor spasmodic dysphonia.


1980 ◽  
Vol 83 (2) ◽  
pp. 201-212
Author(s):  
KATSUTAKE HAYASAKI

Sign in / Sign up

Export Citation Format

Share Document