Simulation of the coupling between vocal-fold vibration and time-varying vocal tract

2011 ◽  
Vol 130 (4) ◽  
pp. 2441-2441
Author(s):  
Yosuke Tanabe ◽  
Parham Mokhtari ◽  
Hironori Takemoto ◽  
Tatsuya Kitamura
Author(s):  
Johan Sundberg

The function of the voice organ is basically the same in classical singing as in speech. However, loud orchestral accompaniment has necessitated the use of the voice in an economical way. As a consequence, the vowel sounds tend to deviate considerably from those in speech. Male voices cluster formant three, four, and five, so that a marked peak is produced in spectrum envelope near 3,000 Hz. This helps them to get heard through a loud orchestral accompaniment. They seem to achieve this effect by widening the lower pharynx, which makes the vowels more centralized than in speech. Singers often sing at fundamental frequencies higher than the normal first formant frequency of the vowel in the lyrics. In such cases they raise the first formant frequency so that it gets somewhat higher than the fundamental frequency. This is achieved by reducing the degree of vocal tract constriction or by widening the lip and jaw openings, constricting the vocal tract in the pharyngeal end and widening it in the mouth. These deviations from speech cause difficulties in vowel identification, particularly at high fundamental frequencies. Actually, vowel identification is almost impossible above 700 Hz (pitch F5). Another great difference between vocal sound produced in speech and the classical singing tradition concerns female voices, which need to reduce the timbral differences between voice registers. Females normally speak in modal or chest register, and the transition to falsetto tends to happen somewhere above 350 Hz. The great timbral differences between these registers are avoided by establishing control over the register function, that is, over the vocal fold vibration characteristics, so that seamless transitions are achieved. In many other respects, there are more or less close similarities between speech and singing. Thus, marking phrase structure, emphasizing important events, and emotional coloring are common principles, which may make vocal artists deviate considerably from the score’s nominal description of fundamental frequency and syllable duration.


Author(s):  
Christine Ericsdotter Nordgren

Speech sounds are commonly divided into two main categories in human languages: vowels, such as ‘e’, ‘a’, ‘o’, and consonants, such as ‘k’, ‘n’, ‘s’. This division is made on the basis of both phonetic and phonological principles, which is useful from a general linguistic point of view but problematic for detailed description and analysis. The main differences between vowels and consonants are that (1) vowels are sounds produced with an open airway between the larynx and the lips, at least along the midline, whereas consonants are produced with a stricture or closure somewhere along it; and (2) that vowels tend to be syllabic in languages, meaning that they embody a sonorous peak in a syllable, whereas only some kinds of consonants tend to be syllabic. There are two main physical components needed to produce a vowel: a sound source, typically a tone produced by vocal fold vibration at the larynx, and a resonator, typically the upper airways. When the tone resonates in the upper airways, it gets a specific quality of sound, perceived and interpreted as a vowel quality, for example, ‘e’ or ‘a’. Which vowel quality is produced is determined by the shape of the inner space of the throat and mouth, the vocal tract shape, created by the speaker’s configuration of the articulators, which include the lips, tongue, jaw, hard and soft palate, pharynx, and larynx. Which vowel is perceived is determined by the auditory and visual input as well as by the listener’s expectations and language experience. Diachronic and synchronic studies on vowel typology show main trends in the vowel inventories in the worlds’ languages, which can be associated with human phonetic aptitude.


2021 ◽  
Vol 26 (4) ◽  
pp. 921-932
Author(s):  
Ji Sung Kim ◽  
Seong Hee Choi ◽  
Kyoungjae Lee ◽  
Chul-Hee Choi ◽  
Soo-Geun Wang ◽  
...  

Objectives: The purpose of this study is to investigate the characteristics of vocal fold vibration during sustained vowel /a/ phonation and various semi-occluded vocal tract exercise (SOVTEs) using a vibration simulator and digital kymography (DKG).Methods: A total of 12 normal young speakers (6 males, 6 females) aged 20-30 years participated in the study. They phonated a sustained /a/ vowel and performed SOVTE. The vocal fold vibration characteristics were measured according to the number of vibration sources (single vs. double), and vocal tract occlusion degree using a vibration simulator and DKG. Glottal gap quotient (GQ, %), speed quotient (SQ, %) and amplitude (pixel) were estimated quantitatively from the DKG image.Results: The results showed that significantly higher GQ (p = .000) and SQ (p = .000) were observed in the humming and bilabial fricative /β/ compared to open vowels. The amplitude was significantly higher in the open vowel /a/ than in humming (p = .018) and bilabial fricative /β/ (p = .003). Also, when comparing the vocal fold vibration parameters according to vibration type (single source: straw phonation vs. double source: straw phonation with water), the double source presented a significantly higher GQ (p = .000) as well as SQ (p = .008) in comparison with a single source.Conclusion: SOVTE showed a glottal gap that is different from the opened vowel /a/. It also had a longer opening of the vocal fold and a smaller amplitude than the vowel. This suggests that SOVTE may be helpful for facilitating vocal fold vibration and good voice quality in clinical practice. The current study can be meaningful in providing theoretical and clinical evidence for SOVTE.


2014 ◽  
Vol 24 (2) ◽  
pp. 71-79 ◽  
Author(s):  
Marci D. Rosenberg

Semi-occluded vocal tract (SOVT) exercises have long been used by voice trainers and pedagogues and have been particularly popular in Scandinavia dating as far back as the 1800s. Titze (1988, 1994, 2006; Titze, Riede, & Popolo, 2008; Titze & Verdolini-Abbot, 2012) has contributed significantly to the exploration of the SOVT and impact on voice production, and these types of exercise have become ubiquitous in the clinical voice arena. Although SOVT exercises are commonly used, there continue to be questions about the exact nature of how they impact phonation and improved vocal economy. This article aims to explore the physiology of a SOVT on vocal fold vibration and vocal output. Several variations are described within context of recent research.


2005 ◽  
Vol 48 (3) ◽  
pp. 567-576 ◽  
Author(s):  
Heather E. Gunter ◽  
Robert D. Howe ◽  
Steven M. Zeitels ◽  
James B. Kobler ◽  
Robert E. Hillman

Forces applied to vocal fold tissue as the vocal folds collide may cause tissue injury that manifests as benign organic lesions. A novel method for measuring this quantity in humans in vivo uses a low-profile force sensor that extends along the length and depth of the glottis. Sensor design facilitates its placement and stabilization so that phonation can be initiated and maintained while it is in place, with minimal interference in vocal fold vibration. In 2 individuals with 1 vibrating vocal fold and 1 nonvibrating vocal fold, peak collision force correlates more strongly with voice intensity than pitch. Vocal fold collision forces in 1 individual with 2 vibrating vocal folds are of the same order of magnitude as in previous studies. Correlations among peak collision force, voice intensity, and pitch were indeterminate in this participant because of the small number of data points. Sensor modifications are proposed so that it can be used to reliably estimate collision force in individuals with 2 vibrating vocal folds and with changing vocal tract conformations.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Wioletta Pietruszewska ◽  
Marcin Just ◽  
Joanna Morawska ◽  
Jakub Malinowski ◽  
Joanna Hoffman ◽  
...  

AbstractHigh-Speed Videoendoscopy (HSV) is becoming a robust tool for the assessment of vocal fold vibration in laboratory investigation and clinical practice. We describe the first successful application of flexible High Speed Videoendoscopy with innovative laser light source conducted in clinical settings. The acquired image and simultaneously recorded audio data are compared to the results obtained by means of a rigid endoscope. We demonstrated that the HSV recordings with fiber-optic laryngoscope have enabled obtaining consistently bright, color images suitable for parametrization of vocal fold oscillation similarly as in the case of the HSV data obtained from a rigid laryngoscope. The comparison of period and amplitude perturbation parameters calculated on the basis of image and audio data acquired from flexible and rigid HSV recording objectively confirm that flexible High-Speed Videoendoscopy is a more suitable method for examination of natural phonation. The HSV-based measures generated from this kymographic analysis are arguably a superior representation of the vocal fold vibrations than the acoustic analysis because their quantification is independent of the vocal tract influences. This experimental study has several implications for further research in the field of HSV application in clinical assessment of glottal pathologies nature and its effect on vocal folds vibrations.


2020 ◽  
Vol 63 (1) ◽  
pp. 109-124
Author(s):  
Carly Jo Hosbach-Cannon ◽  
Soren Y. Lowell ◽  
Raymond H. Colton ◽  
Richard T. Kelley ◽  
Xue Bao

Purpose To advance our current knowledge of singer physiology by using ultrasonography in combination with acoustic measures to compare physiological differences between musical theater (MT) and opera (OP) singers under controlled phonation conditions. Primary objectives addressed in this study were (a) to determine if differences in hyolaryngeal and vocal fold contact dynamics occur between two professional voice populations (MT and OP) during singing tasks and (b) to determine if differences occur between MT and OP singers in oral configuration and associated acoustic resonance during singing tasks. Method Twenty-one singers (10 MT and 11 OP) were included. All participants were currently enrolled in a music program. Experimental procedures consisted of sustained phonation on the vowels /i/ and /ɑ/ during both a low-pitch task and a high-pitch task. Measures of hyolaryngeal elevation, tongue height, and tongue advancement were assessed using ultrasonography. Vocal fold contact dynamics were measured using electroglottography. Simultaneous acoustic recordings were obtained during all ultrasonography procedures for analysis of the first two formant frequencies. Results Significant oral configuration differences, reflected by measures of tongue height and tongue advancement, were seen between groups. Measures of acoustic resonance also showed significant differences between groups during specific tasks. Both singer groups significantly raised their hyoid position when singing high-pitched vowels, but hyoid elevation was not statistically different between groups. Likewise, vocal fold contact dynamics did not significantly differentiate the two singer groups. Conclusions These findings suggest that, under controlled phonation conditions, MT singers alter their oral configuration and achieve differing resultant formants as compared with OP singers. Because singers are at a high risk of developing a voice disorder, understanding how these two groups of singers adjust their vocal tract configuration during their specific singing genre may help to identify risky vocal behavior and provide a basis for prevention of voice disorders.


2021 ◽  
Vol 11 (3) ◽  
pp. 1221
Author(s):  
Dariush Bodaghi ◽  
Qian Xue ◽  
Xudong Zheng ◽  
Scott Thomson

An in-house 3D fluid–structure–acoustic interaction numerical solver was employed to investigate the effect of subglottic stenosis (SGS) on dynamics of glottal flow, vocal fold vibration and acoustics during voice production. The investigation focused on two SGS properties, including severity defined as the percentage of area reduction and location. The results show that SGS affects voice production only when its severity is beyond a threshold, which is at 75% for the glottal flow rate and acoustics, and at 90% for the vocal fold vibrations. Beyond the threshold, the flow rate, vocal fold vibration amplitude and vocal efficiency decrease rapidly with SGS severity, while the skewness quotient, vibration frequency, signal-to-noise ratio and vocal intensity decrease slightly, and the open quotient increases slightly. Changing the location of SGS shows no effect on the dynamics. Further analysis reveals that the effect of SGS on the dynamics is primarily due to its effect on the flow resistance in the entire airway, which is found to be related to the area ratio of glottis to SGS. Below the SGS severity of 75%, which corresponds to an area ratio of glottis to SGS of 0.1, changing the SGS severity only causes very small changes in the area ratio; therefore, its effect on the flow resistance and dynamics is very small. Beyond the SGS severity of 75%, increasing the SGS severity, leads to rapid increases of the area ratio, resulting in rapid changes in the flow resistance and dynamics.


1997 ◽  
Vol 106 (7) ◽  
pp. 594-598 ◽  
Author(s):  
Sina Nasri ◽  
Joel A. Sercarz ◽  
Pouneh Beizai ◽  
Young-Mo Kim ◽  
Ming Ye ◽  
...  

The neuroanatomy of the larynx was explored in seven dogs to assess whether there is motor innervation to the thyroarytenoid (TA) muscle from the external division of the superior laryngeal nerve (ExSLN). In 3 animals, such innervation was identified. Electrical stimulation of microelectrodes applied to the ExSLN resulted in contraction of the TA muscle, indicating that this nerve is motor in function. This was confirmed by electromyographic recordings from the TA muscle. Videolaryngostroboscopy revealed improvement in vocal fold vibration following stimulation of the ExSLN compared to without it. Previously, the TA muscle was thought to be innervated solely by the recurrent laryngeal nerve. This additional pathway from the ExSLN to the TA muscle may have important clinical implications in the treatment of neurologic laryngeal disorders such as adductor spasmodic dysphonia.


1980 ◽  
Vol 83 (2) ◽  
pp. 201-212
Author(s):  
KATSUTAKE HAYASAKI

Sign in / Sign up

Export Citation Format

Share Document