The Vocal Tract in Singing

Author(s):  
Brad Story

Precise control of the vocal tract configuration is of critical importance for producing the desired acoustic characteristics of singing. The pattern of acoustic resonances generated by a given vocal tract shape influences vowel identity, voice quality (timbre), and, to some degree, the spectral characteristics of the voice excitation source itself. This chapter is broadly focused on how the vocal tract shape can be tuned (i.e., modified) in subtle ways to enhance the signal radiated from a singer to an audience. In particular, the vocal tract shape contributions to the “singing formant,” the enhancement of vibrato, and harmonic/formant alignment are discussed.

Author(s):  
Jesús Bernardino Alonso Hernández ◽  
Patricia Henríquez Rodríguez

It is possible to implement help systems for diagnosis oriented to the evaluation of the fonator system using speech signal, by means of techniques based on expert systems. The application of these techniques allows the early detection of alterations in the fonator system or the temporary evaluation of patients with certain treatment, to mention some examples. The procedure of measuring the voice quality of a speaker from a digital recording consists of quantifying different acoustic characteristics of speech, which makes it possible to compare it with certain reference patterns, identified previously by a “clinical expert”. A speech acoustic quality measurement based on an auditory assessment is very hard to assess as a comparative reference amongst different voices and different human experts carrying out the assessment or evaluation. In the current bibliography, some attempts have been made to obtain objective measures of speech quality by means of multidimensional clinical measurements based on auditory methods. Well-known examples are: GRBAS scale from Japon (Hirano, M.,1981) and its extension developed and applied in Europe (Dejonckere, P. H. Remacle, M. Fresnel-Elbaz, E. Woisard, V. Crevier- Buchman, L. Millet, B.,1996), a set of perceptual and acoustic characteristics in Sweden (Hammarberg, B. & Gauffin, J., 1995), a set of phonetics characteristics with added information about the excitement of the vocal tract. The aim of these (quality speech measurements) procedures is to obtain an objective measurement from a subjective evaluation. There exist different works in which objective measurements of speech quality obtained from a recording are proposed (Alonso J. B.,2006), (Boyanov, B & Hadjitodorov, S., 1997),(Hansen, J.H.L., Gavidia-Ceballos, L. & Kaiser, J.F., 1998),(Stefan Hadjitodorov & Petar Mitev, 2002),(Michaelis D.; Frohlich M. & Strube H. W. ,1998),(Boyanov B., Doskov D., Mitev P., Hadjitodorov S. & Teston B.,2000),(Godino-Llorente, J.I.; Aguilera-Navarro, S. & Gomez-Vilda, P. , 2000). In these works a voiced sustained sound (usually a vowel) is recorded and then used to compute speech quality measurements. The utilization of a voiced sustained sound is due to the fact that during the production of this kind of sound, the speech system uses almost all its mechanisms (glottal flow of constant air, vocal folds vibration in a continuous way, …), enabling us to detect any anomaly in these mechanisms. In these works different sets of measurements are suggested in order to quantify speech quality objectively. In all these works one important fact is revealed; it is necessary to obtain different measurements of the speech signal in order to compile the different aspects of acoustic characteristics of the speech signal.


2018 ◽  
Vol 7 (4) ◽  
pp. 1-7
Author(s):  
Anna Kuligowska ◽  
Barbara Jamróz ◽  
Joanna Chmielewska ◽  
Katarzyna Jędra ◽  
Tomasz Czernicki ◽  
...  

Aim of study: Evaluation of the speech therapy on voice quality in patients with unilateral vocal fold palsy. Material and methods: The study group included 11 patients, 8 women and 3 men, in age between 16 to 72 years, with unilateral vocal fold palsy, diagnosed in ENT Department of Warsaw Medical University between 2017-2018. Each person completed questionnaires: the voice disability self-assessment scale (VHI), the voice-based quality of life (VRQoL) scale, the vocal tract discomfort scale (VTD). All questionnaires were completed twice, before and after the voice therapy. In addition, the acoustic analysis of the voice, the assessment of the maximum phonation time and the breathing tract were performed twice in each patient. Each of the patients had a voice rehabilitation consisting of a series of 10 meetings. Results: Statistical analysis of the results of maximum phonation time, the self-assessment of voice disability, the quality of life depending on the voice, discomfort of the vocal tract voice acoustic analysis showed statistically significant differences in the results before and after rehabilitation (p <0.005). In addition, the improvement of the respiratory tract was observed in the majority of patients. Conclusions: Speech therapy significantly affects the voice quality of patients with unilateral laryngeal nerve palsy.


1976 ◽  
Vol 19 (1) ◽  
pp. 168-180 ◽  
Author(s):  
Ralph O. Coleman

Comparisons were made between the contributions of the fundamental frequency (F 0 ) on one hand, and vocal tract resonances on the other, to a perception of maleness and femaleness in the adult voice. In the first of two experiments, the F 0 of natural voice was found to be very highly correlated with the degree of maleness and femalenesss in the voice. The vocal tract resonances were less highly correlated and it is apparent that in the presence of the natural laryngeal tone, these perceptions are based on the frequency of the F 0 . In the second experiment, a tone produced by a laryngeal vibrator was substituted for the normal glottal tone at simulated F 0 's representing both males (120 Hz) and females (240 Hz). When listeners were asked to identify the sex of the speakers some inconsistency with the findings of the first experiment was seen. The female F 0 was a weak indicator of female voice quality when combined with male vocal tract resonance although the male F 0 retained the perceptual prominence seen in the first experiment. This finding may be indicative of some basic difference in the normal glottal characteristics of males and females.


2011 ◽  
pp. 1008-1016
Author(s):  
Jesús Bernardino Alonso Hernández ◽  
Patricia Henríquez Rodríguez

It is possible to implement help systems for diagnosis oriented to the evaluation of the fonator system using speech signal, by means of techniques based on expert systems. The application of these techniques allows the early detection of alterations in the fonator system or the temporary evaluation of patients with certain treatment, to mention some examples. The procedure of measuring the voice quality of a speaker from a digital recording consists of quantifying different acoustic characteristics of speech, which makes it possible to compare it with certain reference patterns, identified previously by a “clinical expert”. A speech acoustic quality measurement based on an auditory assessment is very hard to assess as a comparative reference amongst different voices and different human experts carrying out the assessment or evaluation. In the current bibliography, some attempts have been made to obtain objective measures of speech quality by means of multidimensional clinical measurements based on auditory methods. Well-known examples are: GRBAS scale from Japon (Hirano, M.,1981) and its extension developed and applied in Europe (Dejonckere, P. H. Remacle, M. Fresnel-Elbaz, E. Woisard, V. Crevier- Buchman, L. Millet, B.,1996), a set of perceptual and acoustic characteristics in Sweden (Hammarberg, B. & Gauffin, J., 1995), a set of phonetics characteristics with added information about the excitement of the vocal tract. The aim of these (quality speech measurements) procedures is to obtain an objective measurement from a subjective evaluation. There exist different works in which objective measurements of speech quality obtained from a recording are proposed (Alonso J. B.,2006), (Boyanov, B & Hadjitodorov, S., 1997),(Hansen, J.H.L., Gavidia-Ceballos, L. & Kaiser, J.F., 1998),(Stefan Hadjitodorov & Petar Mitev, 2002),(Michaelis D.; Frohlich M. & Strube H. W. ,1998),(Boyanov B., Doskov D., Mitev P., Hadjitodorov S. & Teston B.,2000),(Godino-Llorente, J.I.; Aguilera-Navarro, S. & Gomez-Vilda, P. , 2000). In these works a voiced sustained sound (usually a vowel) is recorded and then used to compute speech quality measurements. The utilization of a voiced sustained sound is due to the fact that during the production of this kind of sound, the speech system uses almost all its mechanisms (glottal flow of constant air, vocal folds vibration in a continuous way, …), enabling us to detect any anomaly in these mechanisms. In these works different sets of measurements are suggested in order to quantify speech quality objectively. In all these works one important fact is revealed; it is necessary to obtain different measurements of the speech signal in order to compile the different aspects of acoustic characteristics of the speech signal.


1998 ◽  
Vol 104 (3) ◽  
pp. 1805-1805
Author(s):  
Brad H. Story ◽  
Ingo R. Titze ◽  
Eric A. Hoffman

Author(s):  
Johan Sundberg

The sound quality of singing is determined by three basic factors—the air pressure under the vocal folds (or the subglottal pressure), the mechanical properties of the vocal folds, and the resonance properties of the vocal tract. Subglottal pressure is controlled by the respiratory apparatus. It regulates vocal loudness and is varied with pitch in singing. Together with the mechanical properties of the folds, which are controlled by laryngeal muscles, it has a decisive influence on vocal fold vibrationswhich convert the tracheal airstream to a pulsating airflow, the voice source. The voice source determines pitch, vibrato, and register, and also the overall slope of the spectrum. The sound of the voice source is filtered by the resonances of the vocal tract, or the formants, of which the two lowest determine the vowel quality and the higher ones the personal voice quality. Timing is crucial for creating emotional expressivity; it uses an acoustic code that shows striking similarities to that used in speech. The perceived loudness of a vowel sound seems more closely related to the subglottal pressure with which it was produced than with the acoustical sound level. Some investigations of acoustical correlates of tone placement and variation of larynx height are described, as are properties that affect the perceived naturalness of synthesized singing. Finally, subglottal pressure, voice source, and formant-frequency characteristics of some non-classical styles of singing are discussed.


1987 ◽  
Vol 96 (5) ◽  
pp. 573-577 ◽  
Author(s):  
Colin Painter ◽  
John M. Fredrickson ◽  
Timothy Kaiser ◽  
Roanne Karzon

An electromagnetic artificial larynx was implanted in two volunteer laryngectomees. Both patients were able to communicate well, but the voice quality still needed improving. Therefore, in this investigation, listener judgments were obtained of 22 different sound sources with a view to incorporating the preferred speech sound in a new version of the device. Electroglottograms were used as sound sources in a speech synthesizer and sentences were produced with different voice qualities for judgmental tests. The results of the listening tests showed a distinct preference for waveforms corresponding to a long completely open phase, a very brief completely closed phase, and an abrupt closing gesture. The optimum acoustic characteristics for the device will be used by electrical engineers to manufacture a new version of the artificial larynx with an improved voice quality.


2021 ◽  
Vol 75 (5) ◽  
pp. 1-8
Author(s):  
Piotr Bryk ◽  
Stanislaw Głuszek

Voice dysfunction is the most common complication of thyroid surgery. The use of intraoperative neuromonitoring (IONM) is to protect the recurrent laryngeal nerves, the damage of which causes voice dysfunction. The aim of the study was to evaluate voice quality in patients who underwent complete thyroidectomy operated on with the application of IONM as well as a group of patients operated on with only macroscopic nerve visualization. In the analysis, clinical voice assessment was performed with particular focus on voice efficiency using the Voice Handicap Index (VHI), Vocal Tract Discomfort (VTD) and GRBAS scale. The study group consisted of 205 patients operated on with IONM. The control group consisted of 162 patients subjected to surgery only with macroscopic visualization of recurrent laryngeal nerves, without IONM. During the follow-up period from 2 to 10 years after surgery, checkups were performed. Each patient who came for a checkup was subjected to perceptual voice evaluation with the use of the GRBAS scale, indirect laryngoscopy procedure and voice selfevaluation with two questionnaires (VHI and VTD). The frequency of vocal fold palsy did not differ significantly statistically in the study group and the control group. Both in the study group and in the control group, patients with vocal fold paralysis had statistically significantly higher results in the VHI and VTD questionnaires as well as in the GRBAS study. Patients with recurrent laryngeal nerve injury show significant differences in the scope of voice handicap, both in the voice quality assessment with the use of the GRBAS scale, and self-evaluation questionnaires: VHI and VTD. All voice disorders evaluated with self-assessment are medium voice disability.


2020 ◽  
Vol 8 (1) ◽  
pp. 15-26
Author(s):  
Zuleica Camargo ◽  
Paula da Costa Canton

Children with a disordered lingual frenulum are said to be more prone to various difficulties in the development process. Although these children tend to present changes in chewing, swallowing and speech, it can be assumed that the voice is practically not focused, with few studies concerned on vocal function (Camargo et al, 2017, 2017a). With the aim of investigating voice quality in children with and without lingual frenulum disorder, 61 children were clinically evaluated regarding the characteristics of the lingual frenulum (Marchesan, 2012) and the perceptual judgments of voice quality in semi-spontaneous (audio) speech samples. Video samples of orofacial clinical examinations and semi-spontaneous (audio) speech samples were recorded in a soundproof room. 28 children presented altered lingual frenulum (19 boys and 09 girls) and 33 frenulum without alterations (20 boys and 13 girls). We adopted the phonetic description of voice quality model as the theoretical background (Laver, 1980) and the Vocal Profile Analysis Scheme – VPAS (Laver et al, 1981; adapted for brazilian portuguese: Camargo, Madureira, 2008) for perceptual data analysis. Voice quality settings regarding the position of lips, tongue (tip and body), jaw, pharyngeal cavity configuration and vocal tract muscle tension occurred in a greater proportion (in terms of number of the occurrences and the degree of manifestation) in children with lingual frenulum disorded. When observing the data differently by male and female subgroups, we estimate that the former accounted for most of the occurrences related to labiodentalization, retracted and lowered tongue body and pharyngeal constriction. In the girls' subgroup, pharyngeal constriction was also recurrent. With regard to our previous explorations on the topic (Camargo et al, 2017, 2017a), we were able to advance in terms of a broader understanding of children's voice quality profiles and, particularly, those related to congenital orofacial motricity limitations. The relationships between size and position of the hyoid bone in children with an altered frenulum of the tongue (Ardekani et al, 2016), especially the raised and posterior position of the hyoid bone and the posteriorization of the jaw compared to children without frenulum disorerss, seem to influence the voice quality findings reported in this study. The high incidence rates of dysphonia in childhood tend to make it difficult to associate the alterations of the lingual frenulum to phonatory settings, although they may, hypothetically, appear as aggravating factors for dysphonia.


Sign in / Sign up

Export Citation Format

Share Document