A Quantitative Output-Cost Ratio in Voice Production

2001 ◽  
Vol 44 (1) ◽  
pp. 29-37 ◽  
Author(s):  
David A. Berry ◽  
Katherine Verdolini ◽  
Douglas W. Montequin ◽  
Markus M. Hess ◽  
Roger W. Chan ◽  
...  

A quantitative output-cost ratio (OCR) is proposed for objective use in voice production and is defined as the ratio of the acoustic output intensity to the collision intensity of the vocal folds. Measurement of the OCR is demonstrated in a laboratory experiment using 5 excised larynges and a transducer designed for use on human subjects. Data were gathered at constant fundamental frequency (150 Hz). Subglottal pressure was varied from 1.0 to 1.6 kPa, and glottal width at the vocal processes was varied from a pressed condition to a 2-mm gap. The OCR was plotted as a function of glottal width. With no vocal tract, the excised larynx experiments yielded a broad maxima in the OCR curves, across all subglottal pressure conditions, at about 0.6 mm. Computer simulations indicate that sharper maxima may occur when the influence of the vocal tract is taken into account. The potential clinical utility of the OCR is discussed for treatment of a wide range of voice disorders, including those involving both hyper- and hypoadduction.

2021 ◽  
Vol 11 (4) ◽  
pp. 1970
Author(s):  
Martin Lasota ◽  
Petr Šidlof ◽  
Manfred Kaltenbacher ◽  
Stefan Schoder

In an aeroacoustic simulation of human voice production, the effect of the sub-grid scale (SGS) model on the acoustic spectrum was investigated. In the first step, incompressible airflow in a 3D model of larynx with vocal folds undergoing prescribed two-degree-of-freedom oscillation was simulated by laminar and Large-Eddy Simulations (LES), using the One-Equation and Wall-Adaptive Local-Eddy (WALE) SGS models. Second, the aeroacoustic sources and the sound propagation in a domain composed of the larynx and vocal tract were computed by the Perturbed Convective Wave Equation (PCWE) for vowels [u:] and [i:]. The results show that the SGS model has a significant impact not only on the flow field, but also on the spectrum of the sound sampled 1 cm downstream of the lips. With the WALE model, which is known to handle the near-wall and high-shear regions more precisely, the simulations predict significantly higher peak volumetric flow rates of air than those of the One-Equation model, only slightly lower than the laminar simulation. The usage of the WALE SGS model also results in higher sound pressure levels of the higher harmonic frequencies.


Author(s):  
Byron D. Erath ◽  
Matías Zañartu ◽  
Sean D. Peterson ◽  
Michael W. Plesniak

Voiced speech is initiated as air is expelled from the lungs and passes through the vocal tract inciting self-sustained oscillations of the vocal folds. While various approaches exist for investigating both normal and pathological speech, the relative inaccessibility of the vocal folds make multi-mass speech models an attractive alternative. Their behavior has been benchmarked with excised larynx experiments, and they have been used as analysis tools for both normal and disordered speech, including investigations of paralysis, vocal tremor, and breathiness. However, during pathological speech, vocal fold motion is often unstructured, resulting in chaotic motion and a wealth of nonlinear phenomena. Unfortunately, current methodologies for multi-mass speech models are unable to replicate the nonlinear vocal fold behavior that often occurs in physiological diseased voice for realistic values of subglottal pressure.


2004 ◽  
Author(s):  
Michael Barry

The design and testing of an experimental apparatus for in vitro study of phonatory aerodynamics (voice production) in humans is presented. The presentation includes not only the details of apparatus design, but flow visualization and Digital Particle Image Velocimetry (DPIV) measurements of the developing flow that occurs during the opening of the constriction from complete closure. The main features of the phonation process have long been understood. A proper combination of air flow from the lungs and of vocal fold tension initiates a vibration of the vocal folds, which in turn valves the airflow. The resulting periodic acceleration of the airstream through the glottis excites the acoustic modes of the vocal tract. It is further understood that the pressure gradient driving glottal flow is related to flow separation on the downstream side of the vocal folds. However, the details of this process and how it may contribute to effects such as aperiodicity of the voice and energy losses in voiced sound production are still not fully grasped. The experimental apparatus described in this paper is designed to address these issues. The apparatus itself consists of a scaled-up duct in which water flows through a constriction whose width is modulated by motion of the duct wall in a manner mimicking vocal fold vibration. Scaling the duct up 10 times and using water as the working fluid allows temporally and spatially resolved measurements of the dynamically similar flow velocity field using DPIV at video standard framing rates (15Hz). Dynamic similarity is ensured by matching the Reynolds number (based on glottal flow speed and glottis width) of 8000, and by varying the Strouhal number (based on vocal fold length, glottal flow speed, and a time scale characterizing the motion of the vocal folds) ranging from 0.01 to 0.1. The walls of the 28 cm × 28 cm test section and the vocal fold pieces are made of clear cast acrylic to allow optical access. The vocal fold pieces are 12.7 cm × 14 cm × 28 cm and are rectangular in shape, except for the surfaces which form the glottis, which are 6.35 cm radius half-circles. Dye injection slots are placed on the upstream side of both vocal field pieces to allow flow visualization. Prescribed motion of the vocal folds is provided by two linear stages. Linear bearings ensure smooth execution of the motion prescribed using a computer interface. Measurements described here use the Laser-Induced Fluorescence (LIF) flow visualization and DPIV techniques and are performed for two Strouhal numbers to assess the effect of opening time on the development of the glottal jet. These measurements are conducted on a plane oriented perpendicular to the glottis, at the duct midplane. LIF measurements use a 5W Argon ion laser to produce a light sheet, which illuminates the dye injected through a slot in each vocal fold piece. Two dye colors are used, one for each side. Quantitative information about the velocity and vorticity fields are obtained through DPIV measurements at the same location as the LIF measurements.


2019 ◽  
Vol 9 (9) ◽  
pp. 1963 ◽  
Author(s):  
Fabian Thornton ◽  
Michael Döllinger ◽  
Stefan Kniesburges ◽  
David Berry ◽  
Christoph Alexiou ◽  
...  

Normal voice is characterized by periodic oscillations of the vocal folds. On the other hand, disordered voice dynamics (e.g., subharmonic and aperiodic oscillations) are often associated with voice pathologies and dysphonia. Unfortunately, not all investigations may be conducted on human subjects; hence animal laryngeal studies have been performed for many years to better understand human phonation. The rabbit larynx has been shown to be a potential model of the human larynx. Despite this fact, only a few studies regarding the phonatory parameters of rabbit larynges have been performed. Further, to the best of our knowledge, no ex vivo study has systematically investigated phonatory parameters from high-speed, audio and subglottal pressure data with irregular oscillations. To remedy this, the present study analyzes experiments with sustained phonation in 11 ex vivo rabbit larynges for 51 conditions of disordered vocal fold dynamics. (1) The results of this study support previous findings on non-disordered data, that the stronger the glottal closure insufficiency is during phonation, the worse the phonatory characteristics are; (2) aperiodic oscillations showed worse phonatory results than subharmonic oscillations; (3) in the presence of both types of irregular vibrations, the voice quality (i.e., cepstral peak prominence) of the audio and subglottal signal greatly deteriorated compared to normal/periodic vibrations. In summary, our results suggest that the presence of both types of irregular vibration have a major impact on voice quality and should be considered along with glottal closure measures in medical diagnosis and treatment.


Author(s):  
Johan Sundberg

The sound quality of singing is determined by three basic factors—the air pressure under the vocal folds (or the subglottal pressure), the mechanical properties of the vocal folds, and the resonance properties of the vocal tract. Subglottal pressure is controlled by the respiratory apparatus. It regulates vocal loudness and is varied with pitch in singing. Together with the mechanical properties of the folds, which are controlled by laryngeal muscles, it has a decisive influence on vocal fold vibrationswhich convert the tracheal airstream to a pulsating airflow, the voice source. The voice source determines pitch, vibrato, and register, and also the overall slope of the spectrum. The sound of the voice source is filtered by the resonances of the vocal tract, or the formants, of which the two lowest determine the vowel quality and the higher ones the personal voice quality. Timing is crucial for creating emotional expressivity; it uses an acoustic code that shows striking similarities to that used in speech. The perceived loudness of a vowel sound seems more closely related to the subglottal pressure with which it was produced than with the acoustical sound level. Some investigations of acoustical correlates of tone placement and variation of larynx height are described, as are properties that affect the perceived naturalness of synthesized singing. Finally, subglottal pressure, voice source, and formant-frequency characteristics of some non-classical styles of singing are discussed.


Author(s):  
John H. Esling

<p>The ‘laryngeal articulator,’ consisting of the glottal mechanism, the supraglottic tube, the pharyngeal/epiglottal mechanism, and including three levels of folds: the vocal folds, the ventricular folds, and the aryepiglottic folds, is shown to be responsible for the generation of multiple source vibrations and for the complex modification of the pharyngeal resonating chamber that accounts for a wide range of contrastive auditory qualities. These qualities are observed in a surprisingly large number of the languages of the world, both linguistically and paralinguistically, and they account for sounds which have been labeled as ‘pharyngeal,’ as ‘epiglottal,’ and as various phonation types. They reflect an expanding range of what have been known as the ‘states of the glottis’ and which may be more properly termed ‘states of the larynx.’ It has also been observed that infants, in their first months of life, produce a range of qualities, reflecting both phonatory possibilities and stricture types, that can also be attributed to the laryngeal articulator mechanism. Systematic observation of infants’ early speech production reveals that the control of articulatory detail in the pharynx is mastered during the first year of life. Understanding and control of manner of articulation in the pharynx appears to be a prerequisite for expanding articulatory control into the oral vocal tract. Taking the pharynx as a starting point for the ontogenetic learning of the speech production capacity offers fruitful insights into the phylogenetic development of speech.</p>


Author(s):  
Joseph R. Nielson ◽  
David J. Daily ◽  
Tadd T. Truscott ◽  
Georg Luegmair ◽  
Michael Döllinger ◽  
...  

Synthetic aperture particle image velocimetry is used with an excised human vocal fold model to study the airflow between the vocal folds during voice production. A whole field, time-resolved, 3D description of the flow is presented over multiple cycles of vocal fold oscillations. The 3D flow data are synchronized with a 3D reconstruction of the superior surface of the vocal folds and with the subglottal pressure signal.


2021 ◽  
Vol 11 (12) ◽  
pp. 5469
Author(s):  
Takeshi Ikuma ◽  
Andrew J. McWhorter ◽  
Lacey Adkins ◽  
Melda Kunduk

Pathological vocal folds are known to exhibit multiple oscillation patterns, depending on tissue imbalance, subglottal pressure level, and other factors. This includes mid-phonation changes due to bifurcations in the underlying voice source system. Knowledge of when changes in oscillation patterns occur is helpful in the assessments of voice disorders, and the knowledge could be transformed into useful objective measures. Mid-phonation bifurcations can occur in rapid succession; hence, a fast classification of oscillation pattern is critical to minimize the averaging of data across bifurcations. This paper proposes frequency-ratio based short-term measures, named harmonic disturbance factor (HDF) and biphonic index (BI), towards the detection of the bifurcations. For the evaluation of HDF and BI, a frequency selection algorithm for glottal source signals is devised, and its efficacy is demonstrated with the glottal area waveforms of four cases, representing the wide range of oscillatory behaviors. The HDF and BI exhibit clear transitions when the voice bifurcations are apparent in the spectrograms. The presented proof-of-concept experiment’s outcomes warrant a larger scale study to formalize the parameters of the frequency selection algorithm.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Hugo Lehoux ◽  
Vít Hampala ◽  
Jan G. Švec

AbstractExcised larynges serve as natural models for studying behavior of the voice source. Acoustic resonances inside the air-supplying tubes below the larynx (i.e., subglottal space), however, interact with the vibratory behavior of the larynges and obscure their inherent vibration properties. Here, we explore a newly designed anechoic subglottal space which allows removing its acoustic resonances. We performed excised larynx experiments using both anechoic and resonant subglottal spaces in order to analyze and compare, for the very first time, the corresponding subglottal pressures, electroglottographic and radiated acoustic waveforms. In contrast to the resonant conditions, the anechoic subglottal pressure waveforms showed negligible oscillations during the vocal fold contact phase, as expected. When inverted, these waveforms closely matched the inverse filtered radiated sound waveforms. Subglottal resonances modified also the radiated sound pressures (Level 1 interactions). Furthermore, they changed the fundamental frequency (fo) of the vocal fold oscillations and offset phonation threshold pressures (Level 2 interactions), even for subglottal resonance frequencies 4–10 times higher than fo. The obtained data offer the basis for better understanding the inherent vibratory properties of the vocal folds, for studying the impact of structure-acoustic interactions on voice, and for validation of computational models of voice production.


Linguistics ◽  
2020 ◽  
Author(s):  
Jody Kreiman

The sound of a voice—its quality—plays an integral role in the biological and social existences of animal species ranging from frogs to birds to elephants to primates and humans. Across animal species, voice plays a part in many, many aspects of behavior, including mate selection and attraction, social organization, identification of parent/child/significant others, signaling of emotion and aggressive intent, and providing information about health, age, sex, and size. Voice quality is a critical component of acting, oratory, singing, motivating and persuading others, and projecting a likeable image across space (and of course is essential to the production of spoken language). The study of voice quality is thus by its very nature inherently interdisciplinary, to an extent that can create a large burden of scholarship on those wishing to understand not just what sounds occur, but why and how they carry the messages they do. The study of voice quality also depends critically on understanding voice production and acoustics. Biologically, a voice necessarily reflects the body that produced it—the size and shape of the vocal folds and vocal tract, patterns of articulation, and so on—and perception has co-evolved with those phonating bodies, so it functions as it does to take advantage of the information on offer about the speaker. In the same way, speakers produce sound in order to communicate with or influence listeners, so the voice production apparatus has evolved to produce sounds listeners can hear and evaluate, and to send biologically relevant messages. Because voice production and perception are inseparably intertwined in this way, this bibliography includes references describing voice production as well as voice perception. Beyond these fundamentals, the study of voice quality includes work on perception of personal attributes (identity, age, size, race, sex, and so on), expression and perception of emotion and personality, linguistic uses of changes in quality, and a host of other research areas. This bibliography provides representative and/or foundational studies in many of these areas, as an invitation and a gateway to further exploration.


Sign in / Sign up

Export Citation Format

Share Document