Articulatory Measures of Prosody

Author(s):  
Taehong Cho ◽  
Doris Mücke

Prosodic research in speech production usually focuses on the way the prosodic structure influences the phonetic implementation of segmental and suprasegmental features. The realization of a tone, for instance, involves not only dynamic changes so as to regulate the vocal fold vibration to produce f0 contours, but also the movement of articulators to simultaneously produce consonants and vowels. Articulatory measuring techniques help us to directly observe how these two systems are coordinated in the spatio-temporal dimension. A number of such techniques are discussed, along with examples indicating how each technique may be or has been used to study various aspects of prosody. They include laryngoscopy and electroglottography to examine laryngeal events associated with vocal fold vibration; systems such as electromagnetic articulography, an optoelectronic device, electropalatography, and ultrasound systems to explore supralaryngeal articulatory events; and aerodynamic measurement systems to record oral/subglottal pressure and oral/nasal flow.

1997 ◽  
Vol 106 (7) ◽  
pp. 533-543 ◽  
Author(s):  
Steven M. Zeitels ◽  
Glenn W. Bunting ◽  
Robert E. Hillman ◽  
Traci Vaughn

Reinke's edema (RE) has been associated typically with smoking and sometimes with vocal abuse, but aspects of the pathophysiology of RE remain unclear. To gain new insights into phonatory mechanisms associated with RE pathophysiology, weused an integrated battery of objective vocal function tests to analyze 20 patients (19 women) who underwent phonomicrosurgical resection. Preoperative stroboscopic examinations demonstrated that the superficial lamina propria is distended primarily on the superior vocal fold surface. Acoustically, these individuals have an abnormally low average speaking fundamental frequency (123 Hz), and they generate abnormally high average subglottal pressures (9.7 cm H20). The presence of elevated aerodynamic driving pressures reflects difficulties in producing vocal fold vibration that are most likely the result of mass loading associated with RE, and possibly vocal hyperfunction. Furthermore, it is hypothesized that in the environment of chronic glottal mucositis secondary to smoking and reflux, the cephalad force on the vocal folds by the subglottal driving pressure contributes to the superior distention of the superficial lamina propria. Surgical reduction of the volume of the superficial lamina propria resulted in a significant elevation in fundamental frequency (154 Hz) and improvement in perturbation measures. In almost all instances, both the clinician and the patient perceived the voice as improved. However, these patients continued to generate elevated subglottal pressure (probably a sign of persistent hyperfunction) that was accompanied by visually observed supraglottal strain despite the normalsized vocal folds. This finding suggests that persistent hyperfunctional vocal behaviors may contribute to postsurgical RE recurrence if therapeutic strategies are not instituted to modify such behavior.


2021 ◽  
Vol 11 (3) ◽  
pp. 1221
Author(s):  
Dariush Bodaghi ◽  
Qian Xue ◽  
Xudong Zheng ◽  
Scott Thomson

An in-house 3D fluid–structure–acoustic interaction numerical solver was employed to investigate the effect of subglottic stenosis (SGS) on dynamics of glottal flow, vocal fold vibration and acoustics during voice production. The investigation focused on two SGS properties, including severity defined as the percentage of area reduction and location. The results show that SGS affects voice production only when its severity is beyond a threshold, which is at 75% for the glottal flow rate and acoustics, and at 90% for the vocal fold vibrations. Beyond the threshold, the flow rate, vocal fold vibration amplitude and vocal efficiency decrease rapidly with SGS severity, while the skewness quotient, vibration frequency, signal-to-noise ratio and vocal intensity decrease slightly, and the open quotient increases slightly. Changing the location of SGS shows no effect on the dynamics. Further analysis reveals that the effect of SGS on the dynamics is primarily due to its effect on the flow resistance in the entire airway, which is found to be related to the area ratio of glottis to SGS. Below the SGS severity of 75%, which corresponds to an area ratio of glottis to SGS of 0.1, changing the SGS severity only causes very small changes in the area ratio; therefore, its effect on the flow resistance and dynamics is very small. Beyond the SGS severity of 75%, increasing the SGS severity, leads to rapid increases of the area ratio, resulting in rapid changes in the flow resistance and dynamics.


Sensors ◽  
2021 ◽  
Vol 21 (9) ◽  
pp. 3099
Author(s):  
V. Javier Traver ◽  
Judith Zorío ◽  
Luis A. Leiva

Temporal salience considers how visual attention varies over time. Although visual salience has been widely studied from a spatial perspective, its temporal dimension has been mostly ignored, despite arguably being of utmost importance to understand the temporal evolution of attention on dynamic contents. To address this gap, we proposed Glimpse, a novel measure to compute temporal salience based on the observer-spatio-temporal consistency of raw gaze data. The measure is conceptually simple, training free, and provides a semantically meaningful quantification of visual attention over time. As an extension, we explored scoring algorithms to estimate temporal salience from spatial salience maps predicted with existing computational models. However, these approaches generally fall short when compared with our proposed gaze-based measure. Glimpse could serve as the basis for several downstream tasks such as segmentation or summarization of videos. Glimpse’s software and data are publicly available.


1997 ◽  
Vol 106 (7) ◽  
pp. 594-598 ◽  
Author(s):  
Sina Nasri ◽  
Joel A. Sercarz ◽  
Pouneh Beizai ◽  
Young-Mo Kim ◽  
Ming Ye ◽  
...  

The neuroanatomy of the larynx was explored in seven dogs to assess whether there is motor innervation to the thyroarytenoid (TA) muscle from the external division of the superior laryngeal nerve (ExSLN). In 3 animals, such innervation was identified. Electrical stimulation of microelectrodes applied to the ExSLN resulted in contraction of the TA muscle, indicating that this nerve is motor in function. This was confirmed by electromyographic recordings from the TA muscle. Videolaryngostroboscopy revealed improvement in vocal fold vibration following stimulation of the ExSLN compared to without it. Previously, the TA muscle was thought to be innervated solely by the recurrent laryngeal nerve. This additional pathway from the ExSLN to the TA muscle may have important clinical implications in the treatment of neurologic laryngeal disorders such as adductor spasmodic dysphonia.


1980 ◽  
Vol 83 (2) ◽  
pp. 201-212
Author(s):  
KATSUTAKE HAYASAKI

2002 ◽  
Vol 111 (10) ◽  
pp. 902-908 ◽  
Author(s):  
Renée Speyer ◽  
Pieter A. Kempen ◽  
George Wieneke ◽  
Willem Kersing ◽  
Elham Ghazi Hosseini ◽  
...  

Objective measurements derived from digitized laryngeal stroboscopic images were used to demonstrate changes in vocal fold vibration and in the size of benign lesions after 3 months of voice therapy. Forty chronically dysphonic patients were studied. By means of a rigid stroboscope, pretreatment and posttreatment recordings were made of the vocal folds at rest and under stroboscopic light during phonation. From each recording, images of the positions at rest and during vibration at maximal opening and at maximal closure were digitized. The surface areas of any lesions and of the glottal gap were independently measured in the digitized images by 2 experienced laryngologists. Referential distances were determined in order to compensate for discrepancies in magnification in the various recordings. After 3 months of voice therapy, significant improvement in lesion size and degree of maximal closure during vibration could be demonstrated in about 50% of the patients. The degree of maximal opening did not prove to be a significant parameter.


1980 ◽  
Vol 23 (3) ◽  
pp. 495-510 ◽  
Author(s):  
Ingo R. Titze

The myoelastic-aerodynamic theory of phonation has been quantified and tested with mathematical models. The models suggest that vocal fold oscillation is produced as a result of asymmetric forcing functions over closing and opening portions of the glottal cycle. For nearly uniform tissue displacements, as in falsetto voice, the asymmetry in the driving forces can result from the inertia of the air moving through the glottis. This inertia can in turn be enhanced or suppressed by supraglottal or subglottal vocal tract coupling. More obvious and pronounced asymmetries in the driving forces are associated with non-uniform vocal fold tissue displacements. These are combinations of normal tissue modes, and can result in vertical and horizontal phase differences along the surfaces, as observed in chest voice. The ranges of oscillation increase among various models as more freedom in the simulated tissue movement is incorporated. Of particular significance in initiating and maintaining oscillation are the vertical motions that facilitate coupling of aerodynamic energy into the tissues and allow tissue deformations under conditions of incompressibility. Vertical displacements also can have a significant effect on vocal tract excitation. Control of fundamental frequency of oscillation (FO) is basically myoelastic, partially as a result of deliberate or reflex adjustments of laryngeal muscles, and partially as a result of nonlinear tissue strain over the vibrational cycle. This places limits on the control of FO by subglottal pressure, and forces such control to be inseparably connected with vibrational amplitude, or less directly, with vocal intensity.


1991 ◽  
Vol 34 (3) ◽  
pp. 517-525 ◽  
Author(s):  
Carole Ferrand ◽  
Gordon W. Blood ◽  
Harvey R. Gilbert

The purpose of this study was to validate a proposed continuous-flow model of phonatory reaction time by investigating the temporal order of selected laryngeal and neurophysiological events involved in a phonatory reaction time task. Ten normal speakers participated in a phonatory reaction time task. Laryngeal positioning movements prior to vocal fold closure (laryngeal shift) and onset of vocal fold vibration (acoustic onset) were recorded with an electroglottograph. P300 brain potentials were collected simultaneously, and they served as an index of a central process underlying reaction time. The obtained temporal ordering of laryngeal shift, P300, and acoustic onset supported a continuous-flow model of phonatory reaction time. Use of this model might yield information that is more accurate in explaining physiological function and more precise in describing temporal patterning than the serial model.


2002 ◽  
Vol 111 (6) ◽  
pp. 537-541 ◽  
Author(s):  
Tzu-Yu Hsiao ◽  
Chia-Ming Liu ◽  
Kai-Nan Lin

The mucus layer on the vocal folds was examined by videostrobolaryngoscopy in patients with laryngeal tension-fatigue syndrome, a chronic functional dysphonia due to vocal abuse and misuse. Besides the findings in previous reports (such as abnormal glottal closure, phase or amplitude asymmetry, and the irregular mucosal wave), the vocal folds during vibration had an uneven mucus surface. The occurrence of an uneven mucus layer on vocal folds was significantly greater in subjects with this voice disorder (83% or 250 of 301 patients in this series) than in those without voice disorders (18.5% or 5 of 27). The increase of mucus viscosity, mucus aggregation, and the formation of rough surfaces on the vocal folds alter the mechanical properties that contribute to vibration of the cover of the vocal folds, and thereby worsen the symptoms of dysphonia in patients with laryngeal tension-fatigue syndrome.


Sign in / Sign up

Export Citation Format

Share Document