Measuring incompleteness: Acoustic correlates of glottal articulations

2014 ◽  
Vol 44 (3) ◽  
pp. 283-296 ◽  
Author(s):  
Michael Ashby ◽  
Joanna Przedlacka

The autocorrelation function, a measure of regularity in the speech signal, is applied in demarcating the seemingly diffuse intervals of glottalization which accompany or replace voiceless oral stops in elicited recordings from 22 young speakers of Southern British English. It is shown that a local minimum in autocorrelation characterizes almost all instances heard as intervocalic glottal stops; an annotation procedure is developed and used to gather data on glottalization gestures, including duration, f0, energy and autocorrelation. The same measure is used to assess regularity of vocal fold vibration in an interval just prior to the formation of the total closure for instances of syllable-final /t/, and confirms significantly lower autocorrelation in a group auditorily judged ‘pre-glottalized’. Implications are considered both for normal speech perception and for expert phonetic judgments.

1997 ◽  
Vol 106 (7) ◽  
pp. 533-543 ◽  
Author(s):  
Steven M. Zeitels ◽  
Glenn W. Bunting ◽  
Robert E. Hillman ◽  
Traci Vaughn

Reinke's edema (RE) has been associated typically with smoking and sometimes with vocal abuse, but aspects of the pathophysiology of RE remain unclear. To gain new insights into phonatory mechanisms associated with RE pathophysiology, weused an integrated battery of objective vocal function tests to analyze 20 patients (19 women) who underwent phonomicrosurgical resection. Preoperative stroboscopic examinations demonstrated that the superficial lamina propria is distended primarily on the superior vocal fold surface. Acoustically, these individuals have an abnormally low average speaking fundamental frequency (123 Hz), and they generate abnormally high average subglottal pressures (9.7 cm H20). The presence of elevated aerodynamic driving pressures reflects difficulties in producing vocal fold vibration that are most likely the result of mass loading associated with RE, and possibly vocal hyperfunction. Furthermore, it is hypothesized that in the environment of chronic glottal mucositis secondary to smoking and reflux, the cephalad force on the vocal folds by the subglottal driving pressure contributes to the superior distention of the superficial lamina propria. Surgical reduction of the volume of the superficial lamina propria resulted in a significant elevation in fundamental frequency (154 Hz) and improvement in perturbation measures. In almost all instances, both the clinician and the patient perceived the voice as improved. However, these patients continued to generate elevated subglottal pressure (probably a sign of persistent hyperfunction) that was accompanied by visually observed supraglottal strain despite the normalsized vocal folds. This finding suggests that persistent hyperfunctional vocal behaviors may contribute to postsurgical RE recurrence if therapeutic strategies are not instituted to modify such behavior.


Author(s):  
Nele Salveste

Erinevate häälikute laad meie igapäevases kõnes varieerub tugevalt, kuid häälduse varieeruvus ei ole enamasti kõneeristusele takistuseks. See annab alust oletada, et kõnetaju on välja arendanud süsteemi, millega tuvastada foneeme väga suure varieeruvusega kõnesignaalist. See süsteem tegeleb kõne varieeruvusega nii tõhusalt ja kiiresti, et me ei ole sellest enamasti teadlikud. Seda süsteemi võiks nimetada kategoriaalseks tajuks (ingl Categorical Perception), kuid kuna taju on uurimisele üksnes kaudselt kättesaadav, siis tähistab see termin pigem eksperimentaalset mudelit või meetodit, millega uuritakse taju võimet foneeme kõnesignaalist eristada. (Schouten jt 2003) Käesolevas artiklis arutatakse kategoriaalse taju kui mudeli ja katsemeetodi üle, mille teoreetilised lähtekohad on olnud nii muudes keeltes kui eesti keeles läbi viidud tajukatsete ülesehituse ja järelduste eeldusteks.Categorical perception or the hypothesis of how we perceive linguistic units. The acoustic signal of everyday speech is very variable, but it seldom distracts the normal speech communication. This motivates the hypothesis that the speech perception must have developed a special mechanism for extracting phonemes from highly variable speech signal. This mechanism extracts phonemes so efficiently and quickly that we are often unaware of it. We would like to call this mechanism “categorical perception of speech”, but since the perceptual processes are only indirectly accessible for investigation, the term refers rather to a theoretical model or an experimental method for investigating our perceptual ability to distinguish phonemes from the speech signal so efficiently (Schouten et al. 2003). In this paper the Categorical Perception as an experimental method and its theoretical statements will be discussed in connection to perception experiments and findings in other languages as well as in Estonian language.


2021 ◽  
Vol 11 (3) ◽  
pp. 1221
Author(s):  
Dariush Bodaghi ◽  
Qian Xue ◽  
Xudong Zheng ◽  
Scott Thomson

An in-house 3D fluid–structure–acoustic interaction numerical solver was employed to investigate the effect of subglottic stenosis (SGS) on dynamics of glottal flow, vocal fold vibration and acoustics during voice production. The investigation focused on two SGS properties, including severity defined as the percentage of area reduction and location. The results show that SGS affects voice production only when its severity is beyond a threshold, which is at 75% for the glottal flow rate and acoustics, and at 90% for the vocal fold vibrations. Beyond the threshold, the flow rate, vocal fold vibration amplitude and vocal efficiency decrease rapidly with SGS severity, while the skewness quotient, vibration frequency, signal-to-noise ratio and vocal intensity decrease slightly, and the open quotient increases slightly. Changing the location of SGS shows no effect on the dynamics. Further analysis reveals that the effect of SGS on the dynamics is primarily due to its effect on the flow resistance in the entire airway, which is found to be related to the area ratio of glottis to SGS. Below the SGS severity of 75%, which corresponds to an area ratio of glottis to SGS of 0.1, changing the SGS severity only causes very small changes in the area ratio; therefore, its effect on the flow resistance and dynamics is very small. Beyond the SGS severity of 75%, increasing the SGS severity, leads to rapid increases of the area ratio, resulting in rapid changes in the flow resistance and dynamics.


1997 ◽  
Vol 106 (7) ◽  
pp. 594-598 ◽  
Author(s):  
Sina Nasri ◽  
Joel A. Sercarz ◽  
Pouneh Beizai ◽  
Young-Mo Kim ◽  
Ming Ye ◽  
...  

The neuroanatomy of the larynx was explored in seven dogs to assess whether there is motor innervation to the thyroarytenoid (TA) muscle from the external division of the superior laryngeal nerve (ExSLN). In 3 animals, such innervation was identified. Electrical stimulation of microelectrodes applied to the ExSLN resulted in contraction of the TA muscle, indicating that this nerve is motor in function. This was confirmed by electromyographic recordings from the TA muscle. Videolaryngostroboscopy revealed improvement in vocal fold vibration following stimulation of the ExSLN compared to without it. Previously, the TA muscle was thought to be innervated solely by the recurrent laryngeal nerve. This additional pathway from the ExSLN to the TA muscle may have important clinical implications in the treatment of neurologic laryngeal disorders such as adductor spasmodic dysphonia.


1980 ◽  
Vol 83 (2) ◽  
pp. 201-212
Author(s):  
KATSUTAKE HAYASAKI

2002 ◽  
Vol 111 (10) ◽  
pp. 902-908 ◽  
Author(s):  
Renée Speyer ◽  
Pieter A. Kempen ◽  
George Wieneke ◽  
Willem Kersing ◽  
Elham Ghazi Hosseini ◽  
...  

Objective measurements derived from digitized laryngeal stroboscopic images were used to demonstrate changes in vocal fold vibration and in the size of benign lesions after 3 months of voice therapy. Forty chronically dysphonic patients were studied. By means of a rigid stroboscope, pretreatment and posttreatment recordings were made of the vocal folds at rest and under stroboscopic light during phonation. From each recording, images of the positions at rest and during vibration at maximal opening and at maximal closure were digitized. The surface areas of any lesions and of the glottal gap were independently measured in the digitized images by 2 experienced laryngologists. Referential distances were determined in order to compensate for discrepancies in magnification in the various recordings. After 3 months of voice therapy, significant improvement in lesion size and degree of maximal closure during vibration could be demonstrated in about 50% of the patients. The degree of maximal opening did not prove to be a significant parameter.


1991 ◽  
Vol 34 (3) ◽  
pp. 517-525 ◽  
Author(s):  
Carole Ferrand ◽  
Gordon W. Blood ◽  
Harvey R. Gilbert

The purpose of this study was to validate a proposed continuous-flow model of phonatory reaction time by investigating the temporal order of selected laryngeal and neurophysiological events involved in a phonatory reaction time task. Ten normal speakers participated in a phonatory reaction time task. Laryngeal positioning movements prior to vocal fold closure (laryngeal shift) and onset of vocal fold vibration (acoustic onset) were recorded with an electroglottograph. P300 brain potentials were collected simultaneously, and they served as an index of a central process underlying reaction time. The obtained temporal ordering of laryngeal shift, P300, and acoustic onset supported a continuous-flow model of phonatory reaction time. Use of this model might yield information that is more accurate in explaining physiological function and more precise in describing temporal patterning than the serial model.


2002 ◽  
Vol 111 (6) ◽  
pp. 537-541 ◽  
Author(s):  
Tzu-Yu Hsiao ◽  
Chia-Ming Liu ◽  
Kai-Nan Lin

The mucus layer on the vocal folds was examined by videostrobolaryngoscopy in patients with laryngeal tension-fatigue syndrome, a chronic functional dysphonia due to vocal abuse and misuse. Besides the findings in previous reports (such as abnormal glottal closure, phase or amplitude asymmetry, and the irregular mucosal wave), the vocal folds during vibration had an uneven mucus surface. The occurrence of an uneven mucus layer on vocal folds was significantly greater in subjects with this voice disorder (83% or 250 of 301 patients in this series) than in those without voice disorders (18.5% or 5 of 27). The increase of mucus viscosity, mucus aggregation, and the formation of rough surfaces on the vocal folds alter the mechanical properties that contribute to vibration of the cover of the vocal folds, and thereby worsen the symptoms of dysphonia in patients with laryngeal tension-fatigue syndrome.


1987 ◽  
Vol 82 (S1) ◽  
pp. S16-S16 ◽  
Author(s):  
Corine Bickley ◽  
Kevin Brown

Sign in / Sign up

Export Citation Format

Share Document