Acoustic classification of coronal stops of Eastern Punjabi

Phonetica ◽  
2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Qandeel Hussain ◽  
Alexei Kochetov

Abstract: Punjabi is an Indo-Aryan language which contrasts a rich set of coronal stops at dental and retroflex places of articulation across three laryngeal configurations. Moreover, all these stops occur contrastively in various positions (word-initially, -medially, and -finally). The goal of this study is to investigate how various coronal place and laryngeal contrasts are distinguished acoustically both within and across word positions. A number of temporal and spectral correlates were examined in data from 13 speakers of Eastern Punjabi: Voice Onset Time, release and closure durations, fundamental frequency, F1-F3 formants, spectral center of gravity and standard deviation, H1*-H2*, and cepstral peak prominence. The findings indicated that higher formants and spectral measures were most important for the classification of place contrasts across word positions, whereas laryngeal contrasts were reliably distinguished by durational and voice quality measures. Word-medially and -finally, F2 and F3 of the preceding vowels played a key role in distinguishing the dental and retroflex stops, while spectral noise measures were more important word-initially. The findings of this study contribute to a better understanding of factors involved in the maintenance of typologically rare and phonetically complex sets of place and laryngeal contrasts in the coronal stops of Indo-Aryan languages.
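
The spectral center of gravity and standard deviation named in the abstract above are the first two moments of a spectrum. The sketch below shows one common way to compute them, and it is not the authors' procedure: the power weighting, the absence of windowing or pre-emphasis, the naive DFT, and the function names `power_spectrum` and `spectral_moments` are all assumptions made for illustration.

```python
import cmath
import math


def power_spectrum(samples, sample_rate):
    """Naive DFT power spectrum over non-negative frequencies.

    Quadratic in frame length, so only suitable for short frames.
    """
    n = len(samples)
    freqs, power = [], []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(samples)
        )
        freqs.append(k * sample_rate / n)
        power.append(abs(coeff) ** 2)
    return freqs, power


def spectral_moments(samples, sample_rate):
    """Return (center of gravity, standard deviation) in Hz.

    Assumption: each frequency bin is weighted by its power.
    """
    freqs, power = power_spectrum(samples, sample_rate)
    total = sum(power)
    if total == 0:
        raise ValueError("silent frame: spectral moments are undefined")
    cog = sum(f * p for f, p in zip(freqs, power)) / total
    variance = sum((f - cog) ** 2 * p for f, p in zip(freqs, power)) / total
    return cog, math.sqrt(variance)


# Synthetic check: a 1000 Hz tone that falls exactly on a DFT bin.
sr = 8000
tone = [math.sin(2 * math.pi * 1000 * t / sr) for t in range(64)]
cog, sd = spectral_moments(tone, sr)
```

For a stop release burst, the same function would be applied to a short frame taken at the burst. A higher center of gravity indicates energy concentrated at higher frequencies, and a larger standard deviation indicates a more diffuse spectrum.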

1993 ◽  
Vol 76 (1) ◽  
pp. 83-88 ◽  
Author(s):  
Linda Petrosino ◽  
Roger D. Colcord ◽  
Karen B. Kurcz ◽  
Robert J. Yonker

Voice onset time (VOT) was measured for voiced and voiceless velar stop consonants across three vowel contexts (/i, a, u/) in healthy young adult and older subjects. Analysis showed that mean VOT values for both /k/ and /g/ across the three vowel contexts did not differ between the two groups; however, differences in VOT variability (standard deviation) approached significance; the older subjects exhibited increased variability. This apparent increase in variability may be related to the subtle anatomical and physiological changes with age.


2019 ◽  
Vol 62 (4) ◽  
pp. 853-867 ◽  
Author(s):  
Joanna H. Lowenstein ◽  
Susan Nittrouer

Purpose Child phonologists have long been interested in how tightly speech input constrains the speech production capacities of young children, and the question acquires clinical significance when children with hearing loss are considered. Children with sensorineural hearing loss often show differences in the spectral and temporal structures of their speech production, compared to children with normal hearing. The current study was designed to investigate the extent to which this problem can be explained by signal degradation. Method Ten 5-year-olds with normal hearing were recorded imitating 120 three-syllable nonwords presented in unprocessed form and as noise-vocoded signals. Target segments consisted of fricatives, stops, and vowels. Several measures were made: 2 duration measures (voice onset time and fricative length) and 4 spectral measures involving 2 segments (1st and 3rd moments of fricatives and 1st and 2nd formant frequencies for the point vowels). Results All spectral measures were affected by signal degradation, with vowel production showing the largest effects. Although a change in voice onset time was observed with vocoded signals for /d/, voicing category was not affected. Fricative duration remained constant. Conclusions Results support the hypothesis that quality of the input signal constrains the speech production capacities of young children. Consequently, it can be concluded that the production problems of children with hearing loss—including those with cochlear implants—can be explained to some extent by the degradation in the signal they hear. However, experience with both speech perception and production likely plays a role as well.


2001 ◽  
Vol 48 (1) ◽  
pp. 21-31
Author(s):  
Loren Kahn ◽  
Heila Jordaan

This paper describes the swallowing difficulty and abnormal voicing characteristics of a subject with pharyngeal dystonia. This rare form of dystonia, considered to be a neurological condition resulting in involuntary spasm of the muscles of the pharynx, has not been documented in terms of its effects on the acoustic properties of the voice. This study revealed that during pharyngeal spasm, there are significant delays in voice onset time, a reduction in fundamental frequency, an increased percentage of sub-harmonics and variability in the amplitude perturbation quotient as well as shimmer. There was also evidence of these characteristics during periods of 'spasm-free' voice production, suggesting that the condition might be more consistent than what the subject described. Resonance disturbances were observed in spasm, which might explain the 'hollow' and affected voice quality. The subject also reported severe swallowing difficulties during the periods of spasm, characterised by a tight constriction at the level of the subject's throat. It is clear that an abnormality at the level of the cricopharyngeal muscle has a dual effect on the acoustic properties of the voice and on swallowing.


2021 ◽  
Vol 6 (2) ◽  
pp. 89-103
Author(s):  
Ghadah G. Alharbi ◽  
Michael P. Cannito ◽  
Eugene H. Buder ◽  
Shaheen N. Awan

Purpose: The primary purpose of this study was to assess the effect of Lee Silverman Voice Treatment (LSVT®LOUD) on selected spectral/cepstral measures of voice in connected speech. Spectral/cepstral analyses also were used to descriptively compare changes in connected speech to those previously reported in sustained vowels. An additional goal was to examine individual differences in responses to LSVT across the spectral/cepstral measures. Methods: Nine adult participants with PD were examined in a pre/post treatment design. Speech recordings were obtained on three different days within one week before and one week after LSVT. Speech recordings were analyzed for cepstral peak prominence (CPP), CPP Standard Deviation (CPP-SD), Low/High Spectral Ratio (L/H SR), and Cepstral/Spectral Index of Dysphonia (CSID) using the Analysis of Dysphonia in Speech and Voice (ADSV) program. Results: CPP and CSID were the only measures that detected treatment-related changes in connected speech. Investigating individual differences demonstrated more participants exhibiting an improvement in sustained phonation than in connected speech. Conclusions: Cepstral/spectral measures have been shown to be valid measures for estimating dysphonia severity in both continuous speech and sustained vowels. In addition, this study extends the use of the cepstral/spectral measures for characterizing speech and voice aspects prior to treatment and for quantifying treatment outcomes. Moreover, it supports the use of LSVT as a treatment approach for improving voice quality in addition to intensity in PD.


2020 ◽  
Vol 63 (12) ◽  
pp. 3991-3999
Author(s):  
Benjamin van der Woerd ◽  
Min Wu ◽  
Vijay Parsa ◽  
Philip C. Doyle ◽  
Kevin Fung

Objectives This study aimed to evaluate the fidelity and accuracy of a smartphone microphone and recording environment on acoustic measurements of voice. Method A prospective cohort proof-of-concept study. Two sets of prerecorded samples, (a) sustained vowels (/a/) and (b) a Rainbow Passage sentence, were played for recording via the internal iPhone microphone and the Blue Yeti USB microphone in two recording environments: a sound-treated booth and a quiet office setting. Recordings were presented using a calibrated mannequin speaker with a fixed signal intensity (69 dBA) at a fixed distance (15 in.). Each set of recordings (iPhone/audio booth, Blue Yeti/audio booth, iPhone/office, and Blue Yeti/office) was time-windowed to ensure the same signal was evaluated for each condition. Acoustic measures of voice, including fundamental frequency (fo), jitter, shimmer, harmonic-to-noise ratio (HNR), and cepstral peak prominence (CPP), were generated using a widely used analysis program (Praat Version 6.0.50). The data gathered were compared using a repeated measures analysis of variance. Two separate data sets were used. The set of vowel samples included both pathologic (n = 10) and normal (n = 10), male (n = 5) and female (n = 15) speakers. The set of sentence stimuli ranged in perceived voice quality from normal to severely disordered, with an equal number of male (n = 12) and female (n = 12) speakers evaluated. Results The vowel analyses indicated that jitter, shimmer, HNR, and CPP were significantly different based on microphone choice, and shimmer, HNR, and CPP were significantly different based on the recording environment. Analysis of sentences revealed a statistically significant impact of recording environment and microphone type on HNR and CPP. While statistically significant, the differences across the experimental conditions for a subset of the acoustic measures (viz., jitter and CPP) fell within their respective normative ranges. Conclusions Both microphone and recording setting resulted in significant differences across several acoustic measurements. However, a subset of the acoustic measures that were statistically significant across the recording conditions showed small overall differences that are unlikely to have clinical significance in interpretation. For these acoustic measures, the present data suggest that, although a sound-treated setting is ideal for voice sample collection, a smartphone microphone can capture acceptable recordings for acoustic signal analysis.

