Effects of speaking rate, focal stress, and sentence position on spectral characteristics and vowel duration

1993 ◽  
Vol 94 (3) ◽  
pp. 1880-1880
Author(s):  
Dawn M. Behne ◽  
Lynne C. Nygaard
1982 ◽  
Vol 25 (4) ◽  
pp. 574-580 ◽  
Author(s):  
Richard J. Klich

This experiment was designed to investigate the relationships of listener ratings of breathiness to vowel duration, speaking rate, and the relative energy in three frequency ranges (100–500, 1500–2500, and 3500–4500 Hz) in vowel spectra. The effects of vowel SPL also were considered. Listeners used a seven-point equal-appearing interval scale to rate a sentence spoken by each of 10 young adult females in each of four voice qualities: normal speech, mildly breathy, severely breathy, and whisper. Significant Pearson correlations to the ratings were found only for mean SPL and the relative energy in the 100–500 and 3500–4500 Hz ranges. After the effects of mean SPL were accounted for in partial correlation and multiple regression analyses, all vowel parameters were related significantly to the mean ratings. The partial correlations for vowel duration were as high as those for the three frequency ranges. Vowel duration may be as important as spectral characteristics of vowels when breathiness is judged from samples of connected discourse.


2020 ◽  
Vol 63 (4) ◽  
pp. 1002-1017
Author(s):  
Kevin J. Reilly

Purpose This study investigated vowel and sibilant productions in noise to determine whether responses to noise (a) are sensitive to the spectral characteristics of the noise signal and (b) are modulated by the contribution of vowel or sibilant contrasts to word discrimination. Method Vowel and sibilant productions were elicited during serial recall of three-word sequences that were produced in quiet or during exposure to speaker-specific noise signals. These signals either masked a speaker's productions of the sibilants /s/ and /ʃ/ or their productions of the vowels /a/ and /æ/. The contribution of the vowel and sibilant contrasts to word discrimination in a sequence was manipulated by varying the number of times that the target sibilant and vowel pairs occurred in the same word position in each sequence. Results Spectral noise effects were observed for both sibilants and vowels: Responses to noise were larger and/or involved to more acoustic features when the noise signal masked the acoustic characteristics of that phoneme class. Word discrimination effects were limited and consisted of only small increases in vowel duration. Interaction effects between noise and similarity indicated that the phonological similarity of sequences containing both sibilants and/or both vowels influenced articulation in ways not related to speech clarity. Conclusion The findings of this study indicate that sensorimotor control of speech exhibits some sensitivity to noise spectral characteristics. However, productions of sibilants and vowels were not sensitive to their importance in discriminating the words in a sequence. In addition, phonological similarity effects were observed that likely reflected processing demands related to the recall and sequencing of high-similarity words.


2013 ◽  
Vol 56 (5) ◽  
pp. 1485-1502 ◽  
Author(s):  
Kris Tjaden ◽  
Jennifer Lam ◽  
Greg Wilding

Purpose The impact of clear speech, increased vocal intensity, and rate reduction on acoustic characteristics of vowels was compared in speakers with Parkinson's disease (PD), speakers with multiple sclerosis (MS), and healthy controls. Method Speakers read sentences in habitual, clear, loud, and slow conditions. Variations in clarity, intensity, and rate were stimulated using magnitude production. Formant frequency values for peripheral and nonperipheral vowels were obtained at 20%, 50%, and 80% of vowel duration to derive static and dynamic acoustic measures. Intensity and duration measures were obtained. Results Rate was maximally reduced in the slow condition, and vocal intensity was maximized in the loud condition. The clear condition also yielded a reduced articulatory rate and increased intensity, although less than for the slow or loud conditions. Overall, the clear condition had the most consistent impact on vowel spectral characteristics. Spectral and temporal distinctiveness for peripheral–nonperipheral vowel pairs was largely similar across conditions. Conclusions Clear speech maximized peripheral and nonperipheral vowel space areas for speakers with PD and MS while also reducing rate and increasing vocal intensity. These results suggest that a speech style focused on increasing articulatory amplitude yields the most robust changes in vowel segmental articulation.


2019 ◽  
Vol 16 (1) ◽  
pp. 15-28
Author(s):  
Biljana Čubrović

This vowel study looks at the intricate relationship between spectral  characteristics and vowel duration in the context of American English vowels, both from a native speaker (NS) and non-native speaker (NNS) perspective. The non-native speaker cohort is  homogeneous in the sense that all speakers have Serbian as their mother tongue, but have been long-time residents of the US. The phonetic context investigated in this study is /bVt/, where V is one of the American English monophthongs /i ɪ u ʊ ε æ ʌ ɔ ɑ/. The results of the acoustic analysis show that the NNS vowels are generally longer than the NS vowels. Furthermore, NNSs neutralise the vowel quality of two tense and lax pairs of vowels, /i ɪ/ and /u ʊ/, and rely more heavily on the phonetic duration when prononuncing them.


1998 ◽  
Vol 41 (5) ◽  
pp. 976-989 ◽  
Author(s):  
Kris Tjaden ◽  
Gary Weismer

This study examined speaking-rate-induced spectral and temporal variability of F2 formant trajectories for target words produced in a carrier phrase at speaking rates ranging from fast to slow. F2 onset frequency measured at the first glottal pulse following the stop consonant release in target words was used to quantify the extent to which adjacent consonantal and vocalic gestures overlapped; F2 target frequency was operationally defined as the first occurrence of a frequency minimum or maximum following F2 onset frequency. Regression analyses indicated 70% of functions relating F2 onset and vowel duration were statistically significant. The strength of the effect was variable, however, and the direction of significant functions often differed from that predicted by a simple model of overlapping, sliding gestures. Results of a partial correlation analysis examining interrelationships among F2 onset, F2 target frequency, and vowel duration across the speaking rate range indicated that covariation of F2 target with vowel duration may obscure the relationship between F2 onset and vowel duration across rate. The results further suggested that a sliding based model of acoustic variability associated with speaking rate change only partially accounts for the present data, and that such a view accounts for some speakers' data better than others.


2000 ◽  
Vol 107 (5) ◽  
pp. 2918-2918
Author(s):  
Kathleen M. Measer ◽  
James R. Sawusch
Keyword(s):  

2020 ◽  
Vol 63 (1) ◽  
pp. 59-73 ◽  
Author(s):  
Panying Rong

Purpose The purpose of this article was to validate a novel acoustic analysis of oral diadochokinesis (DDK) in assessing bulbar motor involvement in amyotrophic lateral sclerosis (ALS). Method An automated acoustic DDK analysis was developed, which filtered out the voice features and extracted the envelope of the acoustic waveform reflecting the temporal pattern of syllable repetitions during an oral DDK task (i.e., repetitions of /tɑ/ at the maximum rate on 1 breath). Cycle-to-cycle temporal variability (cTV) of envelope fluctuations and syllable repetition rate (sylRate) were derived from the envelope and validated against 2 kinematic measures, which are tongue movement jitter (movJitter) and alternating tongue movement rate (AMR) during the DDK task, in 16 individuals with bulbar ALS and 18 healthy controls. After the validation, cTV, sylRate, movJitter, and AMR, along with an established clinical speech measure, that is, speaking rate (SR), were compared in their ability to (a) differentiate individuals with ALS from healthy controls and (b) detect early-stage bulbar declines in ALS. Results cTV and sylRate were significantly correlated with movJitter and AMR, respectively, across individuals with ALS and healthy controls, confirming the validity of the acoustic DDK analysis in extracting the temporal DDK pattern. Among all the acoustic and kinematic DDK measures, cTV showed the highest diagnostic accuracy (i.e., 0.87) with 80% sensitivity and 94% specificity in differentiating individuals with ALS from healthy controls, which outperformed the SR measure. Moreover, cTV showed a large increase during the early disease stage, which preceded the decline of SR. Conclusions This study provided preliminary validation of a novel automated acoustic DDK analysis in extracting a useful measure, namely, cTV, for early detection of bulbar ALS. This analysis overcame a major barrier in the existing acoustic DDK analysis, which is continuous voicing between syllables that interferes with syllable structures. This approach has potential clinical applications as a novel bulbar assessment.


1995 ◽  
Vol 38 (5) ◽  
pp. 1014-1024 ◽  
Author(s):  
Robert L. Whitehead ◽  
Nicholas Schiavetti ◽  
Brenda H. Whitehead ◽  
Dale Evan Metz

The purpose of this investigation was twofold: (a) to determine if there are changes in specific temporal characteristics of speech that occur during simultaneous communication, and (b) to determine if known temporal rules of spoken English are disrupted during simultaneous communication. Ten speakers uttered sentences consisting of a carrier phrase and experimental CVC words under conditions of: (a) speech, (b) speech combined with signed English, and (c) speech combined with signed English for every word except the CVC word that was fingerspelled. The temporal features investigated included: (a) sentence duration, (b) experimental CVC word duration, (c) vowel duration in experimental CVC words, (d) pause duration before and after experimental CVC words, and (e) consonantal effects on vowel duration. Results indicated that for all durational measures, the speech/sign/fingerspelling condition was longest, followed by the speech/sign condition, with the speech condition being shortest. It was also found that for all three speaking conditions, vowels were longer in duration when preceding voiced consonants than vowels preceding their voiceless cognates, and that a low vowel was longer in duration than a high vowel. These findings indicate that speakers consistently reduced their rate of speech when using simultaneous communication, but did not violate these specific temporal rules of English important for consonant and vowel perception.


Sign in / Sign up

Export Citation Format

Share Document