Productive Voice Onset Time Characteristics of Esophageal Speech

1978 ◽  
Vol 21 (1) ◽  
pp. 56-62 ◽  
Author(s):  
John M. Christensen ◽  
Bernd Weinberg ◽  
Peter J. Alfonso

The voice onset times (VOT) of a large number of stop-consonant initiated syllables produced by esophageal and normal speakers were measured. Esophageal speakers systematically varied VOT during the production of speech-sound categories with the same manner of production. Average voice onset times associated with the production of prevocalic voiceless stops of esophageal speakers were significantly shorter than those of normal speakers, while talker-group comparisons associated with the production of voiced prevocalic stops were nonsignificant. Voice onset times of both esophageal and normal speakers were differentially sensitive to place of articulation. Findings are discussed in terms of furthering current understanding of how effectively esophageal speakers achieve important phonological contrasts.

1975 ◽  
Vol 18 (4) ◽  
pp. 686-706 ◽  
Author(s):  
Dennis H. Klatt

The voice onset time (VOT) and the duration of the burst of frication noise at the release of a plosive consonant were measured from spectrograms of word-initial consonant clusters. Mean data from three speakers reading English words in a sentence frame indicated that the VOT changed as a function of the place of articulation of the plosive and as a function of the identity of the following vowel or sonorant consonant. Burst durations varied in a similar way such that the remaining interval of aspiration in /p, t, k/ was nearly the same duration in comparable phonetic environments. The VOT was longer before sonorants and high vowels than before mid- and low vowels. Aspiration was also seen in an /s/-sonorant cluster. To explain these regularities, production strategies and perceptual cues to a voicing decision for English plosives are considered. Variations in VOT are explained in terms of articulatory mechanisms, perceptual constraints, and phonological rules. Some VOT data obtained from a connected discourse were also analyzed and organized into a set of rules for predicting voice onset time in any sentence context.


2007 ◽  
Vol 59 (3) ◽  
pp. 147-154 ◽  
Author(s):  
Hanjun Liu ◽  
Manwa L. Ng ◽  
Mingxi Wan ◽  
Supin Wang ◽  
Yi Zhang

2021 ◽  
pp. 026765832110089
Author(s):  
Daniel J Olson

Featural approaches to second language phonetic acquisition posit that the development of new phonetic norms relies on sub-phonemic features, expressed through a constellation of articulatory gestures and their corresponding acoustic cues, which may be shared across multiple phonemes. Within featural approaches, largely supported by research in speech perception, debate remains as to the fundamental scope or ‘size’ of featural units. The current study examines potential featural relationships between voiceless and voiced stop consonants, as expressed through the voice onset time cue. Native English-speaking learners of Spanish received targeted training on Spanish voiceless stop consonant production through a visual feedback paradigm. Analysis focused on the change in voice onset time, for both voiceless (i.e. trained) and voiced (i.e. non-trained) phonemes, across the pretest, posttest, and delayed posttest. The results demonstrated a significant improvement (i.e. reduction) in voice onset time for voiceless stops, which were subject to the training paradigm. In contrast, there was no significant change in the non-trained voiced stop consonants. These results suggest a limited featural relationship, with independent voice onset time (VOT) cues for voiceless and voices phonemes. Possible underlying mechanisms that limit feature generalization in second language (L2) phonetic production, including gestural considerations and acoustic similarity, are discussed.


1999 ◽  
Vol 82 (5) ◽  
pp. 2346-2357 ◽  
Author(s):  
Mitchell Steinschneider ◽  
Igor O. Volkov ◽  
M. Daniel Noh ◽  
P. Charles Garell ◽  
Matthew A. Howard

Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) contain prolonged VOTs. As the VOT is increased in incremental steps, perception rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20–40 ms. This abrupt change in consonant identification is an example of categorical speech perception and is a central feature of phonetic discrimination. This study tested the hypothesis that VOT is represented within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop consonant-vowel (CV) syllables were recorded directly from Heschl's gyrus, the planum temporale, and the superior temporal gyrus in three patients undergoing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials within Heschl's gyrus. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60, or 80 ms evoked components time-locked to consonant release and voicing onset. In contrast, the syllable with a VOT of 20 ms evoked a markedly diminished response to voicing onset and elicited an AEP very similar in morphology to that evoked by the syllable with a 0-ms VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, there was a marked decrease in amplitude of the transient response to the second click in trains with interpulse intervals of 20–25 ms. Speech-evoked AEPs recorded from the posterior superior temporal gyrus lateral to Heschl's gyrus displayed comparable response features, whereas field potentials recorded from three locations in the planum temporale did not contain components time-locked to voicing onset. This study demonstrates that VOT at least partially is represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voicing onset. Furthermore, AEPs exhibit features that may facilitate categorical perception of stop consonants, and these response patterns appear to be based on temporal processing limitations within auditory cortex. Demonstrations of similar speech-evoked response patterns in animals support a role for these experimental models in clarifying selected features of speech encoding.


2003 ◽  
Vol 19 (3) ◽  
pp. 209-223 ◽  
Author(s):  
Joe Pater

This article presents a follow-up to Curtin et al.’s study of the perceptual acquisition of Thai laryngeal contrasts by native speakers of English, which found that subjects performed better on contrasts in voice than aspiration. This finding - surprising in light of earlier cross-linguistic voice onset time (VOT) research - was attributed to the fact that the task tapped lexical representations, which are unspecified for aspiration according to standard assumptions in generative phonology. The present study further investigated possible task effects by examining the discrimination and categorization of the same stimuli in various experimental conditions. Stimulus effects were also investigated by performing token-based analyses of the results, and by comparing them to acoustic properties of the tokens. The outcome of the discrimination experiment was the opposite of the earlier study, with significantly better performance on contrasts in aspiration than voice, even on a lexical task. A second finding of this experiment is that place of articulation interacts with the perception of the laryngeal distinctions; the aspiration distinction is discriminated better on the labials, and voice on alveolars. A parallel effect of place of articulation was also found in a categorization experiment.


2006 ◽  
Vol 36 (2) ◽  
pp. 159-186 ◽  
Author(s):  
Matthew Gordon ◽  
Ayla Applebaum

This paper reports results of a quantitative phonetic study of Kabardian, a Northwest Caucasian language that is of typological interest from a phonetic standpoint. A number of cross-linguistically rare properties are examined. These features include the phonetic realization of Kabardian's small vowel inventory, which contains only three contrastive vowel qualities (two short vowels and one long vowel), spectral characteristics of the ten supralaryngeal voiceless fricatives of Kabardian, as well as the acoustic, palatographic, and aerodynamic characteristics of ejective fricatives, an extremely rare type of segment cross-linguistically. In addition, basic properties of the consonant stop series are explored, including closure duration and voice onset time, in order to test postulated universals linking these properties to place of articulation and laryngeal setting.


2011 ◽  
Vol 15 (2) ◽  
pp. 275-287 ◽  
Author(s):  
SUE ANN S. LEE ◽  
GREGORY K. IVERSON

The purpose of this study was to conduct an acoustic examination of the obstruent stops produced by Korean–English bilingual children in connection with the question of whether bilinguals establish distinct categories of speech sounds across languages. Stop productions were obtained from ninety children in two age ranges, five and ten years: thirty Korean–English bilinguals, thirty monolingual Koreans and thirty monolingual English speakers. Voice-Onset-Time (VOT) lag at word-initial stop and fundamental frequency (f0) in the following vowel (hereafter vowel-onset f0) were measured. The bilingual children showed different patterns of VOT in comparison to both English and Korean monolinguals, with longer VOT in their production of Korean stop consonants and shorter VOT for English. Moreover, the ten-year-old bilinguals distinguished all stop categories using both VOT and vowel-onset f0,whereas the five-year-olds tended to make stop distinctions based on VOT but not vowel-onset f0. The results of this study suggest that bilingual children at around five years of age do not yet have fully separate stop systems, and that the systems continue to evolve during the developmental period.


2006 ◽  
Vol 13 ◽  
pp. 1-16 ◽  
Author(s):  
David J. Silva

Abstract. Acoustic data elicited from 34 native speakers of Korean living in the United States pro-vide evidence for diachronic change in the voice onset time (VOT) of phrase-initial aspirated and lax stop phonemes. While older speakers produce aspirated and lax stops with clearly differentiated average VOT values, many younger speakers appear to have neutralized this difference, producing VOTs for aspirated stops that are substantially shorter than those of older speakers, and comparable to those for corresponding lax stops. The data further indicate that, within each age group, older speakers manifest sex-based differences in VOT while younger speakers do not. Despite this appar-ent shift in VOT values, the acoustic evidence suggests that all speakers in this study, regardless of age, continue to mark underlying differences between aspirated and lax stops in terms of stop closure and the fundamental frequency of the following vowel. It is concluded that the data point to a recent phonetic shift in the language, whereby VOT no longer serves as the primary cue to differentiate between lax and aspirated stops. There is not, however, evidence of any reorganization of the lan-guage as the phonemic level: the language's underlying lax ~ aspirated ~ tense contrasts endure.


Sign in / Sign up

Export Citation Format

Share Document