Productive Voice Onset Time Characteristics of Esophageal Speech

John M. Christensen; Bernd Weinberg; Peter J. Alfonso

doi:10.1044/jshr.2101.56

Productive Voice Onset Time Characteristics of Esophageal Speech

Journal of Speech and Hearing Research ◽

10.1044/jshr.2101.56 ◽

1978 ◽

Vol 21 (1) ◽

pp. 56-62 ◽

Cited By ~ 15

Author(s):

John M. Christensen ◽

Bernd Weinberg ◽

Peter J. Alfonso

Keyword(s):

Voice Onset Time ◽

Speech Sound ◽

Onset Time ◽

Stop Consonant ◽

Current Understanding ◽

Place Of Articulation ◽

Esophageal Speech ◽

Phonological Contrasts ◽

The Voice ◽

Group Comparisons

The voice onset times (VOT) of a large number of stop-consonant initiated syllables produced by esophageal and normal speakers were measured. Esophageal speakers systematically varied VOT during the production of speech-sound categories with the same manner of production. Average voice onset times associated with the production of prevocalic voiceless stops of esophageal speakers were significantly shorter than those of normal speakers, while talker-group comparisons associated with the production of voiced prevocalic stops were nonsignificant. Voice onset times of both esophageal and normal speakers were differentially sensitive to place of articulation. Findings are discussed in terms of furthering current understanding of how effectively esophageal speakers achieve important phonological contrasts.

Download Full-text

Voice Onset Time, Frication, and Aspiration in Word-Initial Consonant Clusters

Journal of Speech and Hearing Research ◽

10.1044/jshr.1804.686 ◽

1975 ◽

Vol 18 (4) ◽

pp. 686-706 ◽

Cited By ~ 261

Author(s):

Dennis H. Klatt

Keyword(s):

Voice Onset Time ◽

Onset Time ◽

Consonant Clusters ◽

Perceptual Cues ◽

Sentence Frame ◽

Place Of Articulation ◽

Connected Discourse ◽

Production Strategies ◽

High Vowels ◽

The Voice

The voice onset time (VOT) and the duration of the burst of frication noise at the release of a plosive consonant were measured from spectrograms of word-initial consonant clusters. Mean data from three speakers reading English words in a sentence frame indicated that the VOT changed as a function of the place of articulation of the plosive and as a function of the identity of the following vowel or sonorant consonant. Burst durations varied in a similar way such that the remaining interval of aspiration in /p, t, k/ was nearly the same duration in comparable phonetic environments. The VOT was longer before sonorants and high vowels than before mid- and low vowels. Aspiration was also seen in an /s/-sonorant cluster. To explain these regularities, production strategies and perceptual cues to a voicing decision for English plosives are considered. Variations in VOT are explained in terms of articulatory mechanisms, perceptual constraints, and phonological rules. Some VOT data obtained from a connected discourse were also analyzed and organized into a set of rules for predicting voice onset time in any sentence context.

Download Full-text

Effects of Place of Articulation and Aspiration on Voice Onset Time in Mandarin Esophageal Speech

Folia Phoniatrica et Logopaedica ◽

10.1159/000101773 ◽

2007 ◽

Vol 59 (3) ◽

pp. 147-154 ◽

Cited By ~ 13

Author(s):

Hanjun Liu ◽

Manwa L. Ng ◽

Mingxi Wan ◽

Supin Wang ◽

Yi Zhang

Keyword(s):

Voice Onset Time ◽

Onset Time ◽

Place Of Articulation ◽

Esophageal Speech

Download Full-text

Phonetic feature size in second language acquisition: Examining VOT in voiceless and voiced stops

Second language Research ◽

10.1177/02676583211008951 ◽

2021 ◽

pp. 026765832110089

Author(s):

Daniel J Olson

Keyword(s):

Second Language ◽

Second Language Acquisition ◽

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Acoustic Similarity ◽

Stop Consonants ◽

Voiceless Stop ◽

English Speaking ◽

Underlying Mechanisms

Featural approaches to second language phonetic acquisition posit that the development of new phonetic norms relies on sub-phonemic features, expressed through a constellation of articulatory gestures and their corresponding acoustic cues, which may be shared across multiple phonemes. Within featural approaches, largely supported by research in speech perception, debate remains as to the fundamental scope or ‘size’ of featural units. The current study examines potential featural relationships between voiceless and voiced stop consonants, as expressed through the voice onset time cue. Native English-speaking learners of Spanish received targeted training on Spanish voiceless stop consonant production through a visual feedback paradigm. Analysis focused on the change in voice onset time, for both voiceless (i.e. trained) and voiced (i.e. non-trained) phonemes, across the pretest, posttest, and delayed posttest. The results demonstrated a significant improvement (i.e. reduction) in voice onset time for voiceless stops, which were subject to the training paradigm. In contrast, there was no significant change in the non-trained voiced stop consonants. These results suggest a limited featural relationship, with independent voice onset time (VOT) cues for voiceless and voices phonemes. Possible underlying mechanisms that limit feature generalization in second language (L2) phonetic production, including gestural considerations and acoustic similarity, are discussed.

Download Full-text

Temporal Encoding of the Voice Onset Time Phonetic Parameter by Field Potentials Recorded Directly From Human Auditory Cortex

Journal of Neurophysiology ◽

10.1152/jn.1999.82.5.2346 ◽

1999 ◽

Vol 82 (5) ◽

pp. 2346-2357 ◽

Cited By ~ 120

Author(s):

Mitchell Steinschneider ◽

Igor O. Volkov ◽

M. Daniel Noh ◽

P. Charles Garell ◽

Matthew A. Howard

Keyword(s):

Auditory Cortex ◽

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Superior Temporal Gyrus ◽

Response Patterns ◽

Stop Consonants ◽

Field Potentials ◽

Heschl’S Gyrus ◽

Heschl's Gyrus

Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) contain prolonged VOTs. As the VOT is increased in incremental steps, perception rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20–40 ms. This abrupt change in consonant identification is an example of categorical speech perception and is a central feature of phonetic discrimination. This study tested the hypothesis that VOT is represented within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop consonant-vowel (CV) syllables were recorded directly from Heschl's gyrus, the planum temporale, and the superior temporal gyrus in three patients undergoing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials within Heschl's gyrus. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60, or 80 ms evoked components time-locked to consonant release and voicing onset. In contrast, the syllable with a VOT of 20 ms evoked a markedly diminished response to voicing onset and elicited an AEP very similar in morphology to that evoked by the syllable with a 0-ms VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, there was a marked decrease in amplitude of the transient response to the second click in trains with interpulse intervals of 20–25 ms. Speech-evoked AEPs recorded from the posterior superior temporal gyrus lateral to Heschl's gyrus displayed comparable response features, whereas field potentials recorded from three locations in the planum temporale did not contain components time-locked to voicing onset. This study demonstrates that VOT at least partially is represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voicing onset. Furthermore, AEPs exhibit features that may facilitate categorical perception of stop consonants, and these response patterns appear to be based on temporal processing limitations within auditory cortex. Demonstrations of similar speech-evoked response patterns in animals support a role for these experimental models in clarifying selected features of speech encoding.

Download Full-text

Categorical and Noncategorical Modes of Speech Perception along the Voice Onset Time Continuum

The Journal of the Acoustical Society of America ◽

10.1121/1.1982642 ◽

1973 ◽

Vol 53 (1) ◽

pp. 369-369

Author(s):

Joan House Lazarus ◽

D. B. Pisoni

Keyword(s):

Speech Perception ◽

Voice Onset Time ◽

Onset Time ◽

The Voice ◽

Time Continuum

Download Full-text

The perceptual acquisition of Thai phonology by English speakers: task and stimulus effects

Second language Research ◽

10.1191/0267658303sr220oa ◽

2003 ◽

Vol 19 (3) ◽

pp. 209-223 ◽

Cited By ~ 13

Author(s):

Joe Pater

Keyword(s):

Voice Onset Time ◽

Onset Time ◽

Acoustic Properties ◽

Experimental Conditions ◽

Lexical Representations ◽

Place Of Articulation ◽

Task Effects ◽

Stimulus Effects ◽

Parallel Effect ◽

Native Speakers Of English

This article presents a follow-up to Curtin et al.’s study of the perceptual acquisition of Thai laryngeal contrasts by native speakers of English, which found that subjects performed better on contrasts in voice than aspiration. This finding - surprising in light of earlier cross-linguistic voice onset time (VOT) research - was attributed to the fact that the task tapped lexical representations, which are unspecified for aspiration according to standard assumptions in generative phonology. The present study further investigated possible task effects by examining the discrimination and categorization of the same stimuli in various experimental conditions. Stimulus effects were also investigated by performing token-based analyses of the results, and by comparing them to acoustic properties of the tokens. The outcome of the discrimination experiment was the opposite of the earlier study, with significantly better performance on contrasts in aspiration than voice, even on a lexical task. A second finding of this experiment is that place of articulation interacts with the perception of the laryngeal distinctions; the aspiration distinction is discriminated better on the labials, and voice on alveolars. A parallel effect of place of articulation was also found in a categorization experiment.

Download Full-text

Phonetic structures of Turkish Kabardian

Journal of the International Phonetic Association ◽

10.1017/s0025100306002532 ◽

2006 ◽

Vol 36 (2) ◽

pp. 159-186 ◽

Cited By ~ 13

Author(s):

Matthew Gordon ◽

Ayla Applebaum

Keyword(s):

Voice Onset Time ◽

Spectral Characteristics ◽

Onset Time ◽

Aerodynamic Characteristics ◽

Rare Type ◽

Place Of Articulation ◽

Short Vowels ◽

Closure Duration ◽

Phonetic Study ◽

Phonetic Realization

This paper reports results of a quantitative phonetic study of Kabardian, a Northwest Caucasian language that is of typological interest from a phonetic standpoint. A number of cross-linguistically rare properties are examined. These features include the phonetic realization of Kabardian's small vowel inventory, which contains only three contrastive vowel qualities (two short vowels and one long vowel), spectral characteristics of the ten supralaryngeal voiceless fricatives of Kabardian, as well as the acoustic, palatographic, and aerodynamic characteristics of ejective fricatives, an extremely rare type of segment cross-linguistically. In addition, basic properties of the consonant stop series are explored, including closure duration and voice onset time, in order to test postulated universals linking these properties to place of articulation and laryngeal setting.

Download Full-text

Stop consonant productions of Korean–English bilingual children

Bilingualism Language and Cognition ◽

10.1017/s1366728911000083 ◽

2011 ◽

Vol 15 (2) ◽

pp. 275-287 ◽

Cited By ~ 17

Author(s):

SUE ANN S. LEE ◽

GREGORY K. IVERSON

Keyword(s):

Fundamental Frequency ◽

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Developmental Period ◽

English Speakers ◽

Bilingual Children ◽

Stop Consonants ◽

Speech Sounds ◽

English Bilingual

The purpose of this study was to conduct an acoustic examination of the obstruent stops produced by Korean–English bilingual children in connection with the question of whether bilinguals establish distinct categories of speech sounds across languages. Stop productions were obtained from ninety children in two age ranges, five and ten years: thirty Korean–English bilinguals, thirty monolingual Koreans and thirty monolingual English speakers. Voice-Onset-Time (VOT) lag at word-initial stop and fundamental frequency (f0) in the following vowel (hereafter vowel-onset f0) were measured. The bilingual children showed different patterns of VOT in comparison to both English and Korean monolinguals, with longer VOT in their production of Korean stop consonants and shorter VOT for English. Moreover, the ten-year-old bilinguals distinguished all stop categories using both VOT and vowel-onset f0,whereas the five-year-olds tended to make stop distinctions based on VOT but not vowel-onset f0. The results of this study suggest that bilingual children at around five years of age do not yet have fully separate stop systems, and that the systems continue to evolve during the developmental period.

Download Full-text

Variation in Voice Onset Time for Korean Stops

Korean Linguistics ◽

10.1075/kl.13.01djs ◽

2006 ◽

Vol 13 ◽

pp. 1-16 ◽

Cited By ~ 16

Author(s):

David J. Silva

Keyword(s):

United States ◽

Native Speakers ◽

Voice Onset Time ◽

Onset Time ◽

The United States ◽

Age Group ◽

Acoustic Data ◽

Diachronic Change ◽

Phonemic Level ◽

The Voice

Abstract. Acoustic data elicited from 34 native speakers of Korean living in the United States pro-vide evidence for diachronic change in the voice onset time (VOT) of phrase-initial aspirated and lax stop phonemes. While older speakers produce aspirated and lax stops with clearly differentiated average VOT values, many younger speakers appear to have neutralized this difference, producing VOTs for aspirated stops that are substantially shorter than those of older speakers, and comparable to those for corresponding lax stops. The data further indicate that, within each age group, older speakers manifest sex-based differences in VOT while younger speakers do not. Despite this appar-ent shift in VOT values, the acoustic evidence suggests that all speakers in this study, regardless of age, continue to mark underlying differences between aspirated and lax stops in terms of stop closure and the fundamental frequency of the following vowel. It is concluded that the data point to a recent phonetic shift in the language, whereby VOT no longer serves as the primary cue to differentiate between lax and aspirated stops. There is not, however, evidence of any reorganization of the lan-guage as the phonemic level: the language's underlying lax ~ aspirated ~ tense contrasts endure.

Download Full-text

Perceptual interactions among voice onset time, second formant onset, voice, and place of articulation

The Journal of the Acoustical Society of America ◽

10.1121/1.420479 ◽

1997 ◽

Vol 102 (5) ◽

pp. 3095-3095

Author(s):

José R. Benkí

Keyword(s):

Voice Onset Time ◽

Onset Time ◽

Place Of Articulation ◽

Second Formant

Download Full-text