Adult perception of stop consonant voicing in American-English-learning toddlers: Voice onset time and secondary cues

Native speakers of Spanish with different amounts of experience with English classified stop-consonant voicing (/b/ versus /p/) across different speech accents: English-accented Spanish, native Spanish, and native English. While listeners with little experience with English classified target voicing with an English- or Spanish-like voice onset time (VOT) boundary, predicted by contextual VOT, listeners familiar with English relied on an English-like VOT boundary in an English-accented Spanish context even in the absence of clear contextual cues to English VOT. This indicates that Spanish listeners accommodated English-accented Spanish voicing differently depending on their degree of familiarization with the English norm.

Download Full-text

Effects of voice‐onset time duration, and number of adaptor repetitions on the scaling of stop consonant voicing

The Journal of the Acoustical Society of America ◽

10.1121/1.2004088 ◽

1978 ◽

Vol 64 (S1) ◽

pp. S19-S20

Author(s):

Ralph N. Ohde ◽

Donald J. Sharf

Keyword(s):

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Time Duration ◽

Consonant Voicing

Download Full-text

Phonetic feature size in second language acquisition: Examining VOT in voiceless and voiced stops

Second language Research ◽

10.1177/02676583211008951 ◽

2021 ◽

pp. 026765832110089

Author(s):

Daniel J Olson

Keyword(s):

Second Language ◽

Second Language Acquisition ◽

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Acoustic Similarity ◽

Stop Consonants ◽

Voiceless Stop ◽

English Speaking ◽

Underlying Mechanisms

Featural approaches to second language phonetic acquisition posit that the development of new phonetic norms relies on sub-phonemic features, expressed through a constellation of articulatory gestures and their corresponding acoustic cues, which may be shared across multiple phonemes. Within featural approaches, largely supported by research in speech perception, debate remains as to the fundamental scope or ‘size’ of featural units. The current study examines potential featural relationships between voiceless and voiced stop consonants, as expressed through the voice onset time cue. Native English-speaking learners of Spanish received targeted training on Spanish voiceless stop consonant production through a visual feedback paradigm. Analysis focused on the change in voice onset time, for both voiceless (i.e. trained) and voiced (i.e. non-trained) phonemes, across the pretest, posttest, and delayed posttest. The results demonstrated a significant improvement (i.e. reduction) in voice onset time for voiceless stops, which were subject to the training paradigm. In contrast, there was no significant change in the non-trained voiced stop consonants. These results suggest a limited featural relationship, with independent voice onset time (VOT) cues for voiceless and voices phonemes. Possible underlying mechanisms that limit feature generalization in second language (L2) phonetic production, including gestural considerations and acoustic similarity, are discussed.

Download Full-text

Temporal Encoding of the Voice Onset Time Phonetic Parameter by Field Potentials Recorded Directly From Human Auditory Cortex

Journal of Neurophysiology ◽

10.1152/jn.1999.82.5.2346 ◽

1999 ◽

Vol 82 (5) ◽

pp. 2346-2357 ◽

Cited By ~ 120

Author(s):

Mitchell Steinschneider ◽

Igor O. Volkov ◽

M. Daniel Noh ◽

P. Charles Garell ◽

Matthew A. Howard

Keyword(s):

Auditory Cortex ◽

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Superior Temporal Gyrus ◽

Response Patterns ◽

Stop Consonants ◽

Field Potentials ◽

Heschl’S Gyrus ◽

Heschl's Gyrus

Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and t/) contain prolonged VOTs. As the VOT is increased in incremental steps, perception rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20–40 ms. This abrupt change in consonant identification is an example of categorical speech perception and is a central feature of phonetic discrimination. This study tested the hypothesis that VOT is represented within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop consonant-vowel (CV) syllables were recorded directly from Heschl's gyrus, the planum temporale, and the superior temporal gyrus in three patients undergoing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials within Heschl's gyrus. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60, or 80 ms evoked components time-locked to consonant release and voicing onset. In contrast, the syllable with a VOT of 20 ms evoked a markedly diminished response to voicing onset and elicited an AEP very similar in morphology to that evoked by the syllable with a 0-ms VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, there was a marked decrease in amplitude of the transient response to the second click in trains with interpulse intervals of 20–25 ms. Speech-evoked AEPs recorded from the posterior superior temporal gyrus lateral to Heschl's gyrus displayed comparable response features, whereas field potentials recorded from three locations in the planum temporale did not contain components time-locked to voicing onset. This study demonstrates that VOT at least partially is represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voicing onset. Furthermore, AEPs exhibit features that may facilitate categorical perception of stop consonants, and these response patterns appear to be based on temporal processing limitations within auditory cortex. Demonstrations of similar speech-evoked response patterns in animals support a role for these experimental models in clarifying selected features of speech encoding.

Download Full-text

Stop consonant productions of Korean–English bilingual children

Bilingualism Language and Cognition ◽

10.1017/s1366728911000083 ◽

2011 ◽

Vol 15 (2) ◽

pp. 275-287 ◽

Cited By ~ 17

Author(s):

SUE ANN S. LEE ◽

GREGORY K. IVERSON

Keyword(s):

Fundamental Frequency ◽

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Developmental Period ◽

English Speakers ◽

Bilingual Children ◽

Stop Consonants ◽

Speech Sounds ◽

English Bilingual

The purpose of this study was to conduct an acoustic examination of the obstruent stops produced by Korean–English bilingual children in connection with the question of whether bilinguals establish distinct categories of speech sounds across languages. Stop productions were obtained from ninety children in two age ranges, five and ten years: thirty Korean–English bilinguals, thirty monolingual Koreans and thirty monolingual English speakers. Voice-Onset-Time (VOT) lag at word-initial stop and fundamental frequency (f0) in the following vowel (hereafter vowel-onset f0) were measured. The bilingual children showed different patterns of VOT in comparison to both English and Korean monolinguals, with longer VOT in their production of Korean stop consonants and shorter VOT for English. Moreover, the ten-year-old bilinguals distinguished all stop categories using both VOT and vowel-onset f0,whereas the five-year-olds tended to make stop distinctions based on VOT but not vowel-onset f0. The results of this study suggest that bilingual children at around five years of age do not yet have fully separate stop systems, and that the systems continue to evolve during the developmental period.

Download Full-text

Patterns of acquisition of native voice onset time in English-learning children

The Journal of the Acoustical Society of America ◽

10.1121/1.2945118 ◽

2008 ◽

Vol 124 (2) ◽

pp. 1180-1191 ◽

Cited By ~ 32

Author(s):

Joanna H. Lowenstein ◽

Susan Nittrouer

Keyword(s):

Voice Onset Time ◽

Onset Time ◽

English Learning ◽

Native Voice

Download Full-text

Voice Onset Time in Individuals With Hyperfunctional Voice Disorders: Evidence for Disordered Vocal Motor Control

Journal of Speech Language and Hearing Research ◽

10.1044/2019_jslhr-19-00135 ◽

2020 ◽

Vol 63 (2) ◽

pp. 405-420 ◽

Cited By ~ 1

Author(s):

Victoria S. McKenna ◽

Jennifer A. Hylkema ◽

Monique C. Tardif ◽

Cara E. Stepp

Keyword(s):

Motor Control ◽

Coefficient Of Variation ◽

Voice Onset Time ◽

American English ◽

Onset Time ◽

Adult Women ◽

Vocal Hyperfunction ◽

The Mean ◽

Future Work ◽

Age And Sex

Purpose This study examined vocal hyperfunction (VH) using voice onset time (VOT). We hypothesized that speakers with VH would produce shorter VOTs, indicating increased laryngeal tension, and more variable VOTs, indicating disordered vocal motor control. Method We enrolled 32 adult women with VH (aged 20–74 years) and 32 age- and sex-matched controls. All were speakers of American English. Participants produced vowel–consonant–vowel combinations that varied by vowel (ɑ/u) and plosive (p/b, t/d, k/g). VOT—measured at the release of the plosive to the initiation of voicing—was averaged over three repetitions of each vowel–consonant–vowel combination. The coefficient of variation (CoV), a measure of VOT variability, was also computed for each combination. Results The mean VOTs were not significantly different between the two groups; however, the CoVs were significantly greater in speakers with VH compared to controls. Voiceless CoV values were moderately correlated with clinical ratings of dysphonia ( r = .58) in speakers with VH. Conclusion Speakers with VH exhibited greater variability in phonemic voicing targets compared to vocally healthy speakers, supporting the hypothesis for disordered vocal motor control in VH. We suggest future work incorporate VOT measures when assessing auditory discrimination and auditory–motor integration deficits in VH.

Download Full-text

Acoustic Characteristics of Stop Consonants in Children with Fetal Alcohol Syndrome

Perspectives on Speech Science and Orofacial Disorders ◽

10.1044/ssod25.1.29 ◽

2015 ◽

Vol 25 (1) ◽

pp. 29-34 ◽

Cited By ~ 1

Author(s):

Christopher Bolinger ◽

James Dembowski

Keyword(s):

Motor Control ◽

Fetal Alcohol Syndrome ◽

Voice Onset Time ◽

Prenatal Alcohol Exposure ◽

Onset Time ◽

Stop Consonant ◽

Alcohol Exposure ◽

Control Group ◽

Fetal Alcohol ◽

Oral Motor

Speech of children with fetal alcohol syndrome (FAS) has been little studied compared to language. Becker, Warr-Leeper, and Leeper (1990), found a relationship between prenatal alcohol exposure, oral motor control, and speech articulation. Behavioral tests suggest deficits in focal oral motor control specific to children with FAS (Bolinger & Dembowski, 2010). The current project extends that investigation through acoustic measures. Peak and mean frequencies of stop consonant releases were used to infer control of place of articulation. Voice onset time (VOT) was used to infer articulatory-laryngeal coordination. Preliminary measures on 3 experimental speakers and 2 matched neurotypical controls suggest higher stop consonant frequencies in the experimental group, with a poorer distinction between alveolar and velar stops than in the control group. Voiced VOT values were significantly longer for FAS children than for controls. Mean voiceless VOTs were similar across groups, but substantially more variable for the FAS children. Values may be interpreted as acoustic evidence for specific speech motor control deficits in FAS children relative to matched neurotypical children.

Download Full-text

Psychophysical Boundary for Categorization of Voiced–Voiceless Stop Consonants in Native Japanese Speakers

Journal of Speech Language and Hearing Research ◽

10.1044/2017_jslhr-h-17-0131 ◽

2018 ◽

Vol 61 (3) ◽

pp. 789-796 ◽

Cited By ~ 2

Author(s):

Shunsuke Tamura ◽

Kazuhito Ito ◽

Nobuyuki Hirose ◽

Shuji Mori

Keyword(s):

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Stop Consonants ◽

Noise Detection ◽

Native Japanese Speakers ◽

Simultaneity Judgment ◽

Japanese Speakers ◽

Voiceless Stop ◽

Speech Identification

Purpose The purpose of this study was to investigate the psychophysical boundary used for categorization of voiced–voiceless stop consonants in native Japanese speakers. Method Twelve native Japanese speakers participated in the experiment. The stimuli were synthetic stop consonant–vowel stimuli varying in voice onset time (VOT) with manipulation of the amplitude of the initial noise portion and the first formant (F1) frequency of the periodic portion. There were 3 tasks, namely, speech identification to either /d/ or /t/, detection of the noise portion, and simultaneity judgment of onsets of the noise and periodic portions. Results The VOT boundaries of /d/–/t/ were close to the shortest VOT values that allowed for detection of the noise portion but not to those for perceived nonsimultaneity of the noise and periodic portions. The slopes of noise detection functions along VOT were as sharp as those of voiced–voiceless identification functions. In addition, the effects of manipulating the amplitude of the noise portion and the F1 frequency of the periodic portion on the detection of the noise portion were similar to those on voiced–voiceless identification. Conclusion The psychophysical boundary of perception of the initial noise portion masked by the following periodic portion may be used for voiced–voiceless categorization by Japanese speakers.

Download Full-text

The Effects of Fundamental Frequency Level on Voice Onset Time in Normal Adult Male Speakers

Journal of Speech Language and Hearing Research ◽

10.1044/1092-4388(2005/069) ◽

2005 ◽

Vol 48 (5) ◽

pp. 1013-1024 ◽

Cited By ~ 20

Author(s):

Christopher R. McCrea ◽

Richard J. Morris

Keyword(s):

Fundamental Frequency ◽

Adult Male ◽

Voice Onset Time ◽

Onset Time ◽

Stop Consonant ◽

Initial Position ◽

Normal Adult ◽

Frequency Level ◽

Main Effect ◽

Voiceless Stop

The purpose of this study was to examine the effect of fundamental frequency (F 0 ) on stop consonant voice onset time (VOT). VOT was measured from the recordings of 56 young men reading phrases containing all 6 English voiced and voiceless stops in word-initial position across high-, medium-, and low-F 0 levels. Separate analyses of variance for the voiced and voiceless stops revealed no significant main effect for F 0 for the voiced stops but a significant F 0 effect for the voiceless stops. Across the voiceless stops, productions at high F 0 s displayed significantly shorter VOTs than productions at low or mid F 0 s. The findings indicated that researchers must take into account the F 0 level at which voiceless stop VOT is measured.

Download Full-text