Phonetic feature size in second language acquisition: Examining VOT in voiceless and voiced stops

2021 ◽  
pp. 026765832110089
Author(s):  
Daniel J Olson

Featural approaches to second language phonetic acquisition posit that the development of new phonetic norms relies on sub-phonemic features, expressed through a constellation of articulatory gestures and their corresponding acoustic cues, which may be shared across multiple phonemes. Within featural approaches, largely supported by research in speech perception, debate remains as to the fundamental scope or ‘size’ of featural units. The current study examines potential featural relationships between voiceless and voiced stop consonants, as expressed through the voice onset time cue. Native English-speaking learners of Spanish received targeted training on Spanish voiceless stop consonant production through a visual feedback paradigm. Analysis focused on the change in voice onset time, for both voiceless (i.e. trained) and voiced (i.e. non-trained) phonemes, across the pretest, posttest, and delayed posttest. The results demonstrated a significant improvement (i.e. reduction) in voice onset time for voiceless stops, which were subject to the training paradigm. In contrast, there was no significant change in the non-trained voiced stop consonants. These results suggest a limited featural relationship, with independent voice onset time (VOT) cues for voiceless and voiced phonemes. Possible underlying mechanisms that limit feature generalization in second language (L2) phonetic production, including gestural considerations and acoustic similarity, are discussed.

2018 ◽  
Vol 61 (3) ◽  
pp. 789-796 ◽  
Author(s):  
Shunsuke Tamura ◽  
Kazuhito Ito ◽  
Nobuyuki Hirose ◽  
Shuji Mori

Purpose: The purpose of this study was to investigate the psychophysical boundary used for categorization of voiced–voiceless stop consonants in native Japanese speakers. Method: Twelve native Japanese speakers participated in the experiment. The stimuli were synthetic stop consonant–vowel stimuli varying in voice onset time (VOT) with manipulation of the amplitude of the initial noise portion and the first formant (F1) frequency of the periodic portion. There were 3 tasks, namely, speech identification to either /d/ or /t/, detection of the noise portion, and simultaneity judgment of onsets of the noise and periodic portions. Results: The VOT boundaries of /d/–/t/ were close to the shortest VOT values that allowed for detection of the noise portion but not to those for perceived nonsimultaneity of the noise and periodic portions. The slopes of noise detection functions along VOT were as sharp as those of voiced–voiceless identification functions. In addition, the effects of manipulating the amplitude of the noise portion and the F1 frequency of the periodic portion on the detection of the noise portion were similar to those on voiced–voiceless identification. Conclusion: The psychophysical boundary of perception of the initial noise portion masked by the following periodic portion may be used for voiced–voiceless categorization by Japanese speakers.


Languages ◽  
2021 ◽  
Vol 6 (2) ◽  
pp. 61
Author(s):  
Lisa Kornder ◽  
Ineke Mennen

The purpose of this investigation was to trace first (L1) and second language (L2) segmental speech development in the Austrian German–English late bilingual Arnold Schwarzenegger over a period of 40 years, which makes it the first study to examine a bilingual’s speech development over several decades in both their languages. To this end, acoustic measurements of voice onset time (VOT) durations of word-initial plosives (Study 1) and formant frequencies of the first and second formant of Austrian German and English monophthongs (Study 2) were conducted using speech samples collected from broadcast interviews. The results of Study 1 showed a merging of Schwarzenegger’s German and English voiceless plosives in his late productions as manifested in a significant lengthening of VOT duration in his German plosives, and a shortening of VOT duration in his English plosives, closer to L1 production norms. Similar findings were evidenced in Study 2, revealing that some of Schwarzenegger’s L1 and L2 vowel categories had moved closer together in the course of L2 immersion. These findings suggest that both a bilingual’s first and second language accent is likely to develop and reorganize over time due to dynamic interactions between the first and second language system.


1999 ◽  
Vol 82 (5) ◽  
pp. 2346-2357 ◽  
Author(s):  
Mitchell Steinschneider ◽  
Igor O. Volkov ◽  
M. Daniel Noh ◽  
P. Charles Garell ◽  
Matthew A. Howard

Voice onset time (VOT) is an important parameter of speech that denotes the time interval between consonant onset and the onset of low-frequency periodicity generated by rhythmic vocal cord vibration. Voiced stop consonants (/b/, /g/, and /d/) in syllable initial position are characterized by short VOTs, whereas unvoiced stop consonants (/p/, /k/, and /t/) contain prolonged VOTs. As the VOT is increased in incremental steps, perception rapidly changes from a voiced stop consonant to an unvoiced consonant at an interval of 20–40 ms. This abrupt change in consonant identification is an example of categorical speech perception and is a central feature of phonetic discrimination. This study tested the hypothesis that VOT is represented within auditory cortex by transient responses time-locked to consonant and voicing onset. Auditory evoked potentials (AEPs) elicited by stop consonant-vowel (CV) syllables were recorded directly from Heschl's gyrus, the planum temporale, and the superior temporal gyrus in three patients undergoing evaluation for surgical remediation of medically intractable epilepsy. Voiced CV syllables elicited a triphasic sequence of field potentials within Heschl's gyrus. AEPs evoked by unvoiced CV syllables contained additional response components time-locked to voicing onset. Syllables with a VOT of 40, 60, or 80 ms evoked components time-locked to consonant release and voicing onset. In contrast, the syllable with a VOT of 20 ms evoked a markedly diminished response to voicing onset and elicited an AEP very similar in morphology to that evoked by the syllable with a 0-ms VOT. Similar response features were observed in the AEPs evoked by click trains. In this case, there was a marked decrease in amplitude of the transient response to the second click in trains with interpulse intervals of 20–25 ms. Speech-evoked AEPs recorded from the posterior superior temporal gyrus lateral to Heschl's gyrus displayed comparable response features, whereas field potentials recorded from three locations in the planum temporale did not contain components time-locked to voicing onset. This study demonstrates that VOT is at least partially represented in primary and specific secondary auditory cortical fields by synchronized activity time-locked to consonant release and voicing onset. Furthermore, AEPs exhibit features that may facilitate categorical perception of stop consonants, and these response patterns appear to be based on temporal processing limitations within auditory cortex. Demonstrations of similar speech-evoked response patterns in animals support a role for these experimental models in clarifying selected features of speech encoding.


2011 ◽  
Vol 15 (2) ◽  
pp. 275-287 ◽  
Author(s):  
SUE ANN S. LEE ◽  
GREGORY K. IVERSON

The purpose of this study was to conduct an acoustic examination of the obstruent stops produced by Korean–English bilingual children in connection with the question of whether bilinguals establish distinct categories of speech sounds across languages. Stop productions were obtained from ninety children in two age ranges, five and ten years: thirty Korean–English bilinguals, thirty monolingual Koreans and thirty monolingual English speakers. Voice-Onset-Time (VOT) lag at word-initial stop and fundamental frequency (f0) in the following vowel (hereafter vowel-onset f0) were measured. The bilingual children showed different patterns of VOT in comparison to both English and Korean monolinguals, with longer VOT in their production of Korean stop consonants and shorter VOT for English. Moreover, the ten-year-old bilinguals distinguished all stop categories using both VOT and vowel-onset f0, whereas the five-year-olds tended to make stop distinctions based on VOT but not vowel-onset f0. The results of this study suggest that bilingual children at around five years of age do not yet have fully separate stop systems, and that the systems continue to evolve during the developmental period.


2020 ◽  
Vol 49 (3) ◽  
pp. 97-101
Author(s):  
Martin Kaňok ◽  
Michal Novotný

Evaluation of the precision of consonant articulation is a commonly used metric in the assessment of pathological speech. However, to date, most research on consonant characteristics has been performed on English, although there are obvious language-specific differences. The aim of the current study was therefore to investigate the patterns of consonant articulation in Czech across 6 stop consonants with respect to age and gender. The database used consisted of 30 female and 30 male healthy participants. Four acoustic variables including voice onset time (VOT), VOT ratio and two spectral moments were analyzed. The Czech plosives /p/, /t/ and /k/ were found to be characterized by a short voicing lag (average VOT ranged from 14 to 32 ms) while the voiced plosives /b/, /d/ and /g/ were characterized by a long voicing lead (average VOT ranged from -79 to -91 ms). Furthermore, we observed significantly longer duration of both VOT (p < 0.05) and VOT ratio (p < 0.01) of voiceless plosives in female compared to male participants. Finally, we found a significant negative correlation between age and duration of voiceless (r = -0.36, p < 0.05) as well as voiced VOT (r = -0.45, p = 0.01) in female but not in male participants.


2019 ◽  
Vol 5 (3) ◽  
pp. 402-434
Author(s):  
Katharina S. Schuhmann ◽  
Marie K. Huffman

We present a study of the development of L2 stop VOT (voice onset time) in lower-level English-speaking learners of Spanish over the course of a college semester. Participants were recorded six times in two-week intervals. Halfway through the semester, students received a brief pronunciation training session with practice and feedback. Overall, the learners did not lower their L2 VOTs in the first half of the study, before pronunciation training. Following training, however, they lowered their mean VOTs for Spanish voiceless stops significantly. A similar effect was not found for their mean VOTs of Spanish voiced stops, in line with prior work suggesting that prevoicing may be harder to acquire. Yet careful examination suggests that learners are increasing the frequency with which they use prevoicing in Spanish, suggesting this metric might inform future work on L2 Spanish pronunciation development. This work has implications for teaching and research in second language pronunciation.


2005 ◽  
Vol 48 (5) ◽  
pp. 1013-1024 ◽  
Author(s):  
Christopher R. McCrea ◽  
Richard J. Morris

The purpose of this study was to examine the effect of fundamental frequency (F0) on stop consonant voice onset time (VOT). VOT was measured from the recordings of 56 young men reading phrases containing all 6 English voiced and voiceless stops in word-initial position across high-, medium-, and low-F0 levels. Separate analyses of variance for the voiced and voiceless stops revealed no significant main effect for F0 for the voiced stops but a significant F0 effect for the voiceless stops. Across the voiceless stops, productions at high F0s displayed significantly shorter VOTs than productions at low or mid F0s. The findings indicated that researchers must take into account the F0 level at which voiceless stop VOT is measured.


2014 ◽  
Vol 30 (2) ◽  
pp. 129-157 ◽  
Author(s):  
Monika S. Schmid ◽  
Steven Gilbers ◽  
Amber Nota

The present article provides an exploration of ultimate attainment in second language (L2) and its limitations. It is argued that the question of maturational constraints can best be investigated when the reference population is bilingual and exposed on a regular basis to varieties of their first language (L1) that show cross-linguistic influence. To this end, 20 advanced Dutch–English bilinguals are compared to 9 English native speakers immersed in a Dutch environment. All participants are teachers or students of English at a Dutch institution of higher education. The populations are shown to be at similar global proficiency levels. Two phonetic variables (voice onset time or VOT and vowel discrimination) and one grammatical variable (verb phrase ellipsis), which are assumed to present particular challenges to Dutch learners of English, are explored, and speakers are furthermore rated for their global nativeness. The findings show no differences between populations on VOT but some variance on the production of a vowel that has no correlate in Dutch (the English trap vowel). However, all but one of the L2ers are rated outside the range of the natives on perceived foreign accent. There are also differences between groups where acceptance of different sentence types with verb phrase ellipsis is concerned. We interpret these findings to indicate that there are areas of L2 knowledge and production that are persistently difficult to acquire even under circumstances that are highly favourable for L2 acquisition.


1980 ◽  
Vol 7 (3) ◽  
pp. 433-458 ◽  
Author(s):  
Marlys A. Macken ◽  
David Barton

This paper reports on the acquisition of the voicing contrast in Mexican–Spanish word-initial stops. In Study 1, three monolingual children were recorded every two weeks for seven months, beginning when the children were about 1;7. In Study 2, four monolingual children about 3;10 were recorded once or twice. Two analyses were done. Instrumental analysis of the stop productions revealed that not even by age 3;10 were the children consistently distinguishing between voiced–voiceless stop cognate pairs on the basis of adult-like voice-onset time characteristics. The spirantization analysis, however, more clearly revealed the children's phonological knowledge. Discussion focuses on the implications of the data for phonological development in general and for the phonological description of voicing in Spanish.

