vowel formant
Recently Published Documents

Purpose The aim of the study was to examine associations between speaking fundamental frequency (fos), vowel formant frequencies (F), listener perceptions of speaker gender, and vocal femininity–masculinity. Method An exploratory study was undertaken to examine associations between fos, F1–F3, listener perceptions of speaker gender (nominal scale), and vocal femininity–masculinity (visual analog scale). For 379 speakers of Australian English aged 18–60 years, fos mode and F1–F3 (12 monophthongs; total of 36 Fs) were analyzed on a standard reading passage. Seventeen listeners rated speaker gender and vocal femininity–masculinity on randomized audio recordings of these speakers. Results Model building using principal component analysis suggested the 36 Fs could be succinctly reduced to seven principal components (PCs). Generalized structural equation modeling (with the seven PCs of F and fos as predictors) suggested that only F2 and fos predicted listener perceptions of speaker gender (male, female, unable to decide). However, listener perceptions of vocal femininity–masculinity behaved differently and were predicted by F1, F3, and the contrast between monophthongs at the extremities of the F1 acoustic vowel space, in addition to F2 and fos. Furthermore, listeners' perceptions of speaker gender also influenced ratings of vocal femininity–masculinity substantially. Conclusion Adjusted odds ratios highlighted the substantially larger contribution of F to listener perceptions of speaker gender and vocal femininity–masculinity relative to fos than has previously been reported.

The Effects of Language Contact on Non-Native Vowel Sequences in Lexical Borrowings: The Case of Media Lengua

Language and Speech ◽

10.1177/00238309211014911 ◽

2021 ◽

pp. 002383092110149

Author(s):

Sky Onosson ◽

Jesse Stewart

Keyword(s):

Mixed Models ◽

Language Contact ◽

Vowel Formant ◽

Vowel Space ◽

Sequence Patterns ◽

Generalized Additive Mixed Models ◽

Additive Mixed Models ◽

Acoustic Space ◽

The Difference ◽

High Vowels

Media Lengua (ML), a mixed language derived from Quichua and Spanish, exhibits a phonological system that largely conforms to that of Quichua acoustically. Yet, it incorporates a large number of vowel sequences from Spanish which do not occur in the Quichua system. This includes the use of mid-vowels, which are phonetically realized in ML as largely overlapping with the high-vowels in acoustic space. We analyze and compare production of vowel sequences by speakers of ML, Quichua, and Spanish through the use of generalized additive mixed models to determine statistically significant differences between vowel formant trajectories. Our results indicate that Spanish-derived ML vowel sequences frequently differ significantly from their Spanish counterparts, largely occupying a more central region of the vowel space and frequently exhibiting markedly reduced trajectories over time. In contrast, we find only one case where an ML vowel sequence differs significantly from its Quichua counterpart—and even in this case the difference from Spanish is substantially greater. Our findings show how the vowel system of ML successfully integrates novel vowel sequence patterns from Spanish into what is essentially Quichua phonology by markedly adapting their production, while still maintaining contrasts which are not expressed in Quichua.

Intact Correction for Self-Produced Vowel Formant Variability in Individuals With Cerebellar Ataxia Regardless of Auditory Feedback Availability

Journal of Speech Language and Hearing Research ◽

10.1044/2021_jslhr-20-00270 ◽

2021 ◽

pp. 1-14

Author(s):

Benjamin Parrell ◽

Richard B. Ivry ◽

Srikantan S. Nagarajan ◽

John F. Houde

Keyword(s):

Feedback Control ◽

Cerebellar Ataxia ◽

Auditory Feedback ◽

Cerebellar Degeneration ◽

Vowel Production ◽

Compensatory Response ◽

Vowel Context ◽

Somatosensory Feedback ◽

Vowel Formant ◽

Masking Noise

Purpose Individuals with cerebellar ataxia (CA) caused by cerebellar degeneration exhibit larger reactive compensatory responses to unexpected auditory feedback perturbations than neurobiologically typical speakers, suggesting they may rely more on feedback control during speech. We test this hypothesis by examining variability in unaltered speech. Previous studies of typical speakers have demonstrated a reduction in formant variability (centering) observed during the initial phase of vowel production from vowel onset to vowel midpoint. Centering is hypothesized to reflect feedback-based corrections for self-produced variability and thus may provide a behavioral assay of feedback control in unperturbed speech in the same manner as the compensatory response does for feedback perturbations. Method To comprehensively compare centering in individuals with CA and controls, we examine centering in two vowels (/i/ and /ɛ/) under two contexts (isolated words and connected speech). As a control, we examine speech produced both with and without noise to mask auditory feedback. Results Individuals with CA do not show increased centering compared to age-matched controls, regardless of vowel, context, or masking. Contrary to previous results in neurobiologically typical speakers, centering was not affected by the presence of masking noise in either group. Conclusions The similar magnitude of centering seen with and without masking noise questions whether centering is driven by auditory feedback. However, if centering is at least partially driven by auditory/somatosensory feedback, these results indicate that the larger compensatory response to altered auditory feedback observed in individuals with CA may not reflect typical motor control processes during normal, unaltered speech production.

Perceptual validation of vowel normalization methods for variationist research

Language Variation and Change ◽

10.1017/s0954394521000016 ◽

2021 ◽

pp. 1-27

Author(s):

Santiago Barreda

Keyword(s):

Phonetic Variation ◽

Vowel Formant ◽

Normalization Methods ◽

Vowel Space ◽

Vowel Normalization ◽

Uniform Scaling ◽

Do So

The evaluation of normalization methods sometimes focuses on the maximization of vowel-space similarity. This focus can lead to the adoption of methods that erase legitimate phonetic variation from our data, that is, overnormalization. First, a production corpus is presented that highlights three types of variation in formant patterns: uniform scaling, nonuniform scaling, and centralization. Then the results of two perceptual experiments are presented, both suggesting that listeners tend to ignore variation according to uniform scaling, while associating nonuniform scaling and centralization with phonetic differences. Overall, results suggest that normalization methods that remove variation not according to uniform scaling can remove legitimate phonetic variation from vowel formant data. As a result, although these methods can provide more similar vowel spaces, they do so by erasing phonetic variation from vowel data that may be socially and linguistically meaningful, including a potential male-female difference in the low vowels in our corpus.

Acoustic differentiation of allophones of /aɪ/ in Chicagoland English: Statistical comparison of formant trajectories

Journal of the International Phonetic Association ◽

10.1017/s0025100320000158 ◽

2021 ◽

pp. 1-31

Author(s):

José Ignacio Hualde ◽

Marissa Barlaz ◽

Tatiana Luchkina

Keyword(s):

Mixed Models ◽

Acoustic Analysis ◽

Complete Understanding ◽

Limited Sampling ◽

Quantitative Studies ◽

Vowel Formant ◽

Adequate Understanding ◽

The Us ◽

Generalized Additive Mixed Models ◽

Additive Mixed Models

Diphthongs have a dynamic formant structure. Nevertheless, many quantitative studies of diphthongs are based on measurements at only two points, somewhere in the nucleus and somewhere in the glide. The question arises as to whether analyses based on values at only two points provide an adequate understanding of the dynamics of diphthongs. Wieling (2018) mentions the analysis of /aɪ/ raising in Chicago English in Hualde, Luchkina & Eager (2017) as one of several examples of recent studies where potentially interesting patterns may have been missed because of limited sampling of formant values, and proposes using Generalized Additive Mixed Models (GAMM) to allow a more complete understanding of diphthong dynamics. In this paper, we examine the acoustic nature of the (quasi-)phonemic differentiation between two originally allophonic variants of the diphthong /aɪ/ in the US English of Chicago and the surrounding area. We offer an acoustic analysis based on full formant trajectories of diphthongs with data obtained from a group of 53 speakers. The results of a GAMM analysis are then compared with those obtained in Hualde et al. (2017), which considered values at only two points and from a smaller set of speakers (17). We also discuss the main advantages of GAMM analysis over other techniques that have been proposed for the analysis of differences in vowel formant dynamics.

Musical Hearing and Musical Experience in Second Language English Vowel Acquisition

Journal of Speech Language and Hearing Research ◽

10.1044/2021_jslhr-19-00253 ◽

2021 ◽

pp. 1-17

Author(s):

Mateusz Jekiel ◽

Kamil Malarski

Keyword(s):

Second Language ◽

Foreign Language ◽

Music Perception ◽

Musical Experience ◽

Music Practice ◽

Musical Skills ◽

Vowel Formant ◽

English Vowels ◽

Before And After ◽

Vowel Acquisition

Purpose Former studies suggested that music perception can help produce certain accentual features in the first and second language (L2), such as intonational contours. What was missing in many of these studies was the identification of the exact relationship between specific music perception skills and the production of different accentual features in a foreign language. Our aim was to verify whether empirically tested musical hearing skills can be related to the acquisition of English vowels by learners of English as an L2 before and after a formal accent training course. Method Fifty adult Polish speakers of L2 English were tested before and after a two-semester accent training in order to observe the effect of musical hearing on the acquisition of English vowels. Their L2 English vowel formant contours produced in consonant–vowel–consonant context were compared with the target General British vowels produced by their pronunciation teachers. We juxtaposed these results with their musical hearing test scores and self-reported musical experience to observe a possible relationship between successful L2 vowel acquisition and musical aptitude. Results Preexisting rhythmic memory was reported as a significant predictor before training, while musical experience was reported as a significant factor in the production of more native-like L2 vowels after training. We also observed that not all vowels were equally acquired or affected by musical hearing or musical experience. The strongest estimate we observed was the closeness to model before training, suggesting that learners who already managed to acquire some features of a native-like accent were also more successful after training. Conclusions Our results are revealing in two aspects. First, the learners' former proficiency in L2 pronunciation is the most robust predictor in acquiring a native-like accent. 
Second, there is a potential relationship between rhythmic memory and L2 vowel acquisition before training, as well as years of musical experience after training, suggesting that specific musical skills and music practice can be an asset in learning a foreign language accent.

Acoustic analysis of vowel formant frequencies in genetically-related and non-genetically related speakers with implications for forensic speaker comparison

PLoS ONE ◽

10.1371/journal.pone.0246645 ◽

2021 ◽

Vol 16 (2) ◽

pp. e0246645

Author(s):

Julio Cesar Cavalcanti ◽

Anders Eriksson ◽

Plinio A. Barbosa

Keyword(s):

High Frequency ◽

Acoustic Analysis ◽

Low Frequency ◽

Identical Twin ◽

Vowel Quality ◽

Male Adult ◽

Formant Frequencies ◽

Phonetic Similarity ◽

Vowel Formant ◽

Speaker Discrimination

The purpose of this study was to explore the speaker-discriminatory potential of vowel formant mean frequencies in comparisons of identical twin pairs and non-genetically related speakers. The influences of lexical stress and the vowels’ acoustic distances on the discriminatory patterns of formant frequencies were also assessed. Acoustic extraction and analysis of the first four speech formants F1-F4 were carried out using spontaneous speech materials. The recordings comprise telephone conversations between identical twin pairs while being directly recorded through high-quality microphones. The subjects were 20 male adult speakers of Brazilian Portuguese (BP), aged between 19 and 35. As for comparisons, stressed and unstressed oral vowels of BP were segmented and transcribed manually in the Praat software. F1-F4 formant estimates were automatically extracted from the middle points of each labeled vowel. Formant values were represented in both Hertz and Bark. Comparisons within identical twin pairs using the Bark scale were performed to verify whether the measured differences would be potentially significant when following a psychoacoustic criterion. The results revealed consistent patterns regarding the comparison of low-frequency and high-frequency formants in twin pairs and non-genetically related speakers, with high-frequency formants displaying a greater speaker-discriminatory power compared to low-frequency formants. Among all formants, F4 seemed to display the highest discriminatory potential within identical twin pairs, followed by F3. As for non-genetically related speakers, both F3 and F4 displayed a similar high discriminatory potential. Regarding vowel quality, the central vowel /a/ was found to be the most speaker-discriminatory segment, followed by front vowels. 
Moreover, stressed vowels displayed higher inter-speaker discrimination than unstressed vowels in both groups; however, the combination of stressed and unstressed vowels was found to be even more explanatory in terms of the observed differences. Although identical twins displayed a higher phonetic similarity, they were not found to be phonetically identical.

Extending automatic vowel formant extraction to New Englishes

English World-Wide ◽

10.1075/eww.00060.mee ◽

2021 ◽

Vol 42 (1) ◽

Author(s):

Philipp Meer ◽

Thorsten Brato ◽

José Alejandro Matute Flores

Keyword(s):

Speech Analysis ◽

Specific Level ◽

Vowel Formant ◽

Automatic Methods ◽

Automated Procedures ◽

Phonetic Study ◽

Better Than

While different automated procedures for vowel formant prediction have recently been proposed, it is unclear how reliably these methods perform in the phonetic study of vowels in New Englishes and how such approaches could be applied to specific varieties. This paper compares different automatic methods for vowel formant prediction in New Englishes, using manual measurements of Trinidadian English as a baseline. The results show that all methods perform significantly better than default formant parameters often used in speech analysis packages, and that a Bayesian formant tracker calibrated with American (US-FAVE) and Trinidadian English (TRINI-FAVE) generally provides better results than an automatic procedure that optimizes formant ceilings on a vowel- and speaker-specific level. TRINI-FAVE measures vowels characteristic of Trinidadian English most accurately. Phonetic studies of vowels in New Englishes can benefit from these methods.

Gated Bilinear Networks for Vowel Formant Estimation

2020 International Conference on Asian Language Processing (IALP) ◽

10.1109/ialp51396.2020.9310481 ◽

2020 ◽

Author(s):

Wang Dai ◽

Zheng Hua ◽

Jinsong Zhang ◽

Yanlu Xie ◽

Binghuai Lin

Keyword(s):

Vowel Formant

Response patterns to vowel formant perturbations in children

Associations Between Speaking Fundamental Frequency, Vowel Formant Frequencies, and Listener Perceptions of Speaker Gender and Vocal Femininity–Masculinity

