Gated Bilinear Networks for Vowel Formant Estimation

Exploring vowel formant estimation through simulation-based techniques

Linguistics Vanguard ◽

10.1515/lingvan-2018-0060 ◽

2020 ◽

Vol 6 (s1) ◽

Cited By ~ 1

Author(s):

Tyler Kendall ◽

Charlotte Vaughn

Keyword(s):

Predictive Coding ◽

Linear Predictive Coding ◽

Fine Grained ◽

Vowel Formant ◽

Simulation Based ◽

Insight Into

AbstractThis paper contributes insight into the sources of variability in vowel formant estimation, a major analytic activity in sociophonetics, by reviewing the outcomes of two simulations that manipulated the settings used for linear predictive coding (LPC)-based vowel formant estimation. Simulation 1 explores the range of frequency differences obtained when minor adjustments are made to LPC settings, and measurement timepoints around the settings used by trained analysts, in order to determine the range of variability that should be expected in sociophonetic vowel studies. Simulation 2 examines the variability that emerges when LPC settings are varied combinatorially around constant default settings, rather than settings set by trained analysts. The impacts of different LPC settings are discussed as a way of demonstrating the inherent properties of LPC-based formant estimation. This work suggests that differences more fine-grained than about 10 Hz in F1 and 15–20 Hz in F2 are within the range of LPC-based formant estimation variability.

Download Full-text

An Experience with Multilingual Infovox Sythesizer and its Vowel Formant Analysis

IETE Journal of Research ◽

10.1080/03772063.1999.11416125 ◽

1999 ◽

Vol 45 (5-6) ◽

pp. 383-384

Author(s):

I Khan ◽

S K Gupta

Keyword(s):

Vowel Formant ◽

Formant Analysis

Download Full-text

Isovowel lines for the evaluation of vowel formant structure in patients with hemiglossectomy.

The Japan Journal of Logopedics and Phoniatrics ◽

10.5112/jjlp.26.245 ◽

1985 ◽

Vol 26 (3) ◽

pp. 245-253

Author(s):

Hiroshi Watanabe ◽

Takemoto Shin ◽

Koichi Matsuo ◽

Junichi Fukaura ◽

Mariko Tomita

Keyword(s):

Vowel Formant

Download Full-text

Estimating vowel formant discrimination thresholds using a single-interval classification task

The Journal of the Acoustical Society of America ◽

10.1121/1.3086269 ◽

2009 ◽

Vol 125 (4) ◽

pp. 2323-2335 ◽

Cited By ~ 4

Author(s):

Eric Oglesbee ◽

Diane Kewley-Port

Keyword(s):

Classification Task ◽

Vowel Formant ◽

Single Interval

Download Full-text

Factors affecting vowel formant discrimination by hearing-impaired listeners

The Journal of the Acoustical Society of America ◽

10.1121/1.2781580 ◽

2007 ◽

Vol 122 (5) ◽

pp. 2855 ◽

Cited By ~ 9

Author(s):

Chang Liu ◽

Diane Kewley-Port

Keyword(s):

Hearing Impaired ◽

Factors Affecting ◽

Vowel Formant

Download Full-text

Variability of vowel formant frequencies and speaker normalization: The cases of Mandarin and Taiwanese

The Journal of the Acoustical Society of America ◽

10.1121/1.413993 ◽

1995 ◽

Vol 98 (5) ◽

pp. 2966-2966

Author(s):

Kuo‐You Huang ◽

Chin‐Hsing Tseng

Keyword(s):

Speaker Normalization ◽

Formant Frequencies ◽

Vowel Formant

Download Full-text

Vowel formant frequency characteristics of preadolescent males and females

The Journal of the Acoustical Society of America ◽

10.1121/1.385343 ◽

1981 ◽

Vol 69 (1) ◽

pp. 231-238 ◽

Cited By ~ 46

Author(s):

Suzanne Bennett

Keyword(s):

Frequency Characteristics ◽

Formant Frequency ◽

Vowel Formant ◽

Males And Females

Download Full-text

Acoustic Characteristics of Dysarthria Associated with Cerebellar Disease

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.2203.627 ◽

1979 ◽

Vol 22 (3) ◽

pp. 627-648 ◽

Cited By ~ 97

Author(s):

Ray D. Kent ◽

Ronald Netsell ◽

James H. Abbs

Keyword(s):

Normal Subjects ◽

Cerebellar Disease ◽

Acoustic Characteristics ◽

Essentially Normal ◽

Vowel Formant ◽

Timing Pattern ◽

Ataxic Dysarthria ◽

Acoustic Analyses ◽

Simple Sentences ◽

Number Of Segments

The speech of five individuals with cerebellar disease and ataxic dysarthria was studied with acoustic analyses of CVC words, words of varying syllabic structure (stem, stem plus suffix, stem plus two suffixes), simple sentences, the Rainbow Passage, and conversation. The most consistent and marked abnormalities observed in spectrograms were alterations of the normal timing pattern, with prolongation of a variety of segments and a tendency toward equalized syllable durations. Vowel formant structure in the CVC words was judged to be essentially normal except for transitional segments. The greater the severity of the dysarthria, the greater the number of segments lengthened and the degree of lengthening of individual segments. The ataxic subjects were inconsistent in durational adjustments of the stem syllable as the number of syllables in a word was varied and generally made smaller reductions than normal subjects as suffixes were added. Disturbances of syllable timing frequently were accompanied by abnormal contours of fundamental frequency, particularly monotone and syllable-falling patterns. These dysprosodic aspects of ataxic dysarthria are discussed in relation to cerebellar function in motor control.

Download Full-text

Associations Between Speaking Fundamental Frequency, Vowel Formant Frequencies, and Listener Perceptions of Speaker Gender and Vocal Femininity–Masculinity

Journal of Speech Language and Hearing Research ◽

10.1044/2021_jslhr-20-00747 ◽

2021 ◽

pp. 1-23

Author(s):

Yeptain Leung ◽

Jennifer Oates ◽

Siew-Pang Chan ◽

Viktória Papp

Keyword(s):

Fundamental Frequency ◽

Structural Equation ◽

Model Building ◽

Principal Component ◽

Equation Modeling ◽

Formant Frequencies ◽

Vowel Formant ◽

Vowel Space ◽

Australian English ◽

Speaking Fundamental Frequency

Purpose The aim of the study was to examine associations between speaking fundamental frequency ( f os ), vowel formant frequencies ( F ), listener perceptions of speaker gender, and vocal femininity–masculinity. Method An exploratory study was undertaken to examine associations between f os , F 1 – F 3 , listener perceptions of speaker gender (nominal scale), and vocal femininity–masculinity (visual analog scale). For 379 speakers of Australian English aged 18–60 years, f os mode and F 1 – F 3 (12 monophthongs; total of 36 F s) were analyzed on a standard reading passage. Seventeen listeners rated speaker gender and vocal femininity–masculinity on randomized audio recordings of these speakers. Results Model building using principal component analysis suggested the 36 F s could be succinctly reduced to seven principal components (PCs). Generalized structural equation modeling (with the seven PCs of F and f os as predictors) suggested that only F 2 and f os predicted listener perceptions of speaker gender (male, female, unable to decide). However, listener perceptions of vocal femininity–masculinity behaved differently and were predicted by F 1 , F 3 , and the contrast between monophthongs at the extremities of the F 1 acoustic vowel space, in addition to F 2 and f os . Furthermore, listeners' perceptions of speaker gender also influenced ratings of vocal femininity–masculinity substantially. Conclusion Adjusted odds ratios highlighted the substantially larger contribution of F to listener perceptions of speaker gender and vocal femininity–masculinity relative to f os than has previously been reported.

Download Full-text

Evaluation of Mental Fatigue Using Vowel Formant Analysis

Journal of Society of Korea Industrial and Systems Engineering ◽

10.11627/jkise.2014.37.1.26 ◽

2014 ◽

Vol 37 (1) ◽

pp. 26-32 ◽

Cited By ~ 1

Author(s):

Wook Hyun Ha ◽

◽

Sung Ha Park

Keyword(s):

Mental Fatigue ◽

Vowel Formant ◽

Formant Analysis

Download Full-text