The Identification of a Speaker's Sex from Synthesized Vowels

1998 ◽  
Vol 87 (2) ◽  
pp. 595-600 ◽  
Author(s):  
S. P. Whiteside

This experiment assessed whether fundamental frequency or formant frequencies have more perceptual salience in the identification of the sex of the speaker from synthesized vowels. Four sets of ten vowels were synthesized by combining fundamental frequencies and formant frequencies with different permutations 50 listeners took part in a listening test. Analysis of the listening test scores suggested that for 36 vowels, the fundamental frequency (F0) was probably the most salient perceptual cue. For the remaining four vowels, however, this was not the case as either the formant frequencies or the onset-offset patterns of the F0 appeared to have some perceptual salience.

1983 ◽  
Vol 26 (1) ◽  
pp. 89-97 ◽  
Author(s):  
Randall B. Monsen ◽  
A. Maynard Engebretson

The accuracy of spectrographic techniques and of linear prediction analysis in measuring formant frequencies is compared. The first three formant frequencies 90 synthetic speech tokens were measured by three experienced spectrographic readers and by linear prediction analysis. For fundamental frequencies between 100 and 300 Hz, both methods are accurate to within approximatey , ±60 Hz for both first and second formants. The third formant can be measured with the same degree of accuracy by linear prediction, but only to within ± 110 by spectrographic means. The accuracy of both methods decreases greatly when fundamental frequency is 350 Hz or greater. These limits of measurement appear to be within the range of the difference limens for formant frequencies


1979 ◽  
Vol 10 (4) ◽  
pp. 246-248 ◽  
Author(s):  
Peter B. Mueller ◽  
Marla Adams ◽  
Jean Baehr-Rouse ◽  
Debbie Boos

Mean fundamental frequencies of male and female subjects obtained with FLORIDA I and a tape striation counting procedure were compared. The fundamental frequencies obtained with these two methods were similar and it appears that the tape striation counting procedure is a viable, simple, and inexpensive alternative to more costly and complicated procedures and instrumentation.


ALQALAM ◽  
2015 ◽  
Vol 32 (2) ◽  
pp. 284
Author(s):  
Muhammad Subali ◽  
Miftah Andriansyah ◽  
Christanto Sinambela

This article aims to look at the similarities and differences in the fundamental frequency and formant frequencies using the autocorrelation function and LPCfunction in GUI MATLAB 2012b on sound hijaiyah letters for adult male speaker beginner and expert based on makhraj pronunciation and both of speaker will be analysis on matching distance of the sound use DTW method on cepstrum. Subject for speech beginner makhraj pronunciation are taken from college student of Universitas Gunadarma and SITC aged 22 years old Data of the speech beginner makhraj pronunciation is recorded using MATLAB algorithm on GUI Subject for speech expert makhraj pronunciation are taken from previous research. They are 20-30 years old from the time of taking data. The sound will be extracted to get the value of the fundamental frequency and formant frequency. After getting both frequencies, it will be obtained analysis of the similarities and differences in the fundamental frequency and formant frequencies of speech beginner and expert and it will shows matching distance of both speech. The result is all of speech beginner and expert based on makhraj pronunciation have different values of fundamental frequency and formant frequency. Then the results of the analysis matching distance using method DTW showed that obtained in the range of 28.9746 to 136.4 between speech beginner and expert based on makhraj pronunciation. Keywords: fundamental frequency, formant frequency, hijaiyah letters, makhraj


1997 ◽  
Vol 119 (4) ◽  
pp. 451-456 ◽  
Author(s):  
C. Lay ◽  
O. A. Abu-Yasein ◽  
M. A. Pickett ◽  
J. Madia ◽  
S. K. Sinha

The damping coefficients and ratios of piping system snubber supports were found to vary logarithmically with pipe support nodal displacement. For piping systems with fundamental frequencies in the range of 0.6 to 6.6 Hz, the support damping ratio for snubber supports was found to increase with increasing fundamental frequency. For 3-kip snubbers, damping coefficient and damping ratio decreased logarithmically with nodal displacement, indicating that the 3-kip snubbers studied behaved essentially as coulomb dampers; while for the 10-kip snubbers studied, damping coefficient and damping ratio increased logarithmically with nodal displacement.


Animals ◽  
2018 ◽  
Vol 8 (10) ◽  
pp. 167 ◽  
Author(s):  
Anton Baotic ◽  
Maxime Garcia ◽  
Markus Boeckle ◽  
Angela Stoeger

African savanna elephants live in dynamic fission–fusion societies and exhibit a sophisticated vocal communication system. Their most frequent call-type is the ‘rumble’, with a fundamental frequency (which refers to the lowest vocal fold vibration rate when producing a vocalization) near or in the infrasonic range. Rumbles are used in a wide variety of behavioral contexts, for short- and long-distance communication, and convey contextual and physical information. For example, maturity (age and size) is encoded in male rumbles by formant frequencies (the resonance frequencies of the vocal tract), having the most informative power. As sound propagates, however, its spectral and temporal structures degrade progressively. Our study used manipulated and resynthesized male social rumbles to simulate large and small individuals (based on different formant values) to quantify whether this phenotypic information efficiently transmits over long distances. To examine transmission efficiency and the potential influences of ecological factors, we broadcasted and re-recorded rumbles at distances of up to 1.5 km in two different habitats at the Addo Elephant National Park, South Africa. Our results show that rumbles were affected by spectral–temporal degradation over distance. Interestingly and unlike previous findings, the transmission of formants was better than that of the fundamental frequency. Our findings demonstrate the importance of formant frequencies for the efficiency of rumble propagation and the transmission of information content in a savanna elephant’s natural habitat.


2018 ◽  
Vol 35 (3) ◽  
pp. 305-318 ◽  
Author(s):  
Kimiko Tsukada ◽  
Jeong-Im Han

While it is well established that non-native speakers differ from native speakers in their perception and/or production of Mandarin lexical tones, empirical studies focusing on non-native learners are still limited. The objective of this study is to add to the current understanding of lexical tone perception by comparing native speakers of standard Korean from the Seoul/Kyunggi area differing in Mandarin experience (NK1, NK2) with native speakers of Mandarin. NK1 ( n = 10) had no experience with Mandarin whereas NK2 ( n = 10) consisted of highly advanced learners of Mandarin. A group of 10 native Mandarin (NM) speakers was included as controls. Accuracy of perception of six tone pairs (T1–T2, T1–T3, T1–T4, T2–T3, T2–T4, T3–T4) was assessed in a four-alternative forced-choice discrimination test. As expected, the NK2 group with extensive Mandarin learning experience resembled the NM group to a greater extent than did the NK1 group. T2–T3 was the hardest pair for both NK groups, but NK2 had the largest advantage over NK1 for this pair. Apart from T2–T3 which is generally considered difficult, tone pairs involving T1 caused some misperception by the NK groups. This may be related to the difficulty with perceiving a level tone which shows the least fundamental frequency (F0) movement and possibly has limited perceptual salience.


Author(s):  
Yeptain Leung ◽  
Jennifer Oates ◽  
Siew-Pang Chan ◽  
Viktória Papp

Purpose The aim of the study was to examine associations between speaking fundamental frequency ( f os ), vowel formant frequencies ( F ), listener perceptions of speaker gender, and vocal femininity–masculinity. Method An exploratory study was undertaken to examine associations between f os , F 1 – F 3 , listener perceptions of speaker gender (nominal scale), and vocal femininity–masculinity (visual analog scale). For 379 speakers of Australian English aged 18–60 years, f os mode and F 1 – F 3 (12 monophthongs; total of 36 F s) were analyzed on a standard reading passage. Seventeen listeners rated speaker gender and vocal femininity–masculinity on randomized audio recordings of these speakers. Results Model building using principal component analysis suggested the 36 F s could be succinctly reduced to seven principal components (PCs). Generalized structural equation modeling (with the seven PCs of F and f os as predictors) suggested that only F 2 and f os predicted listener perceptions of speaker gender (male, female, unable to decide). However, listener perceptions of vocal femininity–masculinity behaved differently and were predicted by F 1 , F 3 , and the contrast between monophthongs at the extremities of the F 1 acoustic vowel space, in addition to F 2 and f os . Furthermore, listeners' perceptions of speaker gender also influenced ratings of vocal femininity–masculinity substantially. Conclusion Adjusted odds ratios highlighted the substantially larger contribution of F to listener perceptions of speaker gender and vocal femininity–masculinity relative to f os than has previously been reported.


MANUSYA ◽  
2012 ◽  
Vol 15 (2) ◽  
pp. 1-20
Author(s):  
Chommanad Intajamornrak

This paper aims to analyze and compare the acoustic characteristics of the vowels and tones in the Phrae Pwo Karen spoken by three generations. The data was collected at Khangchai Village in Wang Chin District, Phrae Province. A wordlist of Pwo Karen vowels and tones was recorded directly on to computer using Adobe Audition version 2. Fifteen female informants were divided into three groups: those over 60 years old, those 35-50 years old and those under 25 years old. The total number of test tokens was 405 for vowel analysis, and 810 for tone analysis. The fundamental frequencies and formant frequencies were measured using Praat version 5.1.43.


Author(s):  
Johan Sundberg

The function of the voice organ is basically the same in classical singing as in speech. However, loud orchestral accompaniment has necessitated the use of the voice in an economical way. As a consequence, the vowel sounds tend to deviate considerably from those in speech. Male voices cluster formant three, four, and five, so that a marked peak is produced in spectrum envelope near 3,000 Hz. This helps them to get heard through a loud orchestral accompaniment. They seem to achieve this effect by widening the lower pharynx, which makes the vowels more centralized than in speech. Singers often sing at fundamental frequencies higher than the normal first formant frequency of the vowel in the lyrics. In such cases they raise the first formant frequency so that it gets somewhat higher than the fundamental frequency. This is achieved by reducing the degree of vocal tract constriction or by widening the lip and jaw openings, constricting the vocal tract in the pharyngeal end and widening it in the mouth. These deviations from speech cause difficulties in vowel identification, particularly at high fundamental frequencies. Actually, vowel identification is almost impossible above 700 Hz (pitch F5). Another great difference between vocal sound produced in speech and the classical singing tradition concerns female voices, which need to reduce the timbral differences between voice registers. Females normally speak in modal or chest register, and the transition to falsetto tends to happen somewhere above 350 Hz. The great timbral differences between these registers are avoided by establishing control over the register function, that is, over the vocal fold vibration characteristics, so that seamless transitions are achieved. In many other respects, there are more or less close similarities between speech and singing. Thus, marking phrase structure, emphasizing important events, and emotional coloring are common principles, which may make vocal artists deviate considerably from the score’s nominal description of fundamental frequency and syllable duration.


Sign in / Sign up

Export Citation Format

Share Document