Modeling the Fundamental Frequency of the Voice

1982 ◽  
Vol 27 (9) ◽  
pp. 690-692 ◽  
Author(s):  
Janet Pierrehumbert ◽  
Mark Liberman
Author(s):  
Johan Sundberg

The function of the voice organ is basically the same in classical singing as in speech. However, loud orchestral accompaniment has necessitated the use of the voice in an economical way. As a consequence, the vowel sounds tend to deviate considerably from those in speech. Male voices cluster formant three, four, and five, so that a marked peak is produced in spectrum envelope near 3,000 Hz. This helps them to get heard through a loud orchestral accompaniment. They seem to achieve this effect by widening the lower pharynx, which makes the vowels more centralized than in speech. Singers often sing at fundamental frequencies higher than the normal first formant frequency of the vowel in the lyrics. In such cases they raise the first formant frequency so that it gets somewhat higher than the fundamental frequency. This is achieved by reducing the degree of vocal tract constriction or by widening the lip and jaw openings, constricting the vocal tract in the pharyngeal end and widening it in the mouth. These deviations from speech cause difficulties in vowel identification, particularly at high fundamental frequencies. Actually, vowel identification is almost impossible above 700 Hz (pitch F5). Another great difference between vocal sound produced in speech and the classical singing tradition concerns female voices, which need to reduce the timbral differences between voice registers. Females normally speak in modal or chest register, and the transition to falsetto tends to happen somewhere above 350 Hz. The great timbral differences between these registers are avoided by establishing control over the register function, that is, over the vocal fold vibration characteristics, so that seamless transitions are achieved. In many other respects, there are more or less close similarities between speech and singing. Thus, marking phrase structure, emphasizing important events, and emotional coloring are common principles, which may make vocal artists deviate considerably from the score’s nominal description of fundamental frequency and syllable duration.


2018 ◽  
Vol 8 (5) ◽  
pp. 27
Author(s):  
Abdul Abbasi ◽  
Mansoor Channa ◽  
Masood Memon ◽  
Stephen John ◽  
Irtaza Ahmed ◽  
...  

The purpose of this investigation was to document acoustic characteristics of Pakistani English (PaKE) vowel sounds. The experiment was designed to examine the properties of ten vowels produced by Pakistani ESL learners. The analysis is based on the voice samples of recorded 50 CVC words. Total 5000 (10  10  50=5000) voiced samples were analyzed. The data consisted of 50 words of ten English vowel sounds [i: ɪ e ɔ: æ ə ɑ: u: ɒ ʊ]. Ten ESL speakers recorded their voice samples on Praat speech processing tool installed on laptop. Three parameters were considered i.e., fundamental frequency (F0), vowel quality (F1-F2) and duration. Formant patterns were judged manually by visual inspection on Praat Speech Processing Tool. Analysis of formant frequency shows numerous differences between male and female of F1 and F2, fundamental frequency and duration of English vowels. The voice samples provide evidence for higher and lower frequency of vowel sounds. Additionally, the data analysis illustrates that there were statistical differences in the values of short and long vowels coupled with vowel space plot showing explicit differences in locating the production of vowels of male & female vowel space acoustic realizations.


Author(s):  
Lourdes Bernadete Rocha de SOUZA ◽  
Rayane Medeiros PEREIRA ◽  
Marquiony Marques dos SANTOS ◽  
Cynthia Meida de Almeida GODOY

Background : Obese people have abnormal deposition of fat in the vocal tract that can interfere with the acoustic voice. Aim : To relate the fundamental frequency, the maximum phonation time and voice complaints from a group of morbidly obese women. Methods : Observational, cross-sectional and descriptive study that included 44 morbidly obese women, mean age of 42.45 (±10.31) years old, observational group and 30 women without obesity, control group, with 33.79 (±4.51)years old. The voice recording was done in a quiet environment, on a laptop using the program ANAGRAF acoustic analysis of speech sounds. To extract the values of fundamental frequency the subjects were asked to produce vowel [a] at usual intensity for a period in average of three seconds. After the voice recording, participants were prompted to produce sustained vowel [ a] , [ i] and [ u] at usual intensity and height, using a stopwatch to measure the time that each participant could hold each vowel. Results : The majority, 31(70.5%), had vocal complaints, with a higher percentage for complaints of vocal fatigue 20(64.51%) and voice failures 19(61.29%) followed by dryness of the throat in 15 (48.38%) and effort to speak 13(41.93%). There was no statistically significant difference regarding the mean fundamental frequency of the voice in both groups, but there was significance between the two groups regarding maximum phonation. Conclusion : Increased adipose tissue in the vocal tract interfered in the vocal parameters.


1999 ◽  
Vol 113 (1) ◽  
pp. 31-34 ◽  
Author(s):  
Guo-Dong Li ◽  
Liancai Mu ◽  
Shilin Yang

AbstractThe goal of this study was to determine if there are acoustical differences between pre- and post-surgical voices and to evaluate the effectiveness of Isshiki type III thyroplasty in 11 male patients with mutational voice disorders. Acoustic measures were obtained both pre- and post-operatively. A comparison of pre-and post-operative fundamental frequency (Fo), voice frequencies, and vocal intensity obtained from a sustained vowel /i/ during different phonatory tasks was made. The results from the present study demonstrated that after operation the voice frequencies were significantly decreased (p<0.05). The vocal intensity tended to reduce slightly as the voice frequency lowered. However, there were no statistically significant differences in the pre- and post-operative measures of vocal intensity (p>0.5). The preoperative high pitched voices of all the male patients were lowered up to the normal value by the type III thyroplasty.


2000 ◽  
Vol 114 (2) ◽  
pp. 113-118 ◽  
Author(s):  
Sibylle Brosch ◽  
Christiane Matthes ◽  
Wolfgang Pirsig ◽  
Thomas Verse

The aim of the study was to find whether a muscle-sparing uvulopalatopharyngoplasty (UPPP) and tonsillectomy cause a measurable change in the voice. The fundamental frequency and the first two formants of five sustained vowels were measured before and nine (six to 15) months following operation. The operation consisted of tonsillectomy and UPPP with preservation of the musculature of the soft palate. All patients received a pre- and post-operative 12-canal polysomnography (level-I sleep study). No patient showed signs of any post-operative velopharyngeal insufficiency. Acoustic analysis showed a significant raising of the fundamental frequency of up to 10 Hz. There was also a lowering of the second formant in two of the five vowels. This was correlated with the volume of the excised tissue. The minimal changes will probably have no significance for those who place no special reliance on their voice, but the possible post-operative changes should be made clear to singers and those relying on their voice for professional reasons.


2020 ◽  
Author(s):  
Susanto Susanto ◽  
Wang Zhenhua ◽  
Wang Yingli ◽  
Deri Sis Nanda

In the provision of linguistic evidence as one of the foci in Forensic Linguistics, Forensic Speaker Verification (FSV) includes an analysis of speech recordings to verify the voice of a criminal. As an inquiry into the validity of the available FSV, we present the analysis on Indonesian FSV system. The system consists of pairing, tagging, acoustic features extraction, and statistical analysis. There is a claim that the system meets the demand for presenting legal evidence in Indonesian court. In the system, one of the acoustic features extracted from the speech data is fundamental frequency (F0). Then, the paper aims at reviewing the method in Indonesian FSV system in terms of fundamental frequency (F0) used as the discriminatory potential. The results show that F0 has not represented an adequate interpretation of the linguistic evidence in our experimental data. It leads us to suggest that more experimental studies are required to scrutinize F0 in the system.


2020 ◽  
Vol 63 (4) ◽  
pp. 1033-1043
Author(s):  
Filipa M. B. Lã ◽  
Nuria Polo

Purpose Concentrations of sex steroid hormones—estrogens, progesterone, and testosterone—have been associated with premenstrual and menstrual vocal symptoms. However, the extent to which these symptoms may be reflected on acoustical features of the voice is still debated. This study investigates variations in fundamental frequency ( f o ) and related parameters in connected speech across phases of the menstrual cycle and during the use of a combined oral contraceptive pill (OCP). Method Electrolaryngographic recordings were made, and blood samples were collected at three different phases of the menstrual cycle—menstrual, follicular, and luteal—for placebo and OCP use. These two conditions were blindly and randomly allocated in the study. Speaking f o (SFF), SFF standard deviation, SFF rate of change, SFF slope, maximum and minimum f o , and f o range were extracted for nine healthy females while reading a phrase from the Rainbow Passage. Concentrations of sex hormones were analyzed in serum. Nonparametric statistical tests were carried out to assess differences between phases and conditions. Results SFF, its standard deviation, and maximum f o were significantly different between phases of the menstrual cycle for placebo use only. Menstrual phase showed the lowest values. Maximum and minimum f o were significantly different between placebo and OCP use for menstrual and follicular phases, respectively. Conclusions Fluctuations in sex steroid hormones across the menstrual cycle alter f o in speech more than a particular hormonal concentration. OCP use seems to have a stabilizing effect on the voice relative to f o and related parameters in speech.


2016 ◽  
Vol 59 (3) ◽  
pp. 546-556 ◽  
Author(s):  
Hartmut Meister ◽  
Katrin Fürsen ◽  
Barbara Streicher ◽  
Ruth Lang-Roth ◽  
Martin Walger

PurposeThe focus of this study was to examine the influence of fundamental frequency (F0) and vocal tract length (VTL) modifications on speaker gender recognition in cochlear implant (CI) recipients for different stimulus types.MethodSingle words and sentences were manipulated using isolated or combined F0 and VTL cues. Using an 11-point rating scale, CI recipients and listeners with normal hearing rated the maleness/femaleness of the corresponding voice.ResultsSpeaker gender ratings for combined F0 and VTL modifications were similar across all stimulus types in both CI recipients and listeners with normal hearing, although the CI recipients showed a somewhat larger ambiguity. In contrast to listeners with normal hearing, F0-VTL and F0-only modifications revealed similar ratings in the CI recipients when using words as stimuli. However, when sentences were used, a difference was found between F0-VTL–based and F0-based ratings. Modifying VTL cues alone did not affect ratings in the CI group.ConclusionsWhereas speaker gender ratings by listeners with normal hearing relied on combined VTL and F0 cues, CI recipients made only limited use of VTL cues, which might be one reason behind problems with identifying the speaker on the basis of voice. However, use of the voice cues depended on stimulus type, with the greater information in sentences allowing a more detailed analysis than single words in both listener groups.


Sign in / Sign up

Export Citation Format

Share Document