Pitch versus Brightness of Timbre: Detecting Combined Shifts in Fundamental and Formant Frequency

1993 ◽  
Vol 11 (1) ◽  
pp. 1-13 ◽  
Author(s):  
Laurent Demany ◽  
Catherine Semal

The pitch of a periodic tone depends on its fundamental frequency (F0), and the brightness of its timbre depends on the centroid of its power spectrum (Fc). The goal of the present study was to determine whether small shifts in F0 and in Fc are detected independently of each other. The standard tone used had an F0 of 400 Hz, five harmonics (400-2000 Hz), and a triangular spectral envelope peaking at an Fc of 1000 Hz. With a forced-choice adaptive procedure, detection thresholds were measured for (1) shifts in F0 alone (Fc being fixed), (2) shifts in Fc alone (F0 being fixed), and (3) combined shifts in F0 and Fc. The two components of the combined shifts were chosen to have the same level of detectability when presented alone. Overall, as expected from the independence model, the combined shifts were not better detected when their two components had the same direction (F0 and Fc both increase, or both decrease) than when they had opposite directions. However, substantial differences between subjects were observed with respect to the perceptual integration of shifts in F0 and in Fc.
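
The independence prediction mentioned above can be illustrated with a small sketch. The abstract does not state the formula the authors used; the code below assumes the common signal-detection formulation in which two independently processed cues combine by Euclidean summation of their sensitivities (d'). The function name and the example d' values are hypothetical.

```python
import math

def combined_dprime(d_f0: float, d_fc: float) -> float:
    """Predicted sensitivity to a combined shift, ASSUMING the F0 and Fc
    cues are processed by independent channels (Euclidean summation)."""
    return math.hypot(d_f0, d_fc)

# Signed sensitivities: positive = upward shift, negative = downward shift.
# Both components are equally detectable alone (|d'| = 1), as in the study design.
same_direction = combined_dprime(+1.0, +1.0)      # F0 up, Fc up
opposite_direction = combined_dprime(+1.0, -1.0)  # F0 up, Fc down

print(same_direction, opposite_direction)
```

Under this assumption the sign of each component drops out, so same-direction and opposite-direction shifts are predicted to be equally detectable.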

ALQALAM ◽  
2015 ◽  
Vol 32 (2) ◽  
pp. 284
Author(s):  
Muhammad Subali ◽  
Miftah Andriansyah ◽  
Christanto Sinambela

This article examines the similarities and differences in fundamental frequency and formant frequencies, estimated with the autocorrelation function and the LPC function in a MATLAB 2012b GUI, for the sounds of hijaiyah letters produced by adult male beginner and expert speakers following makhraj pronunciation. The two groups of speakers are also compared by the matching distance between their sounds, computed with the DTW method on the cepstrum. The beginner speakers were 22-year-old college students from Universitas Gunadarma and SITC, and their speech was recorded using a MATLAB algorithm in the GUI. The expert speakers' data were taken from previous research; those speakers were 20-30 years old at the time of data collection. The fundamental frequency and formant frequencies were extracted from each sound and then analyzed for similarities and differences between beginner and expert speech, together with the matching distance between the two. The results show that all beginner and expert speech based on makhraj pronunciation differed in fundamental frequency and formant frequency values. The DTW matching distance between beginner and expert speech ranged from 28.9746 to 136.4. Keywords: fundamental frequency, formant frequency, hijaiyah letters, makhraj
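
A minimal sketch of the DTW matching distance referred to above. This is not the authors' MATLAB implementation: the Euclidean local distance, the basic three-way step pattern, and the function name `dtw_distance` are assumptions, and the toy one-dimensional "frames" stand in for cepstral feature vectors.

```python
import math

def dtw_distance(a, b):
    """Dynamic time warping distance between two sequences of
    equal-dimension feature vectors (O(len(a) * len(b)) time)."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = best cumulative cost aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])  # Euclidean frame distance
            cost[i][j] = local + min(
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]

# Toy example: b is a time-stretched copy of a, so the warped distance is zero.
a = [(0.0,), (1.0,), (2.0,)]
b = [(0.0,), (1.0,), (1.0,), (2.0,)]
print(dtw_distance(a, b))
```

A larger value indicates that the two utterances remain dissimilar even after optimal time alignment.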


1992 ◽  
Vol 35 (4) ◽  
pp. 761-768 ◽  
Author(s):  
Petra Zwirner ◽  
Gary J. Barnes

Acoustic analyses of upper airway and phonatory stability were conducted on samples of sustained phonation to evaluate the relation between laryngeal and articulomotor stability for 31 patients with dysarthria and 12 non-dysarthric control subjects. Significantly higher values were found for the variability in fundamental frequency and formant frequency of patients who have Huntington’s disease compared with normal subjects and patients with Parkinson’s disease. No significant correlations were found between formant frequency variability and the variability of the fundamental frequency for any subject group. These findings are discussed as they pertain to the relationship between phonatory and upper airway subsystems and the evaluation of vocal tract motor control impairments in dysarthria.


2018 ◽  
Vol 61 (9) ◽  
pp. 2376-2385 ◽  
Author(s):  
Erol J. Ozmeral ◽  
Ann C. Eddins ◽  
David A. Eddins

Purpose The goal was to evaluate the potential effects of increasing hearing loss and advancing age on spectral envelope perception. Method Spectral modulation detection was measured as a function of spectral modulation frequency from 0.5 to 8.0 cycles/octave. The spectral modulation task involved discrimination of a noise carrier (3 octaves wide from 400 to 3200 Hz) with a flat spectral envelope from a noise having a sinusoidal spectral envelope across a logarithmic audio frequency scale. Spectral modulation transfer functions (SMTFs; modulation threshold vs. modulation frequency) were computed and compared across 4 listener groups: young normal hearing, older normal hearing, older with mild hearing loss, and older with moderate hearing loss. Estimates of the internal spectral contrast were obtained by computing excitation patterns. Results SMTFs for young listeners with normal hearing were bandpass with a minimum modulation detection threshold at 2 cycles/octave, and SMTFs for older listeners with normal hearing were remarkably similar to those of the young listeners. SMTFs for older listeners with mild and moderate hearing loss had a low-pass rather than a bandpass shape. Excitation patterns revealed that limited spectral resolution dictated modulation detection thresholds at high but not low spectral modulation frequencies. Even when factoring out (presumed) differences in frequency resolution among groups, spectral envelope perception was worse for the group with moderate hearing loss than for the other 3 groups. Conclusions Spectral envelope perception as measured by spectral modulation detection thresholds is compromised by hearing loss at higher spectral modulation frequencies, consistent with predictions of reduced spectral resolution known to accompany sensorineural hearing loss. Spectral envelope perception is not negatively impacted by advancing age at any spectral modulation frequency between 0.5 and 8.0 cycles/octave.
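
The sinusoidal spectral envelope on a logarithmic frequency axis described above can be sketched as follows. The abstract does not give the stimulus equation, so the peak-to-valley depth in dB, the starting phase, and the function name are assumptions; only the 400 Hz lower edge and the cycles/octave unit come from the text.

```python
import math

def spectral_envelope_db(freq_hz, cycles_per_octave, depth_db,
                         low_hz=400.0, phase=0.0):
    """Gain (dB) applied at freq_hz for a sinusoidal spectral envelope
    defined over log2 frequency. depth_db is the ASSUMED peak-to-valley
    modulation depth; depth_db = 0 gives the flat-envelope reference."""
    octaves_above_low = math.log2(freq_hz / low_hz)
    return 0.5 * depth_db * math.sin(
        2.0 * math.pi * cycles_per_octave * octaves_above_low + phase
    )

# Envelope sampled every quarter octave across the 3-octave carrier (400-3200 Hz)
# at 2 cycles/octave with a hypothetical 10 dB peak-to-valley depth.
for step in range(13):
    f = 400.0 * 2.0 ** (step / 4.0)
    print(round(f, 1), round(spectral_envelope_db(f, 2.0, 10.0), 3))
```

Raising `cycles_per_octave` packs the peaks and valleys closer together, which is why poorer frequency resolution is expected to hurt detection mainly at high modulation frequencies.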


Author(s):  
Johan Sundberg

The function of the voice organ is basically the same in classical singing as in speech. However, loud orchestral accompaniment has necessitated the use of the voice in an economical way. As a consequence, the vowel sounds tend to deviate considerably from those in speech. Male voices cluster formants three, four, and five, so that a marked peak is produced in the spectrum envelope near 3,000 Hz. This helps them to be heard through a loud orchestral accompaniment. They seem to achieve this effect by widening the lower pharynx, which makes the vowels more centralized than in speech. Singers often sing at fundamental frequencies higher than the normal first formant frequency of the vowel in the lyrics. In such cases they raise the first formant frequency so that it gets somewhat higher than the fundamental frequency. This is achieved by reducing the degree of vocal tract constriction or by widening the lip and jaw openings, constricting the vocal tract in the pharyngeal end and widening it in the mouth. These deviations from speech cause difficulties in vowel identification, particularly at high fundamental frequencies. Actually, vowel identification is almost impossible above 700 Hz (pitch F5). Another great difference between vocal sound produced in speech and the classical singing tradition concerns female voices, which need to reduce the timbral differences between voice registers. Females normally speak in modal or chest register, and the transition to falsetto tends to happen somewhere above 350 Hz. The great timbral differences between these registers are avoided by establishing control over the register function, that is, over the vocal fold vibration characteristics, so that seamless transitions are achieved. In many other respects, there are more or less close similarities between speech and singing. Thus, marking phrase structure, emphasizing important events, and emotional coloring are common principles, which may make vocal artists deviate considerably from the score’s nominal description of fundamental frequency and syllable duration.
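
The correspondence between 700 Hz and pitch F5 quoted above can be checked with a short sketch. It assumes twelve-tone equal temperament with A4 = 440 Hz and scientific pitch notation (neither is stated in the abstract), and the function name is hypothetical.

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

def nearest_note(freq_hz: float, a4_hz: float = 440.0) -> str:
    """Name of the equal-tempered note closest to freq_hz,
    ASSUMING A4 = a4_hz and MIDI-style numbering (A4 = 69, C4 = 60)."""
    midi = round(69 + 12 * math.log2(freq_hz / a4_hz))
    octave = midi // 12 - 1
    return f"{NOTE_NAMES[midi % 12]}{octave}"

print(nearest_note(700.0))  # limit quoted for vowel identification
print(nearest_note(350.0))  # approximate register transition quoted for female voices
```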


2019 ◽  
Vol 23 ◽  
pp. 233121651985130
Author(s):  
Heather A. Kreft ◽  
Lindsay A. DeVries ◽  
Julie G. Arenberg ◽  
Andrew J. Oxenham

A rapid forward-masked spatial tuning curve measurement procedure, based on Bekesy tracking, was adapted and evaluated for use with cochlear implants. Twelve postlingually-deafened adult cochlear-implant users participated. Spatial tuning curves using the new procedure and using a traditional forced-choice adaptive procedure resulted in similar estimates of parameters. The Bekesy-tracking method was almost 3 times faster than the forced-choice procedure, but its test–retest reliability was significantly poorer. Although too time-consuming for general clinical use, the new method may have some benefits in individual cases, where identifying electrodes with poor spatial selectivity as candidates for deactivation is deemed necessary.


2018 ◽  
Vol 8 (5) ◽  
pp. 27
Author(s):  
Abdul Abbasi ◽  
Mansoor Channa ◽  
Masood Memon ◽  
Stephen John ◽  
Irtaza Ahmed ◽  
...  

The purpose of this investigation was to document acoustic characteristics of Pakistani English (PaKE) vowel sounds. The experiment was designed to examine the properties of ten vowels produced by Pakistani ESL learners. The analysis is based on voice samples of 50 recorded CVC words. In total, 5000 (10 × 10 × 50 = 5000) voice samples were analyzed. The data consisted of 50 words containing ten English vowel sounds [i: ɪ e ɔ: æ ə ɑ: u: ɒ ʊ]. Ten ESL speakers recorded their voice samples using the Praat speech processing tool installed on a laptop. Three parameters were considered: fundamental frequency (F0), vowel quality (F1-F2), and duration. Formant patterns were judged manually by visual inspection in Praat. Analysis of formant frequency shows numerous differences between male and female speakers in F1 and F2, fundamental frequency, and duration of English vowels. The voice samples provide evidence for higher and lower frequency of vowel sounds. Additionally, the data analysis shows statistical differences between the values of short and long vowels, and the vowel space plot shows clear differences between male and female speakers in where vowels are produced.

