scholarly journals Vocal emotion adaptation aftereffects within and across speaker genders: Roles of timbre and fundamental frequency

Cognition ◽  
2022 ◽  
Vol 219 ◽  
pp. 104967
Author(s):  
Christine Nussbaum ◽  
Celina I. von Eiff ◽  
Verena G. Skuk ◽  
Stefan R. Schweinberger
2021 ◽  
Author(s):  
Christine Nussbaum ◽  
Celina Isabelle von Eiff ◽  
Verena G. Skuk ◽  
Stefan R. Schweinberger

Although previous research demonstrated perceptual aftereffects in emotional voice adaptation, the contribution of different vocal cues to these effects is unclear. In two experiments, we used parameter-specific morphing of adaptor voices to investigate the relative roles of fundamental frequency (F0) and timbre in vocal emotion adaptation, using angry and fearful utterances. Participants adapted to voices containing emotion-specific information in either F0 or timbre, with all other parameters kept constant at an intermediate 50% morph level. Full emotional adaptors and ambiguous adaptors were used as reference conditions. Adaptors were either of the same (Experiment 1) or opposite speaker gender (Experiment 2) of target voices. In Experiment 1, we found consistent aftereffects in all adaptation conditions. Crucially, aftereffects following timbre adaptors were much larger than following F0 adaptors and were only marginally smaller than those following full adaptors. In Experiment 2, adaptation aftereffects appeared massively and proportionally reduced, with differences between morph types being no longer significant. These results suggest that timbre plays a larger role than F0 in vocal emotion adaptation, and that vocal emotion adaptation is compromised by eliminating gender-congruency between adaptors and targets. Our findings also add to mounting evidence suggesting a major role of timbre in auditory adaptation.


1979 ◽  
Vol 10 (4) ◽  
pp. 246-248 ◽  
Author(s):  
Peter B. Mueller ◽  
Marla Adams ◽  
Jean Baehr-Rouse ◽  
Debbie Boos

Mean fundamental frequencies of male and female subjects obtained with FLORIDA I and a tape striation counting procedure were compared. The fundamental frequencies obtained with these two methods were similar and it appears that the tape striation counting procedure is a viable, simple, and inexpensive alternative to more costly and complicated procedures and instrumentation.


1995 ◽  
Vol 4 (2) ◽  
pp. 62-69 ◽  
Author(s):  
Katherine Verdolini ◽  
Ingo R. Titze

In this paper, we discuss the application of mathematical formulas to guide the development of clinical interventions in voice disorders. Discussion of case examples includes fundamental frequency and intensity deviations, pitch and loudness abnormalities, laryngeal hyperand hypoadduction, and phonatory effort. The paper illustrates the interactive nature of theoretical and applied work in vocology


2020 ◽  
Vol 63 (4) ◽  
pp. 931-947
Author(s):  
Teresa L. D. Hardy ◽  
Carol A. Boliek ◽  
Daniel Aalto ◽  
Justin Lewicke ◽  
Kristopher Wells ◽  
...  

Purpose The purpose of this study was twofold: (a) to identify a set of communication-based predictors (including both acoustic and gestural variables) of masculinity–femininity ratings and (b) to explore differences in ratings between audio and audiovisual presentation modes for transgender and cisgender communicators. Method The voices and gestures of a group of cisgender men and women ( n = 10 of each) and transgender women ( n = 20) communicators were recorded while they recounted the story of a cartoon using acoustic and motion capture recording systems. A total of 17 acoustic and gestural variables were measured from these recordings. A group of observers ( n = 20) rated each communicator's masculinity–femininity based on 30- to 45-s samples of the cartoon description presented in three modes: audio, visual, and audio visual. Visual and audiovisual stimuli contained point light displays standardized for size. Ratings were made using a direct magnitude estimation scale without modulus. Communication-based predictors of masculinity–femininity ratings were identified using multiple regression, and analysis of variance was used to determine the effect of presentation mode on perceptual ratings. Results Fundamental frequency, average vowel formant, and sound pressure level were identified as significant predictors of masculinity–femininity ratings for these communicators. Communicators were rated significantly more feminine in the audio than the audiovisual mode and unreliably in the visual-only mode. Conclusions Both study purposes were met. Results support continued emphasis on fundamental frequency and vocal tract resonance in voice and communication modification training with transgender individuals and provide evidence for the potential benefit of modifying sound pressure level, especially when a masculine presentation is desired.


2020 ◽  
Vol 63 (11) ◽  
pp. 3855-3864
Author(s):  
Wanting Huang ◽  
Lena L. N. Wong ◽  
Fei Chen ◽  
Haihong Liu ◽  
Wei Liang

Purpose Fundamental frequency (F0) is the primary acoustic cue for lexical tone perception in tonal languages but is processed in a limited way in cochlear implant (CI) systems. The aim of this study was to evaluate the importance of F0 contours in sentence recognition in Mandarin-speaking children with CIs and find out whether it is similar to/different from that in age-matched normal-hearing (NH) peers. Method Age-appropriate sentences, with F0 contours manipulated to be either natural or flattened, were randomly presented to preschool children with CIs and their age-matched peers with NH under three test conditions: in quiet, in white noise, and with competing sentences at 0 dB signal-to-noise ratio. Results The neutralization of F0 contours resulted in a significant reduction in sentence recognition. While this was seen only in noise conditions among NH children, it was observed throughout all test conditions among children with CIs. Moreover, the F0 contour-induced accuracy reduction ratios (i.e., the reduction in sentence recognition resulting from the neutralization of F0 contours compared to the normal F0 condition) were significantly greater in children with CIs than in NH children in all test conditions. Conclusions F0 contours play a major role in sentence recognition in both quiet and noise among pediatric implantees, and the contribution of the F0 contour is even more salient than that in age-matched NH children. These results also suggest that there may be differences between children with CIs and NH children in how F0 contours are processed.


2020 ◽  
Vol 63 (8) ◽  
pp. 2597-2608
Author(s):  
Emily N. Snell ◽  
Laura W. Plexico ◽  
Aurora J. Weaver ◽  
Mary J. Sandage

Purpose The purpose of this preliminary study was to identify a vocal task that could be used as a clinical indicator of the vocal aptitude or vocal fitness required for vocally demanding occupations in a manner similar to that of the anaerobic power tests commonly used in exercise science. Performance outcomes for vocal tasks that require rapid acceleration and high force production may be useful as an indirect indicator of muscle fiber complement and bioenergetic fitness of the larynx, an organ that is difficult to study directly. Method Sixteen women (age range: 19–24 years, M age = 22 years) were consented for participation and completed the following performance measures: forced vital capacity, three adapted vocal function tasks, and the horizontal sprint test. Results Using a within-participant correlational analyses, results indicated a positive relationship between the rate of the last second of a laryngeal diadochokinesis task that was produced at a high fundamental frequency/high sound level and anaerobic power. Forced vital capacity was not correlated with any of the vocal function tasks. Conclusions These preliminary results indicate that aspects of the laryngeal diadochokinesis task produced at a high fundamental frequency and high sound level may be useful as an ecologically valid measure of vocal power ability. Quantification of vocal power ability may be useful as a vocal fitness assessment or as an outcome measure for voice rehabilitation and habilitation for patients with vocally demanding jobs.


2020 ◽  
Vol 63 (10) ◽  
pp. 3311-3325
Author(s):  
Brittany L. Perrine ◽  
Ronald C. Scherer

Purpose The goal of this study was to determine if differences in stress system activation lead to changes in speaking fundamental frequency, average oral airflow, and estimated subglottal pressure before and after an acute, psychosocial stressor. Method Eighteen vocally healthy adult females experienced the Trier Social Stress Test (TSST) to activate the hypothalamic–pituitary–adrenal axis. The TSST includes public speaking and performing mental arithmetic in front of an audience. At seven time points, three before the stressor and four after the stressor, the participants produced /pa/ repetitions, read the Rainbow Passage, and provided a saliva sample. Measures included (a) salivary cortisol level, (b) oral airflow, (c) estimated subglottal pressure, and (d) speaking fundamental frequency from the second sentence of the Rainbow Passage. Results Ten of the 18 participants experienced a hypothalamic–pituitary–adrenal axis response to stress as indicated by a 2.5-nmol/L increase in salivary cortisol from before the TSST to after the TSST. Those who experienced a response to stress had a significantly higher speaking fundamental frequency before and immediately after the stressor than later after the stressor. No other variable varied significantly due to the stressor. Conclusions This study suggests that the idiosyncratic and inconsistent voice changes reported in the literature may be explained by differences in stress system activation. In addition, laryngeal aerodynamic measures appear resilient to changes due to acute stress. Further work is needed to examine the influence of other stress systems and if these findings hold for dysphonic individuals.


2020 ◽  
Vol 63 (12) ◽  
pp. 4325-4326 ◽  
Author(s):  
Hartmut Meister ◽  
Katrin Fuersen ◽  
Barbara Streicher ◽  
Ruth Lang-Roth ◽  
Martin Walger

Purpose The purpose of this letter is to compare results by Skuk et al. (2020) with Meister et al. (2016) and to point to a potential general influence of stimulus type. Conclusion Our conclusion is that presenting sentences may give cochlear implant recipients the opportunity to use timbre cues for voice perception. This might not be the case when presenting brief and sparse stimuli such as consonant–vowel–consonant or single words, which were applied in the majority of studies.


1982 ◽  
Vol 27 (9) ◽  
pp. 690-692 ◽  
Author(s):  
Janet Pierrehumbert ◽  
Mark Liberman

Sign in / Sign up

Export Citation Format

Share Document