vocal pitch
Recently Published Documents


TOTAL DOCUMENTS

185
(FIVE YEARS 41)

H-INDEX

28
(FIVE YEARS 3)

Author(s):  
Isao Tokuda

In the source-filter theory, the mechanism of speech production is described as a two-stage process: (a) The air flow coming from the lungs induces tissue vibrations of the vocal folds (i.e., two small muscular folds located in the larynx) and generates the “source” sound. Turbulent airflows are also created at the glottis or at the vocal tract to generate noisy sound sources. (b) Spectral structures of these source sounds are shaped by the vocal tract “filter.” Through the filtering process, frequency components corresponding to the vocal tract resonances are amplified, while the other frequency components are diminished. The source sound mainly characterizes the vocal pitch (i.e., fundamental frequency), while the filter forms the timbre. The source-filter theory provides a very accurate description of normal speech production and has been applied successfully to speech analysis, synthesis, and processing. Separate control of the source (phonation) and the filter (articulation) is advantageous for acoustic communications, especially for human language, which requires expression of various phonemes realized by a flexible maneuver of the vocal tract configuration. Based on this idea, the articulatory phonetics focuses on the positions of the vocal organs to describe the produced speech sounds. The source-filter theory elucidates the mechanism of “resonance tuning,” that is, a specialized way of singing. To increase efficiency of the vocalization, soprano singers adjust the vocal tract filter to tune one of the resonances to the vocal pitch. Consequently, the main source sound is strongly amplified to produce a loud voice, which is well perceived in a large concert hall over the orchestra. It should be noted that the source–filter theory is based upon the assumption that the source and the filter are independent from each other. Under certain conditions, the source and the filter interact with each other. The source sound is influenced by the vocal tract geometry and by the acoustic feedback from the vocal tract. Such source–filter interaction induces various voice instabilities, for example, sudden pitch jump, subharmonics, resonance, quenching, and chaos.


2021 ◽  
Author(s):  
Jonathan Henry Venezia ◽  
Christian Herrera ◽  
Nicole Whittle ◽  
Marjorie R. Leek ◽  
Samuel Barnes ◽  
...  

In a recent study (Venezia et al., 2021), left dorsal premotor cortex (dPM) responded to vocal pitch during a degraded speech recognition task, but only when speech was rated as unintelligible. Crucially, vocal pitch was not relevant to the task. The present fMRI study (N = 25) tests the hypothesis that left dPM will respond to vocal pitch for increasingly intelligible speech in a multi-talker speech recognition task that emphasizes pitch for talker segregation. We applied spectrotemporal modulation distortion to independently modulate vocal pitch and phonetic content in two-talker (male/female) utterances across two conditions (Competing, Unison), only one of which required pitch-based segregation (Competing). A Bayesian hierarchical drift-diffusion model (HDDM) was used to predict speech recognition performance (3-AFC response times, accuracy coded) from the pattern of spectrotemporal distortion imposed on each trial. The model’s drift rate parameter, a d’-like measure of speech recognition performance, was strongly associated with vocal pitch for Competing but not Unison. In a second, Bayesian hierarchical brain-behavior model, we then regressed the HDDM’s posterior predictions of trial-wise drift rate against trial-wise fMRI activation amplitude. A significant positive association with overall drift rate, reflecting contributions from vocal pitch and/or phonetic content, was observed in left dPM in both conditions. A significant positive association with ‘pitch-restricted’ drift rate, reflecting only contributions from vocal pitch, was observed in left dPM but only in the Competing condition. These findings suggest that left dPM: (i) responds to vocal pitch; and (ii) can operate in an auditory-pitch mode and a phonetic-speech mode.


Author(s):  
CONSTANTINE BOUSSALIS ◽  
TRAVIS G. COAN ◽  
MIRYA R. HOLMAN ◽  
STEFAN MÜLLER

Voters evaluate politicians not just by what they say, but also how they say it, via facial displays of emotions and vocal pitch. Candidate characteristics can shape how leaders use—and how voters react to—nonverbal cues. Drawing on role congruity expectations, we study how the use of and reactions to facial, vocal, and textual communication in political debates varies by candidate gender. Relying on full-length videos of four German federal election debates (2005–2017) and a minor party debate, we use video, audio, and text data to measure candidate facial displays of emotion, vocal pitch, and speech sentiment. Consistent with our expectations, Angela Merkel expresses less anger than her male opponents, but she is just as emotive in other respects. Combining these measures of emotional expression with continuous responses recorded by live audiences, we find that voters punish Merkel for anger displays and reward her happiness and general emotional displays.


Author(s):  
Joshua J. Guyer ◽  
Pablo Briñol ◽  
Thomas I. Vaughan-Johnston ◽  
Leandre R. Fabrigar ◽  
Lorena Moreno ◽  
...  

AbstractThis article unpacks the basic mechanisms by which paralinguistic features communicated through the voice can affect evaluative judgments and persuasion. Special emphasis is placed on exploring the rapidly emerging literature on vocal features linked to appraisals of confidence (e.g., vocal pitch, intonation, speech rate, loudness, etc.), and their subsequent impact on information processing and meta-cognitive processes of attitude change. The main goal of this review is to advance understanding of the different psychological processes by which paralinguistic markers of confidence can affect attitude change, specifying the conditions under which they are more likely to operate. In sum, we highlight the importance of considering basic mechanisms of attitude change to predict when and why appraisals of paralinguistic markers of confidence can lead to more or less persuasion.


2021 ◽  
Vol 66 (1) ◽  
pp. 1-53
Author(s):  
Anna Pavlova

Abstract Vocal pitch (also known as F0), pause and positioning of the sentence accent are closely related to the meaning of a statement. Although other elements of prosody can also be related to the meaning of the statement (e. g. timbre, speaking rate), in this paper only the three mentioned elements of prosody are considered in connection with the semantic interpretation. These three parameters are analyzed here individually or in connection with the meaning of potentially ambiguous statements in writing. In this way, the role of prosody in resolving ambiguities in written formulations is demonstrated, ambiguities that may lead to incorrect translations. The approach used for this research is „Analysis-by-introspection.” For better illustration of the differences in the meaning depending on the prosody, the translation method is used. Some translations come from the author of this paper; other translations are taken from some published books. For one or the other prosodic contour, semantic explanations are formulated. It is shown that prosodic topics play a major role in courses for foreign language learning, translation lessons and post-editing.


2021 ◽  
Author(s):  
Jonathan Henry Venezia ◽  
Virginia Richards ◽  
Gregory Hickok

We recently developed a method to estimate speech-driven spectrotemporal receptive fields (STRFs) using fMRI. The method uses spectrotemporal modulation filtering, a form of acoustic distortion that renders speech sometimes intelligible and sometimes unintelligible. Using this method, we found significant STRF tuning only in classic auditory regions throughout the superior temporal lobes. However, our analysis was not optimized to detect small clusters of tuned STRFs as might be expected in non-auditory regions. Here, we re-analyze our data using a more sensitive multivariate procedure, and we identify STRF tuning in non-auditory regions including the left dorsal premotor cortex (left dPM), left inferior frontal gyrus (LIFG), and bilateral calcarine sulcus (calcS). All three regions responded more to intelligible than unintelligible speech, but left dPM and calcS responded significantly to vocal pitch and demonstrated strong functional connectivity with early auditory regions. However, only left dPM’s STRF predicted activation on trials rated as unintelligible by listeners, a hallmark auditory profile. LIFG, on the other hand, responded almost exclusively to intelligible speech and was functionally connected with classic speech-language regions in the superior temporal sulcus and middle temporal gyrus. LIFG’s STRF was also (weakly) able to predict activation on unintelligible trials, suggesting the presence of a partial ‘acoustic trace’ in the region. We conclude that left dPM is part of the human dorsal laryngeal motor cortex, a region previously shown to be capable of operating in an ‘auditory mode’ to encode vocal pitch. Further, given previous observations that LIFG is involved in syntactic working memory and/or processing of linear order, we conclude that LIFG is part of a higher-order speech circuit that exerts a top-down influence on processing of speech acoustics. Finally, because calcS is modulated by emotion, we speculate that changes in the quality of vocal pitch may have contributed to its response.


Author(s):  
Takahiro Yokozuka ◽  
Hitoshi Miyamoto ◽  
Masatoshi Kasai ◽  
Yoshihiro Miyake ◽  
Takayuki Nozawa

2021 ◽  
Author(s):  
Tatiana A Stroganova ◽  
Kirill S Komarov ◽  
Olga V Sysoeva ◽  
Dzerasa E Goiaeva ◽  
Tatiana S Obukhova ◽  
...  

Abstract Background: Deficits in perception and production of vocal pitch are often observed in people with autism spectrum disorder (ASD), but the neural basis of these deficits is unknown. In magnetoencephalogram (MEG), spectrally complex periodic sounds trigger two continuous neural responses – the auditory steady state response (ASSR) and the sustained field (SF). It has been shown that the SF in neurotypical individuals is associated with low-level analysis of pitch in the ‘pitch processing center’ of the Heschl’s gyrus. Therefore, alternations in this auditory response may reflect atypical processing of vocal pitch. The SF, however, has never been studied in people with ASD. Methods: We used MEG and individual brain models to investigate the ASSR and SF evoked by monaural 40 Hz click trains in boys with ASD (N=35) and neurotypical (NT) boys (N=35) aged 7-12-years.Results: In agreement with the previous research in adults, the cortical sources of the SF in children were located in the left and right Heschl’s gyri, anterolateral to those of the ASSR. In both groups, the SF and ASSR dominated in the right hemisphere and were higher in the hemisphere contralateral to the stimulated ear. The ASSR increased with age in both NT and ASD children and did not differ between the groups. The SF amplitude did not significantly change between the ages of 7 and 12 years. It was moderately attenuated in both hemispheres and was markedly delayed and displaced in the left hemisphere in boys with ASD. The SF delay in participants with ASD was present irrespective of their intelligence level and severity of autism symptoms. Limitations: We did not test the language abilities of our participants. Therefore, the link between SF and processing of vocal pitch in children with ASD remains speculative. Conclusion: Children with ASD demonstrate atypical processing of spectrally complex periodic sound at the level of the core auditory cortex of the left-hemisphere. The observed neural deficit may contribute to speech perception difficulties experienced by children with ASD, including their poor perception and production of linguistic prosody.


2020 ◽  
Author(s):  
Tatiana A Stroganova ◽  
Kirill S Komarov ◽  
Olga V Sysoeva ◽  
Dzerasa E Goiaeva ◽  
Tatiana S Obukhova ◽  
...  

Abstract Background: Deficits in perception and production of vocal pitch are often observed in people with autism spectrum disorder (ASD), but the neural basis of these abnormalities is unknown. In magnetoencephalogram (MEG), spectrally complex periodic sounds trigger two continuous neural responses – the auditory steady state response (ASSR) and the sustained field (SF). It has been shown that the SF in neurotypical individuals is associated with low-level analysis of pitch in the ‘pitch processing center’ of the Heschl’s gyrus. Therefore, this auditory response may reflect vocal pitch processing abnormalities in ASD. The SF, however, has never been studied in people with this disorder. Methods: We used MEG and individual brain models to investigate the ASSR and SF evoked by monaural 40 Hz click trains in boys with ASD (N=35) and neurotypical (NT) boys (N=35) aged 7-12-years.Results: In agreement with the previous research in adults, the cortical sources of the SF in children were located in the left and right Heschl’s gyri, anterolateral to those of the ASSR. In both groups, the SF and ASSR dominated in the right hemisphere and were higher in the hemisphere contralateral to the stimulated ear. The ASSR increased with age in both NT and ASD children and did not differ between the groups. The SF amplitude did not significantly change between the ages of 7 and 12 years. It was moderately attenuated in both hemispheres and was markedly delayed and displaced in the left hemisphere in boys with ASD. The SF delay in participants with ASD was present irrespective of their intelligence level and severity of autism symptoms.Limitations: We did not test the language abilities of our participants. Therefore, the link between SF and processing of vocal pitch in children with ASD remains speculative.Conclusion: Children with ASD demonstrate selective left-hemispheric abnormalities at the level of the core auditory cortex when processing spectrally complex periodic sound. The observed neural deficit may contribute to speech perception difficulties experienced by children with ASD, including their poor perception and production of linguistic prosody.


Sign in / Sign up

Export Citation Format

Share Document