voice modulation Latest Research Papers

Vocal size exaggeration may have contributed to the origins of vocalic complexity

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2020.0401 ◽

2021 ◽

Vol 377 (1841) ◽

Author(s):

Katarzyna Pisanski ◽

Andrey Anikin ◽

David Reby

Keyword(s):

Body Size ◽

Social Impact ◽

Vocal Communication ◽

Vocal Tract ◽

Similar Extent ◽

Theme Issue ◽

Articulatory Movements ◽

Animal Vocalizations ◽

Second Formant ◽

Voice Modulation

Vocal tract elongation, which uniformly lowers vocal tract resonances (formant frequencies) in animal vocalizations, has evolved independently in several vertebrate groups as a means for vocalizers to exaggerate their apparent body size. Here, we propose that smaller speech-like articulatory movements that alter only individual formants can serve a similar yet less energetically costly size-exaggerating function. To test this, we examine whether uneven formant spacing alters the perceived body size of vocalizers in synthesized human vowels and animal calls. Among six synthetic vowel patterns, those characterized by the lowest first and second formant (the vowel /u/ as in ‘boot’) are consistently perceived as produced by the largest vocalizer. Crucially, lowering only one or two formants in animal-like calls also conveys the impression of a larger body size, and lowering the second and third formants simultaneously exaggerates perceived size to a similar extent as rescaling all formants. As the articulatory movements required for individual formant shifts are minor compared to full vocal tract extension, they represent a rapid and energetically efficient mechanism for acoustic size exaggeration. We suggest that, by favouring the evolution of uneven formant patterns in vocal communication, this deceptive strategy may have contributed to the origins of the phonemic diversification required for articulated speech. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part II)’.

Get full-text (via PubEx)

The neural control of volitional vocal production—from speech to identity, from social meaning to song

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2020.0395 ◽

2021 ◽

Vol 377 (1841) ◽

Author(s):

Sophie K. Scott

Keyword(s):

Speech Production ◽

Neural Control ◽

Social Impact ◽

Vocal Production ◽

Flexible System ◽

Human Voice ◽

Social Traits ◽

Neural Underpinnings ◽

The Voice ◽

Voice Modulation

The networks of cortical and subcortical fields that contribute to speech production have benefitted from many years of detailed study, and have been used as a framework for human volitional vocal production more generally. In this article, I will argue that we need to consider speech production as an expression of the human voice in a more general sense. I will also argue that the neural control of the voice can and should be considered to be a flexible system, into which more right hemispheric networks are differentially recruited, based on the factors that are modulating vocal production. I will explore how this flexible network is recruited to express aspects of non-verbal information in the voice, such as identity and social traits. Finally, I will argue that we need to widen out the kinds of vocal behaviours that we explore, if we want to understand the neural underpinnings of the true range of sound-making capabilities of the human voice. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part II)’.

Get full-text (via PubEx)

Vocal communication across cultures: theoretical and methodological issues

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2020.0387 ◽

2021 ◽

Vol 377 (1841) ◽

Author(s):

Gregory A. Bryant

Keyword(s):

Social Impact ◽

Vocal Communication ◽

Cross Cultural ◽

Small Scale ◽

Human Communication ◽

Methodological Issues ◽

Cultural Variations ◽

Speech Research ◽

Voice Modulation ◽

Behavioural Sciences

The study of human vocal communication has been conducted primarily in Western, educated, industrialized, rich, democratic (WEIRD) societies. Recently, cross-cultural investigations in several domains of voice research have been expanding into more diverse populations. Theoretically, it is important to understand how universals and cultural variations interact in vocal production and perception, but cross-cultural voice research presents many methodological challenges. Experimental methods typically used in WEIRD societies are often not possible to implement in many populations such as rural, small-scale societies. Moreover, theoretical and methodological issues are often unnecessarily intertwined. Here, I focus on three areas of cross-cultural voice modulation research: (i) vocal signalling of formidability and dominance, (ii) vocal emotions, and (iii) production and perception of infant-directed speech. Research in these specific areas illustrates challenges that apply more generally across the human behavioural sciences but also reveals promise as we develop our understanding of the evolution of human communication. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part II)’.

Get full-text (via PubEx)

Perception of group membership from spontaneous and volitional laughter

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2020.0404 ◽

2021 ◽

Vol 377 (1841) ◽

Author(s):

Roza G. Kamiloğlu ◽

Akihiro Tanaka ◽

Sophie K. Scott ◽

Disa A. Sauter

Keyword(s):

Social Impact ◽

Group Identity ◽

Group Membership ◽

Theme Issue ◽

Bayesian Analyses ◽

Two Cultures ◽

Social Signal ◽

The Two Cultures ◽

Voice Modulation ◽

Production Mechanisms

Laughter is a ubiquitous social signal. Recent work has highlighted distinctions between spontaneous and volitional laughter, which differ in terms of both production mechanisms and perceptual features. Here, we test listeners' ability to infer group identity from volitional and spontaneous laughter, as well as the perceived positivity of these laughs across cultures. Dutch ( n = 273) and Japanese ( n = 131) participants listened to decontextualized laughter clips and judged (i) whether the laughing person was from their cultural in-group or an out-group; and (ii) whether they thought the laughter was produced spontaneously or volitionally. They also rated the positivity of each laughter clip. Using frequentist and Bayesian analyses, we show that listeners were able to infer group membership from both spontaneous and volitional laughter, and that performance was equivalent for both types of laughter. Spontaneous laughter was rated as more positive than volitional laughter across the two cultures, and in-group laughs were perceived as more positive than out-group laughs by Dutch but not Japanese listeners. Our results demonstrate that both spontaneous and volitional laughter can be used by listeners to infer laughers’ cultural group identity. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part II)’.

Get full-text (via PubEx)

The shallow of your smile: the ethics of expressive vocal deep-fakes

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2021.0083 ◽

2021 ◽

Vol 377 (1841) ◽

Author(s):

Nadia Guerouaou ◽

Guillaume Vaiva ◽

Jean-Julien Aucouturier

Keyword(s):

Science Fiction ◽

Autonomous Vehicles ◽

Social Impact ◽

Public Awareness ◽

Social Dilemma ◽

Point Of View ◽

Technological Advances ◽

Potential Applications ◽

Experimental Ethics ◽

Voice Modulation

Rapid technological advances in artificial intelligence are creating opportunities for real-time algorithmic modulations of a person’s facial and vocal expressions, or ‘deep-fakes’. These developments raise unprecedented societal and ethical questions which, despite much recent public awareness, are still poorly understood from the point of view of moral psychology. We report here on an experimental ethics study conducted on a sample of N = 303 participants (predominantly young, western and educated), who evaluated the acceptability of vignettes describing potential applications of expressive voice transformation technology. We found that vocal deep-fakes were generally well accepted in the population, notably in a therapeutic context and for emotions judged otherwise difficult to control, and surprisingly, even if the user lies to their interlocutors about using them. Unlike other emerging technologies like autonomous vehicles, there was no evidence of social dilemma in which one would, for example, accept for others what they resent for themselves. The only real obstacle to the massive deployment of vocal deep-fakes appears to be situations where they are applied to a speaker without their knowing, but even the acceptability of such situations was modulated by individual differences in moral values and attitude towards science fiction. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part II)’.

Get full-text (via PubEx)

The bouba/kiki effect is robust across cultures and writing systems

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2020.0390 ◽

2021 ◽

Vol 377 (1841) ◽

Author(s):

Aleksandra Ćwiek ◽

Susanne Fuchs ◽

Christoph Draxler ◽

Eva Liina Asu ◽

Dan Dediu ◽

...

Keyword(s):

Social Impact ◽

Spoken Language ◽

Writing Systems ◽

Theme Issue ◽

Round Shape ◽

Crossmodal Correspondence ◽

Visual Shape ◽

Visual Properties ◽

The Voice ◽

Voice Modulation

The bouba/kiki effect—the association of the nonce word bouba with a round shape and kiki with a spiky shape—is a type of correspondence between speech sounds and visual properties with potentially deep implications for the evolution of spoken language. However, there is debate over the robustness of the effect across cultures and the influence of orthography. We report an online experiment that tested the bouba/kiki effect across speakers of 25 languages representing nine language families and 10 writing systems. Overall, we found strong evidence for the effect across languages, with bouba eliciting more congruent responses than kiki . Participants who spoke languages with Roman scripts were only marginally more likely to show the effect, and analysis of the orthographic shape of the words in different scripts showed that the effect was no stronger for scripts that use rounder forms for bouba and spikier forms for kiki . These results confirm that the bouba/kiki phenomenon is rooted in crossmodal correspondence between aspects of the voice and visual shape, largely independent of orthography. They provide the strongest demonstration to date that the bouba/kiki effect is robust across cultures and writing systems. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part II)’.

Get full-text (via PubEx)

Musicality in human vocal communication: an evolutionary perspective

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2020.0391 ◽

2021 ◽

Vol 377 (1841) ◽

Author(s):

Juan David Leongómez ◽

Jan Havlíček ◽

S. Craig Roberts

Keyword(s):

Social Interaction ◽

Social Environment ◽

Social Impact ◽

Vocal Communication ◽

Social Bonds ◽

Mental States ◽

Evolutionary Perspective ◽

Theme Issue ◽

Human Infants ◽

Voice Modulation

Studies show that specific vocal modulations, akin to those of infant-directed speech (IDS) and perhaps music, play a role in communicating intentions and mental states during human social interaction. Based on this, we propose a model for the evolution of musicality—the capacity to process musical information—in relation to human vocal communication. We suggest that a complex social environment, with strong social bonds, promoted the appearance of musicality-related abilities. These social bonds were not limited to those between offspring and mothers or other carers, although these may have been especially influential in view of altriciality of human infants. The model can be further tested in other species by comparing levels of sociality and complexity of vocal communication. By integrating several theories, our model presents a radically different view of musicality, not limited to specifically musical scenarios, but one in which this capacity originally evolved to aid parent–infant communication and bonding, and even today plays a role not only in music but also in IDS, as well as in some adult-directed speech contexts. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part II)’.

Get full-text (via PubEx)

A cross-species framework to identify vocal learning abilities in mammals

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2020.0394 ◽

2021 ◽

Vol 377 (1841) ◽

Author(s):

Andrea Ravignani ◽

Maxime Garcia

Keyword(s):

Sexual Selection ◽

Social Impact ◽

Regression Line ◽

Vocal Learning ◽

Intermediate Step ◽

Learning Abilities ◽

Correlational Approach ◽

Potential Link ◽

Multiple Species ◽

Voice Modulation

Vocal production learning (VPL) is the experience-driven ability to produce novel vocal signals through imitation or modification of existing vocalizations. A parallel strand of research investigates acoustic allometry, namely how information about body size is conveyed by acoustic signals. Recently, we proposed that deviation from acoustic allometry principles as a result of sexual selection may have been an intermediate step towards the evolution of vocal learning abilities in mammals. Adopting a more hypothesis-neutral stance, here we perform phylogenetic regressions and other analyses further testing a potential link between VPL and being an allometric outlier. We find that multiple species belonging to VPL clades deviate from allometric scaling but in the opposite direction to that expected from size exaggeration mechanisms. In other words, our correlational approach finds an association between VPL and being an allometric outlier. However, the direction of this association, contra our original hypothesis, may indicate that VPL did not necessarily emerge via sexual selection for size exaggeration: VPL clades show higher vocalization frequencies than expected. In addition, our approach allows us to identify species with potential for VPL abilities: we hypothesize that those outliers from acoustic allometry lying above the regression line may be VPL species. Our results may help better understand the cross-species diversity, variability and aetiology of VPL, which among other things is a key underpinning of speech in our species. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part II)’.

Get full-text (via PubEx)

Vocal modulation in human mating and competition

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2020.0388 ◽

2021 ◽

Vol 376 (1840) ◽

Cited By ~ 1

Author(s):

Susan M. Hughes ◽

David A. Puts

Keyword(s):

Social Interactions ◽

Social Impact ◽

Social Contexts ◽

Future Studies ◽

Human Voice ◽

Vocal Behaviour ◽

Courtship Success ◽

Human Mating ◽

Voice Modulation ◽

Romantic Interest

The human voice is dynamic, and people modulate their voices across different social interactions. This article presents a review of the literature examining natural vocal modulation in social contexts relevant to human mating and intrasexual competition. Altering acoustic parameters during speech, particularly pitch, in response to mating and competitive contexts can influence social perception and indicate certain qualities of the speaker. For instance, a lowered voice pitch is often used to exert dominance, display status and compete with rivals. Changes in voice can also serve as a salient medium for signalling a person's attraction to another, and there is evidence to support the notion that attraction and/or romantic interest can be distinguished through vocal tones alone. Individuals can purposely change their vocal behaviour in attempt to sound more attractive and to facilitate courtship success. Several findings also point to the effectiveness of vocal change as a mechanism for communicating relationship status. As future studies continue to explore vocal modulation in the arena of human mating, we will gain a better understanding of how and why vocal modulation varies across social contexts and its impact on receiver psychology. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part I)’.

Get full-text (via PubEx)

Across demographics and recent history, most parents sing to their infants and toddlers daily

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2021.0089 ◽

2021 ◽

Vol 376 (1840) ◽

Cited By ~ 1

Author(s):

Ran Yan ◽

Ghazal Jessani ◽

Elizabeth S. Spelke ◽

Peter de Villiers ◽

Jill de Villiers ◽

...

Keyword(s):

Social Impact ◽

Daily Basis ◽

Human Society ◽

Theme Issue ◽

Technological Environment ◽

Older Parents ◽

The Past ◽

Recorded Music ◽

The Everyday ◽

Voice Modulation

Music is universally prevalent in human society and is a salient component of the lives of young families. Here, we studied the frequency of singing and playing recorded music in the home using surveys of parents with infants ( N = 945). We found that most parents sing to their infant on a daily basis and the frequency of infant-directed singing is unrelated to parents’ income or ethnicity. Two reliable individual differences emerged, however: (i) fathers sing less than mothers and (ii) as infants grow older, parents sing less. Moreover, the latter effect of child age was specific to singing and was not reflected in reports of the frequency of playing recorded music. Last, we meta-analysed reports of the frequency of infant-directed singing and found little change in its frequency over the past 30 years, despite substantial changes in the technological environment in the home. These findings, consistent with theories of the psychological functions of music, in general, and infant-directed singing, in particular, demonstrate the everyday nature of music in infancy. This article is part of the theme issue ‘Voice modulation: from origin and mechanism to social impact (Part I)’.

Get full-text (via PubEx)

voice modulation
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Vocal size exaggeration may have contributed to the origins of vocalic complexity

The neural control of volitional vocal production—from speech to identity, from social meaning to song

Vocal communication across cultures: theoretical and methodological issues

Perception of group membership from spontaneous and volitional laughter

The shallow of your smile: the ethics of expressive vocal deep-fakes

The bouba/kiki effect is robust across cultures and writing systems

Musicality in human vocal communication: an evolutionary perspective

A cross-species framework to identify vocal learning abilities in mammals

Vocal modulation in human mating and competition

Across demographics and recent history, most parents sing to their infants and toddlers daily

Export Citation Format

voice modulationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Vocal size exaggeration may have contributed to the origins of vocalic complexity

The neural control of volitional vocal production—from speech to identity, from social meaning to song

Vocal communication across cultures: theoretical and methodological issues

Perception of group membership from spontaneous and volitional laughter

The shallow of your smile: the ethics of expressive vocal deep-fakes

The bouba/kiki effect is robust across cultures and writing systems

Musicality in human vocal communication: an evolutionary perspective

A cross-species framework to identify vocal learning abilities in mammals

Vocal modulation in human mating and competition

Across demographics and recent history, most parents sing to their infants and toddlers daily

voice modulation
Recently Published Documents