Evidence for Nonlinear Sound Production Mechanisms in the Vocal Tract

Author(s):  
H. M. Teager ◽  
S. M. Teager
Biology ◽  
2021 ◽  
Vol 10 (8) ◽  
pp. 750
Author(s):  
Angela S. Stoeger ◽  
Anton Baotic ◽  
Gunnar Heilmann

How do elephants achieve their enormous vocal flexibility when communicating, imitating or creating idiosyncratic sounds? The mechanisms that underpin this trait combine motoric abilities with vocal learning processes. We demonstrate the unusual production techniques used by five African savanna elephants to create idiosyncratic sounds, which they learn to produce on cue by positive reinforcement training. The elephants generate these sounds by applying nasal tissue vibration via an ingressive airflow at the trunk tip, or by contracting defined superficial muscles at the trunk base. While the production mechanisms of the individuals performing the same sound categories are similar, they do vary in fine-tuning, revealing that each individual has its own specific sound-producing strategy. This plasticity reflects the creative and cognitive abilities associated with ‘vocal’ learning processes. The fact that these sounds were reinforced and cue-stimulated suggests that social feedback and positive reinforcement can facilitate vocal creativity and vocal learning behavior in elephants. Revealing the mechanism and the capacity for vocal learning and sound creativity is fundamental to understanding the eloquence within the elephants’ communication system. This also helps to understand the evolution of human language and of open-ended vocal systems, which build upon similar cognitive processes.


2005 ◽  
Vol 83 (7) ◽  
pp. 721-737
Author(s):  
H Teffahi ◽  
B Guerin ◽  
A Djeradi

Knowledge of vocal tract area functions is important for the understanding of phenomena occurring during speech production. We present here a new measurement method based on the external excitation of the vocal tract with a known pseudo-random sequence, where the area function is obtained by a linear prediction analysis applied to the cross-correlation between the sequence and the signal measured at the lips. The advantages of this method over methods based on sweep-tones or white noise excitation are (1) a much shorter measurement time (about 100 ms) and (2) the possibility of speech sound production during the measurement. This method has been checked against classical methods through systematic comparisons on a small corpus of vowels. Moreover, it has been verified that simultaneous speech sound production does not perturb significantly the measurements. This method should thus be a very helpful tool for the investigation of the acoustic properties of the vocal tract in various cases for vowels.


2004 ◽  
Author(s):  
Michael Barry

The design and testing of an experimental apparatus for in vitro study of phonatory aerodynamics (voice production) in humans is presented. The presentation includes not only the details of apparatus design, but flow visualization and Digital Particle Image Velocimetry (DPIV) measurements of the developing flow that occurs during the opening of the constriction from complete closure. The main features of the phonation process have long been understood. A proper combination of air flow from the lungs and of vocal fold tension initiates a vibration of the vocal folds, which in turn valves the airflow. The resulting periodic acceleration of the airstream through the glottis excites the acoustic modes of the vocal tract. It is further understood that the pressure gradient driving glottal flow is related to flow separation on the downstream side of the vocal folds. However, the details of this process and how it may contribute to effects such as aperiodicity of the voice and energy losses in voiced sound production are still not fully grasped. The experimental apparatus described in this paper is designed to address these issues. The apparatus itself consists of a scaled-up duct in which water flows through a constriction whose width is modulated by motion of the duct wall in a manner mimicking vocal fold vibration. Scaling the duct up 10 times and using water as the working fluid allows temporally and spatially resolved measurements of the dynamically similar flow velocity field using DPIV at video standard framing rates (15Hz). Dynamic similarity is ensured by matching the Reynolds number (based on glottal flow speed and glottis width) of 8000, and by varying the Strouhal number (based on vocal fold length, glottal flow speed, and a time scale characterizing the motion of the vocal folds) ranging from 0.01 to 0.1. The walls of the 28 cm × 28 cm test section and the vocal fold pieces are made of clear cast acrylic to allow optical access. The vocal fold pieces are 12.7 cm × 14 cm × 28 cm and are rectangular in shape, except for the surfaces which form the glottis, which are 6.35 cm radius half-circles. Dye injection slots are placed on the upstream side of both vocal field pieces to allow flow visualization. Prescribed motion of the vocal folds is provided by two linear stages. Linear bearings ensure smooth execution of the motion prescribed using a computer interface. Measurements described here use the Laser-Induced Fluorescence (LIF) flow visualization and DPIV techniques and are performed for two Strouhal numbers to assess the effect of opening time on the development of the glottal jet. These measurements are conducted on a plane oriented perpendicular to the glottis, at the duct midplane. LIF measurements use a 5W Argon ion laser to produce a light sheet, which illuminates the dye injected through a slot in each vocal fold piece. Two dye colors are used, one for each side. Quantitative information about the velocity and vorticity fields are obtained through DPIV measurements at the same location as the LIF measurements.


2020 ◽  
Vol 17 (1) ◽  
Author(s):  
Hannah Joy Kriesell ◽  
Céline Le Bohec ◽  
Alexander F. Cerwenka ◽  
Moritz Hertel ◽  
Jean-Patrice Robin ◽  
...  

2006 ◽  
Vol 14 (04) ◽  
pp. 555-566 ◽  
Author(s):  
TINA P. BENKO ◽  
MATJAŽ PERC

We analyze the sound recording of the Southeast Asian cicada Tosena depicta with methods of nonlinear time series analysis. First, we reconstruct the phase space from the sound recording and test it against determinism and stationarity. After positively establishing determinism and stationarity in the series, we calculate the maximal Lyapunov exponent. We find that the latter is positive, from which we conclude that the sound recording possesses clear markers of deterministic chaos. We discuss that methods of nonlinear time series analysis can yield instructive insights and foster the understanding of acoustic and vibrational communication among insects, as well as provide vital clues regarding the origin and functionality of their sound production mechanisms. Furthermore, such studies can serve as means to distinguish different insect genera or even species either from each other or under various environmental influences.


2015 ◽  
Vol 101 (2) ◽  
pp. 317-330 ◽  
Author(s):  
R. Auvray ◽  
A. Ernoult ◽  
S. Terrien ◽  
P. Y. Lagrée ◽  
B. Fabre ◽  
...  

eLife ◽  
2020 ◽  
Vol 9 ◽  
Author(s):  
Christopher Bergevin ◽  
Chandan Narayan ◽  
Joy Williams ◽  
Natasha Mhatre ◽  
Jennifer KE Steeves ◽  
...  

Khoomei is a unique singing style originating from the republic of Tuva in central Asia. Singers produce two pitches simultaneously: a booming low-frequency rumble alongside a hovering high-pitched whistle-like tone. The biomechanics of this biphonation are not well-understood. Here, we use sound analysis, dynamic magnetic resonance imaging, and vocal tract modeling to demonstrate how biphonation is achieved by modulating vocal tract morphology. Tuvan singers show remarkable control in shaping their vocal tract to narrowly focus the harmonics (or overtones) emanating from their vocal cords. The biphonic sound is a combination of the fundamental pitch and a focused filter state, which is at the higher pitch (1–2 kHz) and formed by merging two formants, thereby greatly enhancing sound-production in a very narrow frequency range. Most importantly, we demonstrate that this biphonation is a phenomenon arising from linear filtering rather than from a nonlinear source.


Sign in / Sign up

Export Citation Format

Share Document