The inverse problem for the vocal tract: Numerical methods, acoustical experiments, and speech synthesis

1983 ◽  
Vol 73 (3) ◽  
pp. 985-1002 ◽  
Author(s):  
Man Mohan Sondhi ◽  
J. R. Resnick
2003 ◽  
Vol 184 (1) ◽  
pp. 122-148 ◽  
Author(s):  
Yuriy A. Gryazin ◽  
Michael V. Klibanov ◽  
Thomas R. Lucas

Information ◽  
2020 ◽  
Vol 11 (10) ◽  
pp. 491
Author(s):  
Samuel Silva ◽  
Nuno Almeida ◽  
Conceição Cunha ◽  
Arun Joseph ◽  
Jens Frahm ◽  
...  

Technologies, such as real-time magnetic resonance (RT-MRI), can provide valuable information to evolve our understanding of the static and dynamic aspects of speech by contributing to the determination of which articulators are essential (critical) in producing specific sounds and how (gestures). While a visual analysis and comparison of imaging data or vocal tract profiles can already provide relevant findings, the sheer amount of available data demands and can strongly profit from unsupervised data-driven approaches. Recent work, in this regard, has asserted the possibility of determining critical articulators from RT-MRI data by considering a representation of vocal tract configurations based on landmarks placed on the tongue, lips, and velum, yielding meaningful results for European Portuguese (EP). Advancing this previous work to obtain a characterization of EP sounds grounded on Articulatory Phonology, important to explore critical gestures and advance, for example, articulatory speech synthesis, entails the consideration of a novel set of tract variables. To this end, this article explores critical variable determination considering a vocal tract representation aligned with Articulatory Phonology and the Task Dynamics framework. The overall results, obtained considering data for three EP speakers, show the applicability of this approach and are consistent with existing descriptions of EP sounds.


1992 ◽  
Vol 3 (3) ◽  
pp. 261-267
Author(s):  
P. N. Vabishchevich ◽  
A. Yu. Denisenko

2014 ◽  
Vol 24 (1) ◽  
pp. 7-20
Author(s):  
Brad H. Story

Models that take the form of artificial talkers and speech synthesis systems have long been used as a means of understanding both speech production and speech perception. The article begins with a brief history of two artificial speaking devices that exemplify the representation of speech production as a system of modulations. The development of a recent airway modulation model is then described that simulates the time-varying changes of the vocal tract and acoustic wave propagation. The result is a type of artificial talker that can be used to study various aspects of how sound is generated by humans and how that sound is perceived by a listener.


Sign in / Sign up

Export Citation Format

Share Document