formant synthesis Latest Research Papers

Modern speech synthesis for phonetic sciences: a discussion and an evaluation

10.31234/osf.io/dxvhc ◽

2020 ◽

Author(s):

Zofia Malisz ◽

Gustav Eje Henter ◽

Cassia Valentini-Botinhao ◽

Oliver Watts ◽

Jonas Beskow ◽

...

Keyword(s):

Speech Synthesis ◽

State Of The Art ◽

Reaction Times ◽

Natural Speech ◽

Decision Task ◽

Synthesis Reaction ◽

Text To Speech ◽

Rule Based ◽

Quantum Leap ◽

Formant Synthesis

Decades of gradual advances in speech synthesis have recently culminated in exponential improvements fuelled by deep learning. This quantum leap has the potential to finally deliver realistic, controllable, and robust synthetic stimuli for speech experiments. In this article, we discuss these and other implications for phonetic sciences. We substantiate our argument by evaluating classic rule-based formant synthesis against state-of-the-art synthesisers on a) subjective naturalness ratings and b) a behavioural measure (reaction times in a lexical decision task). We also differentiate between text-to-speech and speech-to-speech methods. Naturalness ratings indicate that all modern systems are substantially closer to natural speech than formant synthesis. Reaction times for several modern systems do not differ substantially from natural speech, meaning that the processing gap observed in older systems, and reproduced with our formant synthesiser, is no longer evident. Importantly, some speech-to-speech methods are nearly indistinguishable from natural speech on both measures.

Text-to-Speech Synthesis

The Oxford Handbook of Computational Linguistics 2nd edition ◽

10.1093/oxfordhb/9780199573691.013.38 ◽

2018 ◽

Author(s):

Thierry Dutoit ◽

Yannis Stylianou

Keyword(s):

Speech Synthesis ◽

Markov Models ◽

Text To Speech ◽

Functional Perspective ◽

Formant Synthesis ◽

Engineering Costs ◽

Text To Speech Synthesis ◽

Major Shift ◽

Learning Architectures ◽

Real Challenge

Text-to-speech (TTS) synthesis is the art of designing talking machines. Seen from this functional perspective, the task looks simple, but this chapter shows that delivering intelligible, natural-sounding, and expressive speech, while also taking into account engineering costs, is a real challenge. Speech synthesis has made a long journey from the big controversy in the 1980s, between MIT’s formant synthesis and Bell Labs’ diphone-based concatenative synthesis. While unit selection technology, which appeared in the mid-1990s, can be seen as an extension of diphone-based approaches, the appearance of Hidden Markov Models (HMM) synthesis around 2005 resulted in a major shift back to models. More recently, the statistical approaches, supported by advanced deep learning architectures, have been shown to advance text analysis and normalization as well as the generation of the waveforms. Important recent milestones have been Google’s Wavenet (September 2016) and the sequence-to-sequence models referred to as Tacotron (I and II).

Formant Synthesis of Kannada Consonant Vowel (CV) Co-Articulations

2017 International Conference on Current Trends in Computer, Electrical, Electronics and Communication (CTCEEC) ◽

10.1109/ctceec.2017.8455073 ◽

2017 ◽

Author(s):

Alfred Vivek DrSouza ◽

D.J Ravi

Keyword(s):

Formant Synthesis

Text to speech synthesizer-formant synthesis

2017 International Conference on Nascent Technologies in Engineering (ICNTE) ◽

10.1109/icnte.2017.7947945 ◽

2017 ◽

Cited By ~ 2

Author(s):

Sneha Lukose ◽

Savitha S. Upadhya

Keyword(s):

Text To Speech ◽

Speech Synthesizer ◽

Formant Synthesis

Thai speech synthesis based on Formant synthesis for home robot

2016 International Computer Science and Engineering Conference (ICSEC) ◽

10.1109/icsec.2016.7859899 ◽

2016 ◽

Author(s):

Chaiyong Khorinphan ◽

Saiyan Saiyod ◽

Pichet Wayalun

Keyword(s):

Speech Synthesis ◽

Formant Synthesis ◽

Home Robot

Accelerating the formant synthesis of haegeum sounds using a general-purpose graphics processing unit

Multimedia Tools and Applications ◽

10.1007/s11042-014-2297-3 ◽

2014 ◽

Vol 75 (23) ◽

pp. 15445-15459 ◽

Cited By ~ 2

Author(s):

Myeongsu Kang ◽

Shohidul Islam ◽

Rashedul Islam ◽

Jong-Myon Kim

Keyword(s):

Graphics Processing Unit ◽

General Purpose ◽

Processing Unit ◽

Formant Synthesis ◽

Graphics Processing

Thai speech synthesis with emotional tone: Based on Formant synthesis for Home Robot

2014 Third ICT International Student Project Conference (ICT-ISPC) ◽

10.1109/ict-ispc.2014.6923230 ◽

2014 ◽

Cited By ~ 1

Author(s):

Chaiyong Khorinphan ◽

Sukanya Phansamdaeng ◽

Saiyan Saiyod

Keyword(s):

Speech Synthesis ◽

Emotional Tone ◽

Formant Synthesis ◽

Home Robot

Synthesis of Resonance by Nonlinear Distortion Methods

Computer Music Journal ◽

10.1162/comj_a_00160 ◽

2013 ◽

Vol 37 (1) ◽

pp. 35-43

Author(s):

Victor Lazzarini ◽

Joseph Timoney

Keyword(s):

Frequency Modulation ◽

Nonlinear Distortion ◽

Sound Synthesis ◽

Classic Case ◽

Modulation Technique ◽

Formant Synthesis ◽

Resonator Model ◽

Made In

This article explores techniques for synthesizing resonant sounds using the principle of nonlinear distortion. These methods can be grouped under the heading of “subtractive synthesis without filters,” the case for which has been made in the literature. Starting with a simple resonator model, this article looks at how the source-modifier arrangement can be reconstructed as a heterodyne structure made of a sinusoidal carrier and a complex modulator. From this, we examine how the modulator signal can be created with nonlinear distortion methods, looking at the classic case of phase-aligned formant synthesis and then our own modified frequency-modulation technique. The article concludes with some application examples of this sound-synthesis principle.

Formant Speech Synthesis Based on Trainable Model

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.303-306.1334 ◽

2013 ◽

Vol 303-306 ◽

pp. 1334-1337

Author(s):

Zhi Ping Zhang ◽

Xi Hong Wu

Keyword(s):

Speech Synthesis ◽

Synthesis Method ◽

Experimental Results ◽

Trajectory Model ◽

Formant Synthesis ◽

Speech Data ◽

Model Training

The authors proposed a trainable formant synthesis method based on the multi-channel Hidden Trajectory Model (HTM). In the method, the phonetic targets, formant trajectories and spectrum states from the oral, nasal, voiceless and background channels were designed to construct hierarchical hidden layers, and then spectrum were generated as observable features. In model training, the phonemic targets were learned from one-hour training speech data and the boundaries of phonemes were also aligned. The experimental results showed that the speech could be reconstructed with the formant trainable model by a source-filter synthesizer.

Joint pitch-analysis formant-synthesis framework for CS recovery of speech

10.21437/interspeech.2012-283 ◽

2012 ◽

Author(s):

Srikanth Raj Chetupally ◽

Thippur V. Sreenivas

Keyword(s):

Formant Synthesis ◽

Pitch Analysis

formant synthesis
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Modern speech synthesis for phonetic sciences: a discussion and an evaluation

Text-to-Speech Synthesis

Formant Synthesis of Kannada Consonant Vowel (CV) Co-Articulations

Text to speech synthesizer-formant synthesis

Thai speech synthesis based on Formant synthesis for home robot

Accelerating the formant synthesis of haegeum sounds using a general-purpose graphics processing unit

Thai speech synthesis with emotional tone: Based on Formant synthesis for Home Robot

Synthesis of Resonance by Nonlinear Distortion Methods

Formant Speech Synthesis Based on Trainable Model

Joint pitch-analysis formant-synthesis framework for CS recovery of speech

Export Citation Format

formant synthesisRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Modern speech synthesis for phonetic sciences: a discussion and an evaluation

Text-to-Speech Synthesis

Formant Synthesis of Kannada Consonant Vowel (CV) Co-Articulations

Text to speech synthesizer-formant synthesis

Thai speech synthesis based on Formant synthesis for home robot

Accelerating the formant synthesis of haegeum sounds using a general-purpose graphics processing unit

Thai speech synthesis with emotional tone: Based on Formant synthesis for Home Robot

Synthesis of Resonance by Nonlinear Distortion Methods

Formant Speech Synthesis Based on Trainable Model

Joint pitch-analysis formant-synthesis framework for CS recovery of speech

formant synthesis
Recently Published Documents