Fluent personalized speech synthesis with prosodic word-level spontaneous speech generation

Mapping Intimacies ◽

10.21437/interspeech.2015-120 ◽

2015 ◽

Author(s):

Yi-Chin Huang ◽

Chung-Hsien Wu ◽

Ming-Ge Shie

Keyword(s):

Speech Synthesis ◽

Spontaneous Speech ◽

Prosodic Word ◽

Speech Generation

Download Full-text

Toward Spontaneous Speech Synthesis—Utilizing Language Model Information in TTS

IEEE Transactions on Speech and Audio Processing ◽

10.1109/tsa.2004.828635 ◽

2004 ◽

Vol 12 (4) ◽

pp. 436-445 ◽

Author(s):

S. Werner ◽

M. Eichner ◽

M. Wolff ◽

R. Hoffmann

Keyword(s):

Speech Synthesis ◽

Language Model ◽

Spontaneous Speech

Download Full-text

Toward hidden Markov model‐based spontaneous speech synthesis

The Journal of the Acoustical Society of America ◽

10.1121/1.4787189 ◽

2006 ◽

Vol 120 (5) ◽

pp. 3037-3038

Author(s):

Tatsuya Akagawa ◽

Koji Iwano ◽

Sadaoki Furui

Keyword(s):

Markov Model ◽

Hidden Markov Model ◽

Speech Synthesis ◽

Hidden Markov ◽

Spontaneous Speech ◽

Download Full-text

Testing the consistency assumption: Pronunciation variant forced alignment in read and spontaneous speech synthesis

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2016.7472660 ◽

2016 ◽

Author(s):

Rasmus Dall ◽

Sandrine Brognaux ◽

Korin Richmond ◽

Cassia Valentini-Botinhao ◽

Gustav Eje Henter ◽

...

Keyword(s):

Speech Synthesis ◽

Spontaneous Speech

Download Full-text

In which Clause do Subordinate Conjunctions Prosodically Belong?

Journal of Linguistics/Jazykovedný casopis ◽

10.2478/jazcas-2019-0052 ◽

2019 ◽

Vol 70 (2) ◽

pp. 216-224

Author(s):

Zuzana Komrsková ◽

Petra Poukarová

Keyword(s):

Spontaneous Speech ◽

Subordinate Clause ◽

Prosodic Word ◽

Functional Differences

Abstract This paper deals with the position of three Czech subordinating conjunctions že ’that’, když ‘when’, and až ‘when’ within the prosodic word, using the phonetic annotation in the ORTOFON corpus. The position of subordinating conjunctions is traditionally described as initial within the subordinate clause, but the situation in spontaneous speech is not so clear. This paper shows the functional differences between the various positions within the prosodic word and presents the words which are most frequently combined with the selected conjunctions.

Download Full-text

Word-Level Style Control for Expressive, Non-attentive Speech Synthesis

10.1007/978-3-030-87802-3_31 ◽

2021 ◽

pp. 336-347

Author(s):

Konstantinos Klapsas ◽

Nikolaos Ellinas ◽

June Sig Sung ◽

Hyoungmin Park ◽

Spyros Raptis

Keyword(s):

Speech Synthesis ◽

Download Full-text

Error Free Punjabi Text to Speech Generation System based on Phonemes

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i8.134 ◽

2018 ◽

Vol 6 (8) ◽

pp. 172

Author(s):

Tejinder Kaur ◽

Charanjiv Singh

Keyword(s):

Computer System ◽

Distinctive Feature ◽

Speech Signal ◽

Speech Synthesis ◽

Euclidean Distance ◽

Previous Method ◽

Text To Speech ◽

Generation System ◽

Speech Synthesizer ◽

Speech Generation

Text-to-speech (TTS) is the generation ofsynthesized speech from text.Language is the ability to express one’sthoughts by means of a set of signs (text), gestures,and sounds. It is a distinctive feature of humanbeings, who are the only creatures to use such asystem. Speech is the oldest means of communicationbetween people and it is also the most widely used.‘Speech synthesis’ also called ‘Text to speechsynthesis’ is the artificial production ofhuman speech. A computer system used for thispurpose is called a speech synthesizer and can beimplemented in software. A text-to-speech(TTS) system converts text to speech.The proposed Enhanced Transcriptions Method is developed using Microsoft Visual Studio in VB.Net Language. Firstly word indexing is performed for the predefined words then corresponding speech signal is detected and errors in words are calculated using Euclidean distance. The results of the proposed work shows that Enhanced Transcriptions Method has more accuracy 89% as compared to previous Transcriptions Method 79%. The value of specificity for proposed method is 0.89 and for previous method is 0.79.

Download Full-text

Speech Sound Development of Toddlers in Spontaneous Speech: Segmental Level and Whole Word Level Analysis

Korean Journal of Early Childhood Special Education ◽

10.21214/kecse.2016.16.2.111 ◽

2016 ◽

Vol 16 (2) ◽

pp. 111-130 ◽

Author(s):

Eun Hee Park ◽

◽

Mi Sun Yoon ◽

Keyword(s):

Speech Sound ◽

Spontaneous Speech ◽

Speech Sound Development ◽

Download Full-text

Perception of smiling voice in spontaneous speech synthesis

10.21437/ssw.2021-19 ◽

2021 ◽

Author(s):

Ambika Kirkland ◽

Marcin Włodarczak ◽

Joakim Gustafson ◽

Eva Szekely

Keyword(s):

Speech Synthesis ◽

Spontaneous Speech

Download Full-text

Personality in the mix - investigating the contribution of fillers and speaking style to the perception of spontaneous speech synthesis

10.21437/ssw.2021-9 ◽

2021 ◽

Author(s):

Joakim Gustafson ◽

Jonas Beskow ◽

Eva Szekely

Keyword(s):

Speech Synthesis ◽

Spontaneous Speech ◽

Download Full-text

Spontaneous speech synthesis by pronunciation variant selection - a comparison to natural speech

10.21437/interspeech.2007-498 ◽

2007 ◽

Author(s):

Steffen Werner ◽

Rüdiger Hoffmann

Keyword(s):

Speech Synthesis ◽

Spontaneous Speech ◽

Natural Speech ◽

Variant Selection

Download Full-text