Improving the Prosody of RNN-Based English Text-To-Speech Synthesis by Incorporating a BERT Model

Mapping Intimacies ◽

10.21437/interspeech.2020-1430 ◽

2020 ◽

Author(s):

Tom Kenter ◽

Manish Sharma ◽

Rob Clark

Keyword(s):

Speech Synthesis ◽

English Text ◽

Text To Speech ◽

Text To Speech Synthesis

Download Full-text

Accent level adjustment in bilingual Thai-English text-to-speech synthesis

2011 IEEE Workshop on Automatic Speech Recognition & Understanding ◽

10.1109/asru.2011.6163947 ◽

2011 ◽

Author(s):

Chai Wutiwiwatchai ◽

Ausdang Thangthai ◽

Ananlada Chotimongkol ◽

Chatchawarn Hansakunbuntheung ◽

Nattanun Thatphithakkul

Keyword(s):

Speech Synthesis ◽

English Text ◽

Text To Speech ◽

Text To Speech Synthesis ◽

Download Full-text

Design of English text-to-speech conversion algorithm based on machine learning

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189238 ◽

2020 ◽

pp. 1-12

Author(s):

Li Dongmei

Keyword(s):

Machine Learning ◽

Speech Synthesis ◽

Feature Recognition ◽

Learning Algorithm ◽

Morphological Structure ◽

English Text ◽

Text To Speech ◽

Part Of Speech ◽

Modern Computer ◽

Conversion Algorithm

English text-to-speech conversion is the key content of modern computer technology research. Its difficulty is that there are large errors in the conversion process of text-to-speech feature recognition, and it is difficult to apply the English text-to-speech conversion algorithm to the system. In order to improve the efficiency of the English text-to-speech conversion, based on the machine learning algorithm, after the original voice waveform is labeled with the pitch, this article modifies the rhythm through PSOLA, and uses the C4.5 algorithm to train a decision tree for judging pronunciation of polyphones. In order to evaluate the performance of pronunciation discrimination method based on part-of-speech rules and HMM-based prosody hierarchy prediction in speech synthesis systems, this study constructed a system model. In addition, the waveform stitching method and PSOLA are used to synthesize the sound. For words whose main stress cannot be discriminated by morphological structure, label learning can be done by machine learning methods. Finally, this study evaluates and analyzes the performance of the algorithm through control experiments. The results show that the algorithm proposed in this paper has good performance and has a certain practical effect.

Download Full-text

Integrating Articulatory Information in Deep Learning-Based Text-to-Speech Synthesis

10.21437/interspeech.2017-1762 ◽

2017 ◽

Author(s):

Beiming Cao ◽

Myungjong Kim ◽

Jan van Santen ◽

Ted Mau ◽

Jun Wang

Keyword(s):

Deep Learning ◽

Speech Synthesis ◽

Text To Speech ◽

Text To Speech Synthesis

Download Full-text

Subset Selection, Adaptation, Gemination and Prosody Prediction for Amharic Text-to-Speech Synthesis

10.21437/ssw.2019-37 ◽

2019 ◽

Author(s):

Elshadai Tesfaye Biru ◽

Yishak Tofik Mohammed ◽

David Tofu ◽

Erica Cooper ◽

Julia Hirschberg

Keyword(s):

Speech Synthesis ◽

Subset Selection ◽

Text To Speech ◽

Text To Speech Synthesis ◽

Prosody Prediction

Download Full-text

“I Can’t Talk Now”: Speaking with Voice Output Communication Aid Using Text-to-Speech Synthesis During Multiparty Video Conference

Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems ◽

10.1145/3411763.3451745 ◽

2021 ◽

Author(s):

Wooseok Kim ◽

Sangsu Lee

Keyword(s):

Speech Synthesis ◽

Video Conference ◽

Text To Speech ◽

Voice Output Communication Aid ◽

Communication Aid ◽

Text To Speech Synthesis ◽

Download Full-text

Comparative Study on Neural Vocoders for Multispeaker Text-To-Speech Synthesis

2020 IEEE Recent Advances in Intelligent Computational Systems (RAICS) ◽

10.1109/raics51191.2020.9332514 ◽

2020 ◽

Author(s):

Rajeev Rajan ◽

Ashish Roopan ◽

Sachin Prakash ◽

Elisa Jose ◽

Sati P.

Keyword(s):

Comparative Study ◽

Speech Synthesis ◽

Text To Speech ◽

Text To Speech Synthesis

Download Full-text

Comparison of Urdu text to speech synthesis using unit selection and HMM based techniques

2016 Conference of The Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA) ◽

10.1109/icsda.2016.7918988 ◽

2016 ◽

Author(s):

Farah Adeeba ◽

Tania Habib ◽

Sarmad Hussain ◽

Ehsan-ul-haq ◽

Kh. Shahzada Shahid

Keyword(s):

Speech Synthesis ◽

Text To Speech ◽

Unit Selection ◽

Text To Speech Synthesis

Download Full-text

Comparative study of text-to-speech synthesis techniques for mobile linguistic translation process

2014 IEEE International Conference on Control System, Computing and Engineering (ICCSCE 2014) ◽

10.1109/iccsce.2014.7072761 ◽

2014 ◽

Author(s):

Phanchita Chomwihoke ◽

Manop Phankokkruad

Keyword(s):

Comparative Study ◽

Speech Synthesis ◽

Text To Speech ◽

Translation Process ◽

Synthesis Techniques ◽

Text To Speech Synthesis

Download Full-text

The future role of text to speech synthesis in automated services

10.1049/ic:19970799 ◽

1997 ◽

Author(s):

A.P. Breen

Keyword(s):

Speech Synthesis ◽

Text To Speech ◽

Future Role ◽

Text To Speech Synthesis

Download Full-text

An advanced NLP framework for high-quality Text-to-Speech synthesis

2011 6th Conference on Speech Technology and Human-Computer Dialogue (SpeD) ◽

10.1109/sped.2011.5940733 ◽

2011 ◽

Author(s):

Catalin Ungurean ◽

Dragos Burileanu

Keyword(s):

Speech Synthesis ◽

Text To Speech ◽

High Quality ◽

Text To Speech Synthesis

Download Full-text