A multi-language text-to-speech module

Author(s):  
R. Carlson ◽  
B. Granstrom ◽  
S. Hunnicutt
Keyword(s):  
2018 ◽  
Vol 2 (2) ◽  
pp. 92
Author(s):  
Sasanko Sekhar Gantayat

A text-to-speech (TTS) system converts normal language text into speech. An intelligent text-to-speech program allows people with visual impairments or reading disabilities, to listen to written works on a home computer. Many computer operating systems and day to day software applications like Adobe Reader have included text-to-speech systems. This paper is presented to show that how HMM can be used as a tool to convert text to speech.


Author(s):  
Soumya Priyadarsini Panda ◽  
Ajit Kumar Nayak

This paper presents a novel technique for context based numeral reading in Indian language text to speech systems. The model uses a set of rules to determine the context of the numeral pronunciation and is being integrated with the waveform concatenation technique to produce speech out of the input text in Indian languages. For this purpose, the three Indian languages Odia, Hindi and Bengali are considered. To analyze the performance of the proposed technique, a set of experiments are performed considering different context of numeral pronunciations and the results are compared with existing syllable-based technique. The results obtained from different experiments shows the effectiveness of the proposed technique in producing intelligible speech out of the entered text utterances compared to the existing technique even with very less storage and execution time.


2001 ◽  
Vol 7 (1) ◽  
pp. 47-86 ◽  
Author(s):  
M. THEUNE ◽  
E. KLABBERS ◽  
J. R. DE PIJPER ◽  
E. KRAHMER ◽  
J. ODIJK

We present a data-to-speech system called D2S, which can be used for the creation of data-to-speech systems in different languages and domains. The most important characteristic of a data-to-speech system is that it combines language and speech generation: language generation is used to produce a natural language text expressing the system's input data, and speech generation is used to make this text audible. In D2S, this combination is exploited by using linguistic information available in the language generation module for the computation of prosody. This allows us to achieve a better prosodic output quality than can be achieved in a plain text-to-speech system. For language generation in D2S, the use of syntactically enriched templates is guided by knowledge of the discourse context, while for speech generation pre-recorded phrases are combined in a prosodically sophisticated manner. This combination of techniques makes it possible to create linguistically sound but efficient systems with a high quality language and speech output.


2021 ◽  
Author(s):  
Zengqiang Shang ◽  
Zhihua Huang ◽  
Haozhe Zhang ◽  
Pengyuan Zhang ◽  
Yonghong Yan

2015 ◽  
Vol 76 ◽  
pp. 417-424 ◽  
Author(s):  
Izzad Ramli ◽  
Nursuriati Jamil ◽  
Noraini Seman ◽  
Norizah Ardi

Sign in / Sign up

Export Citation Format

Share Document