Nepali Text to Speech Synthesis System using FreeTTS

Krishna Bikram Shah; Kiran Kumar Chaudhary; Ashmita Ghimire

doi:10.3126/scitech.v13i1.23498

SCITECH Nepal ◽

10.3126/scitech.v13i1.23498 ◽

2018 ◽

Vol 13 (1) ◽

pp. 24-31

Author(s):

Krishna Bikram Shah ◽

Kiran Kumar Chaudhary ◽

Ashmita Ghimire

Keyword(s):

Language Processing ◽

Communication Technology ◽

Speech Synthesis ◽

Digital Signal ◽

Disabled Persons ◽

Human Communication ◽

Text To Speech ◽

Synthesis System ◽

Text To Speech Synthesis ◽

Visually Handicapped

This paper presents the tools and methodology used to develop a Nepali text-to-speech (TTS) synthesis system. The system is developed entirely in Java and uses the FreeTTS synthesizer. Speech is the vocalized form of human communication. Here, Nepali is synthesized using a formant-based approach and FreeTTS, a popular, publicly available generic framework for developing TTS systems. The text-to-speech architecture places more emphasis on the natural language processing (NLP) component than on the digital signal processing (DSP) component. Because Nepali is the most widely used language in Nepal and is also spoken in parts of India and abroad, a TTS synthesizer for this language should prove a convenient tool and an information and communication technology (ICT) based aid for the many people who are illiterate and for those with physical impairments, such as visually handicapped and vocally disabled persons. The ability to convert text to voice may reduce the dependency, frustration, and sense of helplessness of these people. The system can be extended with more features, such as emotions, improved tokenization, interactive options, and the use of a minimal database.


Text-to-Speech Synthesis

10.1093/oxfordhb/9780199276349.013.0017 ◽

2012 ◽

Cited By ~ 2

Author(s):

Thierry Dutoit ◽

Yannis Stylianou

Keyword(s):

Language Processing ◽

Speech Synthesis ◽

State Of The Art ◽

Digital Signal ◽

Text To Speech ◽

Waveform Generation ◽

Sentence Level ◽

Text To Speech Synthesis ◽

Commercial Applications ◽

Prosody Generation

This article introduces state-of-the-art text-to-speech (TTS) synthesis systems, covering both the natural language processing and the digital signal processing problems involved. Text-to-speech (TTS) synthesis is the art of designing talking machines. The article begins with a brief user-oriented description of a general TTS system and comments on its commercial applications. It then gives a functional diagram of a modern TTS system, highlighting its components, and describes its morphosyntactic module. Furthermore, it examines why sentence-level phonetization cannot be achieved by a sequence of dictionary look-ups, and describes possible implementations of the phonetizer. Finally, the article describes prosody generation, outlining how intonation and duration can be approximately computed from text. Prosody refers to certain properties of the speech signal that are related to audible changes in pitch, loudness, and syllable length. The article also introduces the two main existing categories of techniques for waveform generation: synthesis by rule and concatenative synthesis.


A prosodic phrasing model for a Korean text-to-speech synthesis system

10.21437/interspeech.2004-463 ◽

2004 ◽

Author(s):

Kyuchul Yoon

Keyword(s):

Korean Text ◽

Speech Synthesis ◽

Text To Speech ◽

Synthesis System ◽

Prosodic Phrasing ◽

Text To Speech Synthesis


Developing Resources for Te Reo Māori Text To Speech Synthesis System

Text, Speech, and Dialogue - Lecture Notes in Computer Science ◽

10.1007/978-3-030-58323-1_32 ◽

2020 ◽

pp. 294-302

Author(s):

Jesin James ◽

Isabella Shields ◽

Rebekah Berriman ◽

Peter J. Keegan ◽

Catherine I. Watson

Keyword(s):

Speech Synthesis ◽

Text To Speech ◽

Synthesis System ◽

Text To Speech Synthesis


A prosodic phrasing model for a Korean text-to-speech synthesis system

Computer Speech & Language ◽

10.1016/j.csl.2005.01.001 ◽

2006 ◽

Vol 20 (1) ◽

pp. 69-79 ◽

Cited By ~ 11

Author(s):

Kyuchul Yoon

Keyword(s):

Korean Text ◽

Speech Synthesis ◽

Text To Speech ◽

Synthesis System ◽

Prosodic Phrasing ◽

Text To Speech Synthesis


Review on Unit Selection-Based Concatenation Approach in Text to Speech Synthesis System

Cybernetics, Cognition and Machine Learning Applications - Algorithms for Intelligent Systems ◽

10.1007/978-981-33-6691-6_22 ◽

2021 ◽

pp. 191-202

Author(s):

Priyanka Gujarathi ◽

Sandip Raosaheb Patil

Keyword(s):

Speech Synthesis ◽

Text To Speech ◽

Synthesis System ◽

Unit Selection ◽

Text To Speech Synthesis


Text-to-Speech Synthesis

Encyclopedia of Multimedia Technology and Networking ◽

10.4018/978-1-59140-561-0.ch135 ◽

2011 ◽

pp. 957-963

Author(s):

Mahbubur R. Syed ◽

Shuvro Chakrobartty ◽

Robert J. Bignall

Keyword(s):

Speech Production ◽

Speech Synthesis ◽

Synthetic Speech ◽

Practical Application ◽

Text To Speech ◽

Synthesis System ◽

System A ◽

Vocal System ◽

Text To Speech Synthesis ◽

Computer Based

Speech synthesis is the process of producing natural-sounding, highly intelligible synthetic speech by machine, in such a way that it sounds as if it were produced by a human vocal system. A text-to-speech (TTS) synthesis system is a computer-based system where the input is text and the output is a simulated vocalization of that text. Before the 1970s, most speech synthesis was achieved with hardware, but this was costly and it proved impossible to properly simulate natural speech production. Since the 1970s, the use of computers has made the practical application of speech synthesis more feasible.
