prosody generation
Recently Published Documents


TOTAL DOCUMENTS: 46 (FIVE YEARS: 6)

H-INDEX: 5 (FIVE YEARS: 0)

2020
Author(s): Shubhi Tyagi, Marco Nicolis, Jonas Rohnke, Thomas Drugman, Jaime Lorenzo-Trueba

2019
Author(s): Masashi Aso, Shinnosuke Takamichi, Norihiro Takamune, Hiroshi Saruwatari

Author(s): Chen-Yu Chiang, Yu-Ping Hung, Han-Yun Yeh, I-Bin Liao, Chen-Ming Pan

Author(s): Pongsathon Janyoi, Pusadee Seresangtakul

This paper describes a speech synthesis system for Isarn, a regional dialect spoken in the Northeast of Thailand. The study focuses on improving the system's prosody generation by using additional context features. To develop the system, the speech parameters (Mel-cepstrum and fundamental frequency of phonemes within different phonetic contexts) were modelled using hidden Markov models (HMMs). Synthetic speech was generated by converting the input text into context-dependent phonemes. Speech parameters were then generated from the trained HMMs according to the context-dependent phonemes and synthesized through a speech vocoder. Systems were trained using three different feature sets: basic contextual features, tonal features, and syllable-context features. Objective and subjective tests were conducted to evaluate the proposed system. The results indicated that adding the syllable-context features significantly improved the naturalness of the synthesized speech.
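As a rough illustration of what "context-dependent phonemes" with progressively richer feature sets might look like, the following Python sketch builds a label string for each phoneme. The label layout (loosely modelled on HTS-style full-context labels), the `Phone` fields, the tone coding, and the assumption that the three feature sets are cumulative are all hypothetical; the abstract does not specify them.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Phone:
    symbol: str            # phoneme symbol
    tone: int              # tone of the enclosing syllable (hypothetical coding)
    pos_in_syllable: int   # 1-based position of the phoneme in its syllable
    syllable_len: int      # number of phonemes in the syllable


def context_label(phones: List[Phone], i: int, feature_set: str) -> str:
    """Build a context-dependent label for phones[i].

    feature_set is one of "basic", "tonal", "syllable". The sets are
    assumed to be cumulative: "syllable" includes the tonal features.
    """
    if feature_set not in ("basic", "tonal", "syllable"):
        raise ValueError(f"unknown feature set: {feature_set}")
    cur = phones[i]
    # Basic context: previous and next phoneme, with "sil" at the edges.
    prev_sym = phones[i - 1].symbol if i > 0 else "sil"
    next_sym = phones[i + 1].symbol if i < len(phones) - 1 else "sil"
    label = f"{prev_sym}-{cur.symbol}+{next_sym}"
    if feature_set in ("tonal", "syllable"):
        label += f"/T:{cur.tone}"
    if feature_set == "syllable":
        label += f"/S:{cur.pos_in_syllable}_{cur.syllable_len}"
    return label


# Toy two-phoneme syllable; the symbols and tone value are made up.
phones = [Phone("k", 1, 1, 2), Phone("a", 1, 2, 2)]
labels = [context_label(phones, i, "syllable") for i in range(len(phones))]
```

In an HMM-based system, labels of this kind would typically drive decision-tree clustering of the acoustic models, so adding a field to the label makes that context available as a clustering question.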


2018
Author(s): Monica Dominguez, Alicia Burga, Mireia Farrús, Leo Wanner

Author(s): Chen-Yu Chiang, Yu-Ping Hung, Han-Yun Yeh, I-Bin Liao, Chen-Ming Pan

This paper proposes two linguistic features, extracted fully automatically from unlimited text input, for Mandarin prosody generation. One is the punctuation confidence (PC), which measures the likelihood of inserting a major punctuation mark (PM) at a word boundary. The other is the quotation confidence (QC), which measures the likelihood that a word string is quoted as a meaningful or emphasized unit in text. Because a major PM in a text is highly correlated with a prosodic break, and a quoted word string plays an important role in human language understanding, the two features could potentially provide useful information for prosody generation. The idea is realized by first employing conditional random field (CRF)-based models to predict major PMs, quoted word string locations, and their associated confidences, i.e., the PC and the QC, for each word boundary. The predicted punctuation marks and their confidences are then combined with traditional contextual linguistic features to predict prosodic-acoustic features. Both objective and subjective tests showed that prosody generation with the proposed linguistic features performed better than prosody generation without them. The proposed PC and QC are therefore promising features for Mandarin prosody generation.
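One plausible way to obtain a per-boundary confidence from a CRF is to take the marginal probability of the "major PM" label at each word boundary, computed with the forward-backward algorithm. The Python sketch below does this for a linear-chain CRF given log-domain scores. The score values, the two-label setup, and the reading of PC as a label marginal are assumptions for illustration; the abstract does not state how the confidences are derived.

```python
import math
from typing import List


def logsumexp(xs: List[float]) -> float:
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs))


def boundary_marginals(unary: List[List[float]],
                       trans: List[List[float]]) -> List[List[float]]:
    """Label marginals p(y_t = y | x) for a linear-chain CRF.

    unary[t][y] is the log-score of label y at boundary t;
    trans[a][b] is the log-score of moving from label a to label b.
    """
    T, K = len(unary), len(trans)
    alpha = [[0.0] * K for _ in range(T)]
    beta = [[0.0] * K for _ in range(T)]  # beta[T-1] stays at log(1) = 0
    alpha[0] = list(unary[0])
    # Forward pass: total log-score of all prefixes ending in label y.
    for t in range(1, T):
        for y in range(K):
            alpha[t][y] = unary[t][y] + logsumexp(
                [alpha[t - 1][yp] + trans[yp][y] for yp in range(K)])
    # Backward pass: total log-score of all suffixes following label y.
    for t in range(T - 2, -1, -1):
        for y in range(K):
            beta[t][y] = logsumexp(
                [trans[y][yn] + unary[t + 1][yn] + beta[t + 1][yn]
                 for yn in range(K)])
    log_z = logsumexp(alpha[T - 1])
    return [[math.exp(alpha[t][y] + beta[t][y] - log_z) for y in range(K)]
            for t in range(T)]


# Toy example: label 0 = "no major PM", label 1 = "major PM".
# Scores are made up; zero transition scores make boundaries independent.
unary = [[0.0, 0.0], [0.0, math.log(3.0)]]
trans = [[0.0, 0.0], [0.0, 0.0]]
marginals = boundary_marginals(unary, trans)
pc = [m[1] for m in marginals]  # hypothetical punctuation confidence per boundary
```

Under this reading, each `pc` value could be appended to the contextual linguistic feature vector of the corresponding word boundary before prosodic-acoustic feature prediction; a QC could be derived the same way from a quotation-labelling CRF.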


Author(s): D.N. Krishna, M.G. Khanum Noor Fathima, Mythri Thippareddy, A. Sricharan, V. Ramasubramanian
