Automatic generation of prosodic structure for high quality Mandarin speech synthesis

Author(s):  
Fu-Chiang Chou ◽  
Chiu-Yu Tseng ◽  
Lin-Shan Lee
1999 ◽  
Vol 106 (4) ◽  
pp. 2182-2182
Author(s):  
Hideki Kawahara ◽  
Parham S. Zolfaghari

Author(s):  
Yue Jiang ◽  
Zhouhui Lian ◽  
Yingmin Tang ◽  
Jianguo Xiao

Automatic generation of Chinese fonts, which consist of large numbers of glyphs with complicated structures, remains a challenging and ongoing problem in AI and Computer Graphics (CG). Traditional CG-based methods typically rely heavily on manual intervention, while recently popularized deep learning-based end-to-end approaches often produce synthesis results with incorrect structures and/or serious artifacts. To address these problems, this paper proposes a structure-guided Chinese font generation system, SCFont, built on deep stacked networks. The key idea is to integrate domain knowledge of Chinese characters with deep generative networks to ensure that high-quality glyphs with correct structures can be synthesized. More specifically, we first apply a CNN model to learn how to transfer writing trajectories with separated strokes from the reference font style to the target style. Then, we train another CNN model to learn how to recover shape details on the contours of the synthesized writing trajectories. Experimental results validate the superiority of the proposed SCFont over the state of the art in both visual and quantitative assessments.


2006 ◽  
Vol 18 (2) ◽  
pp. 195-202 ◽  
Author(s):  
Yuji Hosoda ◽  
Saku Egawa ◽  
Junichi Tamamoto ◽  
Kenjiro Yamamoto ◽  
...  

We are developing a robot that will support people in their daily lives, i.e., a human-symbiotic robot. This kind of robot is required to coexist with users, be user friendly, and be capable of supporting them. As a first step toward the last of these goals, we have developed an autonomous mobile robot that uses a self-balancing two-wheeled mobility system and a body swing mechanism to shift its center of gravity. This allows it to move nimbly at up to six kilometers per hour. It can also avoid collisions with obstacles and move safely through complex environments. It is able to interact with people naturally, without special tools, by means of distant-speech recognition and high-quality speech-synthesis technologies. These capabilities were demonstrated at the 2005 World Exposition, Aichi, Japan.

