Impostures of Talking Face Systems Using Automatic Face Animation

Author(s): Florian Verdet, Jean Hennebert
Author(s): Walid Karam, Chafic Mokbel, Hanna Greige, Guido Aversano, Catherine Pelachaud, ...

2021
Author(s): Sandika Biswas, Sanjana Sinha, Dipanjan Das, Brojeshwar Bhowmick

2021, Vol 11 (15), pp. 6975
Author(s): Tao Zhang, Lun He, Xudong Li, Guoqing Feng

Lipreading aims to recognize the sentences spoken by a talking face. In recent years, lipreading methods have achieved high accuracy on large datasets and made breakthrough progress. However, lipreading is still far from solved: existing methods tend to have high error rates on in-the-wild data and suffer from vanishing training gradients and slow convergence. To overcome these problems, we propose an efficient end-to-end sentence-level lipreading model that uses an encoder based on a 3D convolutional network, ResNet50, and a Temporal Convolutional Network (TCN), with a CTC objective function as the decoder. More importantly, the proposed architecture incorporates the TCN as a feature learner to decode features. This partly eliminates the vanishing-gradient and limited-performance defects of RNNs (LSTM, GRU), yielding a notable performance improvement as well as faster convergence. Experiments show that training and convergence are 50% faster than the state-of-the-art method, and that accuracy improves by 2.4% on the GRID dataset.
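The convergence advantage of a TCN over an RNN comes from its stack of dilated convolutions: the receptive field grows exponentially with depth, so a whole sentence can be covered without recurrence. A minimal sketch of that calculation, with an illustrative kernel size and dilation schedule (not the paper's exact configuration):

```python
# Receptive field of a stack of dilated temporal convolutions, as used
# in a TCN. Kernel size and dilation schedule below are illustrative
# assumptions, not the configuration reported in the paper.
def tcn_receptive_field(kernel_size, dilations):
    """Number of input frames each output frame can see."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf

# Four levels with dilations doubling each level and kernel size 3:
# coverage grows exponentially with depth, so long frame sequences are
# handled without recurrence (and without RNN-style vanishing gradients).
print(tcn_receptive_field(3, [1, 2, 4, 8]))  # -> 31
```

Doubling the dilation at each level means adding one level roughly doubles the coverage, which is why a shallow TCN can span an entire GRID sentence.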


2015, Vol 26 (4), pp. 490-498
Author(s): Ferran Pons, Laura Bosch, David J. Lewkowicz

Author(s): Takacs Gyorgy, Tihanyi Attila, Bardi Tamas, Feldhoffer Gergely

2006, Vol 2 (2-3), pp. 95-110
Author(s): Tomislav Kosutic, Miran Mosmondor, Ivan Andrisek, Mario Weber, Maja Matijasevic, ...

With the evolution of computer and mobile networking technologies comes the challenge of offering novel and complex multimedia applications and end-user services in heterogeneous environments, for developers and service providers alike. This paper describes one such service, called LiveMail, which explores the potential of existing face animation technologies for innovative and attractive services aimed at the mobile market. This prototype service allows mobile subscribers to communicate using personalized 3D face models created from images taken with their phone cameras. The user takes a snapshot of someone's face – a friend, a famous person, themselves, even a pet – with the mobile phone's camera. After a quick manipulation on the phone, a 3D model of that face is created and can be animated simply by typing in some text. Speech and the corresponding facial animation are generated automatically by speech synthesis. Furthermore, these highly personalized animations can be sent to others as real 3D animated messages or as short videos in MMS. The clients were implemented on different platforms, using different network and face animation techniques, and connected into one complex system. This paper presents the architecture of that system and the experience gained in building it.
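The snapshot-to-animated-message flow described above can be sketched as a two-stage pipeline. Everything here is a hypothetical illustration – the names (`snapshot_to_model`, `text_to_animation`, the viseme table) are assumptions for clarity, not the actual LiveMail API:

```python
# Hedged sketch of the LiveMail message flow: snapshot -> 3D face model,
# then typed text -> speech-driven facial animation. All names and the
# letter-to-viseme table are hypothetical stand-ins, not the real system.
from dataclasses import dataclass, field

@dataclass
class FaceModel:
    source_image: str            # snapshot taken with the phone camera

@dataclass
class AnimatedMessage:
    model: FaceModel
    text: str
    visemes: list = field(default_factory=list)  # mouth shapes for the animation

# Toy letter -> viseme table standing in for the speech-synthesis step
# that derives lip shapes from text.
_VISEMES = {"a": "open", "e": "spread", "i": "spread",
            "o": "round", "u": "round",
            "m": "closed", "b": "closed", "p": "closed"}

def snapshot_to_model(image_path: str) -> FaceModel:
    """Step 1: build a personalized 3D face model from a snapshot."""
    return FaceModel(source_image=image_path)

def text_to_animation(model: FaceModel, text: str) -> AnimatedMessage:
    """Step 2: the user types text; synthesis drives the face animation."""
    visemes = [_VISEMES.get(c, "neutral") for c in text.lower() if c.isalpha()]
    return AnimatedMessage(model=model, text=text, visemes=visemes)

msg = text_to_animation(snapshot_to_model("friend.jpg"), "Hello")
print(msg.visemes)  # -> ['neutral', 'spread', 'neutral', 'neutral', 'round']
```

In the real service the second stage would run server-side (speech synthesis plus MPEG-4-style face animation parameters), with the result delivered back to the phone as a 3D message or an MMS video.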


Author(s): Sefik Emre Eskimez, Ross K. Maddox, Chenliang Xu, Zhiyao Duan
