BLSTM neural networks for speech driven head motion synthesis

Author(s):  
Chuang Ding ◽  
Pengcheng Zhu ◽  
Lei Xie
2014 ◽  
Vol 74 (22) ◽  
pp. 9871-9888 ◽  
Author(s):  
Chuang Ding ◽  
Lei Xie ◽  
Pengcheng Zhu

2014 ◽  
Author(s):  
Chuang Ding ◽  
Pengcheng Zhu ◽  
Lei Xie ◽  
Dongmei Jiang ◽  
Zhong-Hua Fu

2013 ◽  
Author(s):  
David Adam Braude ◽  
Hiroshi Shimodaira ◽  
Atef Ben-Youssef
Keyword(s):  

2019 ◽  
Author(s):  
Ryan Cunningham ◽  
María B. Sánchez ◽  
Ian D. Loram

Objectives: To automate online segmentation of cervical muscles from transverse ultrasound (US) images of the human neck during functional head movement. To extend ground-truth labelling methodology beyond dependence upon MRI imaging of static head positions required for application to participants with involuntary movement disorders. Method: We collected sustained sequences (> 3 minutes) of US images of human posterior cervical neck muscles at 25 fps from 28 healthy adults, performing visually-guided pitch and yaw head motions. We sampled 1,100 frames (approx. 40 per participant) spanning the experimental range of head motion. We manually labelled all 1,100 US images and trained deconvolutional neural networks (DCNN) with a spatial SoftMax regression layer to classify every pixel in the full resolution (525x491) US images, as one of 14 classes (10 muscles, ligamentum nuchae, vertebra, skin, background). We investigated ‘MaxOut’ and Exponential Linear unit (ELU) transfer functions and compared with our previous benchmark (analytical shape modelling). Results: These DCNNs showed higher Jaccard Index (53.2%) and lower Hausdorff Distance (5.7 mm) than the previous benchmark (40.5%, 6.2 mm). SoftMax Confidence corresponded with correct classification. ‘MaxOut’ outperformed ELU marginally. Conclusion: The DCNN architecture accommodates challenging images and imperfect manual labels. The SoftMax layer gives user feedback of likely correct classification. The ‘MaxOut’ transfer function benefits from near-linear operation, compatibility with deconvolution operations and the dropout regulariser. Significance: This methodology for labelling ground-truth and training automated labelling networks is applicable for dynamic segmentation of moving muscles and for participants with involuntary movement disorders who cannot remain still.


2005 ◽  
Vol 16 (3-4) ◽  
pp. 283-290 ◽  
Author(s):  
Carlos Busso ◽  
Zhigang Deng ◽  
Ulrich Neumann ◽  
Shrikanth Narayanan

Sign in / Sign up

Export Citation Format

Share Document