articulatory inversion
Recently Published Documents


TOTAL DOCUMENTS

85
(FIVE YEARS 23)

H-INDEX

10
(FIVE YEARS 2)

2021 ◽  
Vol 11 (19) ◽  
pp. 9056
Author(s):  
Guolun Sun ◽  
Zhihua Huang ◽  
Li Wang ◽  
Pengyuan Zhang

Articulatory features are proved to be efficient in the area of speech recognition and speech synthesis. However, acquiring articulatory features has always been a difficult research hotspot. A lightweight and accurate articulatory model is of significant meaning. In this study, we propose a novel temporal convolution network-based acoustic-to-articulatory inversion system. The acoustic feature is converted into a high-dimensional hidden space feature map through temporal convolution with frame-level feature correlations taken into account. Meanwhile, we construct a two-part target function combining prediction’s Root Mean Square Error (RMSE) and the sequences’ Pearson Correlation Coefficient (PCC) to jointly optimize the performance of the specific inversion model from both aspects. We also further conducted an analysis on the impact of the weight between the two parts on the final performance of the inversion model. Extensive experiments have shown that our, temporal convolution networks (TCN) model outperformed the Bi-derectional Long Short Term Memory model by 1.18 mm in RMSE and 0.845 in PCC with 14 model parameters when optimizing evenly with RMSE and PCC aspects.


2021 ◽  
Author(s):  
Abdolreza Sabzi Shahrebabaki ◽  
Sabato Marco Siniscalchi ◽  
Torbjørn Svendsen

Author(s):  
Abdolreza Sabzi Shahrebabaki ◽  
Negar Olfati ◽  
Ali Shariq Imran ◽  
Magne Hallstein Johnsen ◽  
Sabato Marco Siniscalchi ◽  
...  

2020 ◽  
Author(s):  
Abdolreza Sabzi Shahrebabaki ◽  
Sabato Marco Siniscalchi ◽  
Giampiero Salvi ◽  
Torbjørn Svendsen

Sign in / Sign up

Export Citation Format

Share Document