speech source localization
Recently Published Documents


TOTAL DOCUMENTS: 33 (FIVE YEARS 6)

H-INDEX: 5 (FIVE YEARS 1)

Complexity ◽ 2020 ◽ Vol 2020 ◽ pp. 1-7
Author(s): Lulu Wu ◽ Hong Liu ◽ Bing Yang ◽ Runwei Ding

Most binaural speech source localization models perform poorly in unprecedentedly noisy and reverberant situations. This work addresses the issue with a multiscale dilated convolutional neural network (CNN). The time-related cross-correlation function (CCF) and the energy-related interaural level differences (ILD) are processed in separate branches of the dilated convolutional network, so that the multiscale dilated CNN can encode a discriminative representation for each cue. After encoding, the individual interaural representations are fused and mapped to the source direction. Furthermore, to improve parameter adaptation, a novel semiadaptive entropy is proposed to train the network under directional constraints. Experimental results show that the proposed method can adaptively locate speech sources in simulated noisy and reverberant environments.
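
To make the two interaural cues named in the abstract concrete, the following is a minimal sketch of how a CCF and an ILD could be computed for one frame of a two-channel signal. It is an illustration under stated assumptions, not the authors' preprocessing: the abstract does not specify framing, filterbanks, or normalization, so the function name `binaural_features`, the `max_lag_ms` parameter, and the broadband energy ratio used for the ILD are all hypothetical choices.

```python
import numpy as np

def binaural_features(left, right, fs, max_lag_ms=1.0, eps=1e-12):
    """Return (ccf, lags_s, ild_db) for one frame of a binaural signal.

    Hypothetical helper for illustration; the paper's exact features
    may differ (e.g. per-band processing).
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)

    # Number of samples covered by the maximum lag of interest.
    max_lag = int(round(max_lag_ms * 1e-3 * fs))

    # Full cross-correlation; zero lag sits at index len(right) - 1.
    full = np.correlate(left, right, mode="full")
    zero = len(right) - 1

    # Normalize by the channel energies so values lie in [-1, 1].
    norm = np.sqrt(np.sum(left ** 2) * np.sum(right ** 2)) + eps
    ccf = full[zero - max_lag: zero + max_lag + 1] / norm
    lags_s = np.arange(-max_lag, max_lag + 1) / fs

    # Broadband ILD in dB: energy of the left channel relative to the right.
    ild_db = 10.0 * np.log10((np.sum(left ** 2) + eps) /
                             (np.sum(right ** 2) + eps))
    return ccf, lags_s, ild_db

# Usage: the left channel is a delayed, attenuated copy of the right one.
rng = np.random.default_rng(0)
right = rng.standard_normal(1024)
left = 0.5 * np.concatenate([np.zeros(5), right[:-5]])
ccf, lags_s, ild_db = binaural_features(left, right, fs=16000)
peak_lag_s = lags_s[np.argmax(ccf)]  # positive: left lags behind right
```

In this sketch a positive peak lag means the sound reached the right channel first, and a negative ILD means the left channel is quieter. In a two-branch network like the one described, vectors such as `ccf` and per-band ILD values would each feed their own branch before fusion.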


IEEE Access ◽ 2019 ◽ Vol 7 ◽ pp. 169969-169978
Author(s): Abdullah Kucuk ◽ Anshuman Ganguly ◽ Yiya Hao ◽ Issa M. S. Panahi

2018
Author(s): Karthik Girija Ramesan ◽ Parth Suresh ◽ Prasanta Kumar Ghosh
