Robust voice activity detection based on LSTM recurrent neural networks and modulation spectrum

Author(s):  
Phuttapong Sertsi ◽  
Surasak Boonkla ◽  
Vataya Chunwijitra ◽  
Nattapong Kurpukdee ◽  
Chai Wutiwiwatchai
Author(s):  
Pablo Gimeno Jordán ◽  
Ignacio Viñals Bailo ◽  
Alfonso Ortega Giménez ◽  
Antonio Miguel Artiaga ◽  
Eduardo Lleida Solano

Voice Activity Detection (VAD) aims to distinguishcorrectly those audio segments containing humanspeech. In this paper we present our latest approachto the VAD task that relies on the modellingcapabilities of Bidirectional Long Short TermMemory (BLSTM) layers to classify every frame inan audio signal as speech or non-speech


2021 ◽  
Vol 175 ◽  
pp. 107832
Author(s):  
Joaquín García-Gómez ◽  
Roberto Gil-Pita ◽  
Miguel Aguilar-Ortega ◽  
Manuel Utrilla-Manso ◽  
Manuel Rosa-Zurera ◽  
...  

2021 ◽  
Author(s):  
Serban Mihalache ◽  
Ioan-Alexandru Ivanov ◽  
Dragos Burileanu

Sign in / Sign up

Export Citation Format

Share Document