speech separation
Recently Published Documents


TOTAL DOCUMENTS

724
(FIVE YEARS 234)

H-INDEX

36
(FIVE YEARS 7)

Author(s):  
Jiacheng Yao ◽  
Jing Zhang ◽  
Jiafeng Li ◽  
Li Zhuo

AbstractWith the sharp booming of online live streaming platforms, some anchors seek profits and accumulate popularity by mixing inappropriate content into live programs. After being blacklisted, these anchors even forged their identities to change the platform to continue live, causing great harm to the network environment. Therefore, we propose an anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit (GRU) for anchor identification of live platform. First, the speech of the anchor is extracted from the live streaming by using voice activation detection (VAD) and speech separation. Then, the feature sequence of anchor voiceprint is generated from the speech waveform with the self-attention network RawNet-SA. Finally, the feature sequence of anchor voiceprint is aggregated by GRU to transform into a deep voiceprint feature vector for anchor recognition. Experiments are conducted on the VoxCeleb, CN-Celeb, and MUSAN dataset, and the competitive results demonstrate that our method can effectively recognize the anchor voiceprint in video streaming.


Author(s):  
Beom Jun Woo ◽  
Hyung Yong Kim ◽  
Jeunghun Kim ◽  
Nam Soo Kim
Keyword(s):  

2021 ◽  
Author(s):  
Viet-Nhat Nguyen ◽  
Mostafa Sadeghi ◽  
Elisa Ricci ◽  
Xavier Alameda-Pineda

Author(s):  
Zhong-Qiu Wang ◽  
Gordon Wichern ◽  
Jonathan Le Roux

2021 ◽  
Author(s):  
Yanmin Zhu ◽  
Xiang Zheng ◽  
Xinrong Wu ◽  
Wanning Liu ◽  
Lei Pi ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document