speech extraction
Recently Published Documents


TOTAL DOCUMENTS: 65 (FIVE YEARS: 19)

H-INDEX: 6 (FIVE YEARS: 1)

2021
Author(s): Jiangyu Han, Wei Rao, Yannan Wang, Yanhua Long

2021
Author(s): Katerina Zmolikova, Marc Delcroix, Desh Raj, Shinji Watanabe, Jan Černocký

2021, Vol 263 (5), pp. 1095-1106
Author(s): Tsubasa Yoshizawa, Atsushi Yoshida, Kenta Iwai, Takanobu Nishiura

Recent studies have proposed methods for extracting speech from video of objects that vibrate in response to sound waves. Among these, methods that use video from rolling-shutter cameras, which are widely used in consumer digital single-lens reflex cameras, have attracted attention because of their low equipment cost. The conventional rolling-shutter method processes grayscale video using phase images. However, grayscale video has a smaller dynamic range than RGB video, which degrades the speech extraction accuracy of the conventional method. This paper therefore proposes a speech extraction method based on RGB-intensity gradients computed on RGB video. The proposed method extracts speech by calculating the similarity of the R, G, and B intensity gradients; using all three gradients expands the dynamic range. Experimental results on the quality and intelligibility of the extracted speech show that the proposed method outperforms the conventional method.


Author(s): Marc Delcroix, Katerina Zmolikova, Tsubasa Ochiai, Keisuke Kinoshita, Tomohiro Nakatani
