speech extraction
Recently Published Documents

TOTAL DOCUMENTS

65

(FIVE YEARS 19)

H-INDEX

6

(FIVE YEARS 1)

Latest Documents Most Cited Documents Contributed Authors Related Sources Related Keywords

Real-Time Binaural Target Speech Extraction Using Phase Unwrapping

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss.141.1077 ◽

2021 ◽

Vol 141 (10) ◽

pp. 1077-1086

Author(s):

Eiji Saito ◽

Arata Kawamura

Keyword(s):

Real Time ◽

Phase Unwrapping ◽

Speech Extraction

Download Full-text

TSEGAN:Target Speech Extraction Algorithm Based on Generative Adversarial Networks

10.1109/icicsp54369.2021.9611982 ◽

2021 ◽

Author(s):

Meijun Chen ◽

Xiang Zheng ◽

Xinrong Wu

Keyword(s):

Generative Adversarial Networks ◽

Adversarial Networks ◽

Extraction Algorithm ◽

Speech Extraction

Download Full-text

Improving Channel Decorrelation for Multi-Channel Target Speech Extraction

10.21437/interspeech.2021-298 ◽

2021 ◽

Author(s):

Jiangyu Han ◽

Wei Rao ◽

Yannan Wang ◽

Yanhua Long

Keyword(s):

Speech Extraction

Download Full-text

Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics

10.21437/interspeech.2021-986 ◽

2021 ◽

Author(s):

Katerina Zmolikova ◽

Marc Delcroix ◽

Desh Raj ◽

Shinji Watanabe ◽

Jan Černocký

Keyword(s):

Loss Function ◽

Weak Supervision ◽

Speech Extraction

Download Full-text

Speech extraction with RGB-intensity gradient on rolling-shutter video

INTER-NOISE and NOISE-CON Congress and Conference Proceedings ◽

10.3397/in-2021-1753 ◽

2021 ◽

Vol 263 (5) ◽

pp. 1095-1106

Author(s):

Tsubasa Yoshizawa ◽

Atsushi Yoshida ◽

Kenta Iwai ◽

Takanobu Nishiura

Keyword(s):

Conventional Method ◽

Extraction Method ◽

Dynamic Range ◽

Experimental Results ◽

Sound Waves ◽

Intensity Gradient ◽

Equipment Cost ◽

Single Lens ◽

Phase Images ◽

Speech Extraction

Recent studies have been proposed to extract speech from the captured video of objects vibrating by sound waves. Among them, from the viewpoint of equipment cost, the method of extracting speech from the video captured by rolling-shutter cameras, which are widely used in consumer digital single-lens reflex cameras, has been attracting attention. The conventional method with the rolling-shutter video uses a grayscale video for processing based on phase images. However, a grayscale video has a smaller dynamic range than an RGB video, and thus the speech extraction accuracy of the conventional method degrades. Therefore, this paper proposes a speech extraction method based on RGB-intensity gradients on an RGB video to improve speech extraction accuracy. The proposed method extracts the speech by calculating the similarity of R, G, and B intensity gradients, and using these three intensity gradients expands the dynamic range. The experimental results on the quality and intelligibility of the extracted speech show our proposed method outperforms the conventional method.

Download Full-text

Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414092 ◽

2021 ◽

Author(s):

Jisi Zhang ◽

Catalin Zorila ◽

Rama Doddipatla ◽

Jon Barker

Keyword(s):

Time Domain ◽

Spatial Information ◽

Speech Extraction

Download Full-text

Deficient Basis Estimation of Noise Spatial Covariance Matrix for Rank-Constrained Spatial Covariance Matrix Estimation Method in Blind Speech Extraction

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414479 ◽

2021 ◽

Author(s):

Yuto Kondo ◽

Yuki Kubo ◽

Norihiro Takamune ◽

Daichi Kitamura ◽

Hiroshi Saruwatari

Keyword(s):

Covariance Matrix ◽

Estimation Method ◽

Covariance Matrix Estimation ◽

Spatial Covariance ◽

Matrix Estimation ◽

Speech Extraction

Download Full-text

Multi-Channel Target Speech Extraction with Channel Decorrelation and Target Speaker Adaptation

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414244 ◽

2021 ◽

Author(s):

Jiangyu Han ◽

Xinyuan Zhou ◽

Yanhua Long ◽

Yijie Li

Keyword(s):

Speech Extraction ◽

Download Full-text

Speaker Activity Driven Neural Speech Extraction

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414998 ◽

2021 ◽

Author(s):

Marc Delcroix ◽

Katerina Zmolikova ◽

Tsubasa Ochiai ◽

Keisuke Kinoshita ◽

Tomohiro Nakatani

Keyword(s):

Speech Extraction

Download Full-text

Design and Implementation of Video Speech Extraction Text System Based on Deep Learning

Software Engineering and Applications ◽

10.12677/sea.2021.104057 ◽

2021 ◽

Vol 10 (04) ◽

pp. 528-541

Author(s):

煜颖谢

Keyword(s):

Deep Learning ◽

Design And Implementation ◽

Speech Extraction

Download Full-text