An evaluation of alaryngeal speech enhancement methods based on voice conversion techniques

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp.2011.5947513 ◽

2011 ◽

Author(s):

Hironori Doi ◽

Keigo Nakamura ◽

Tomoki Toda ◽

Hiroshi Saruwatari ◽

Kiyohiro Shikano

Keyword(s):

Speech Enhancement ◽

Voice Conversion ◽

Alaryngeal Speech

Increasing the Intelligibility and Naturalness of Alaryngeal Speech Using Voice Conversion and Synthetic Fundamental Frequency

10.21437/interspeech.2020-1196 ◽

2020 ◽

Author(s):

Tuan Dinh ◽

Alexander Kain ◽

Robin Samlan ◽

Beiming Cao ◽

Jun Wang

Keyword(s):

Fundamental Frequency ◽

Voice Conversion ◽

Alaryngeal Speech

An Improved Fully Convolutional Network Based on Post-Processing with Global Variance Equalization and Noise-Aware Training for Speech Enhancement

Journal of Advanced Computational Intelligence and Intelligent Informatics ◽

10.20965/jaciii.2021.p0130 ◽

2021 ◽

Vol 25 (1) ◽

pp. 130-137

Author(s):

Wenlong Li ◽

Kaoru Hirota ◽

Yaping Dai ◽

Zhiyang Jia

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Voice Conversion ◽

Post Processing ◽

Generalization Capability ◽

Convolutional Network ◽

Fully Convolutional Network ◽

Subjective Score ◽

An improved fully convolutional network with post-processing based on global variance (GV) equalization and noise-aware training (PN-FCN) is proposed for speech enhancement. It aims to reduce the complexity of the speech enhancement system and to address two problems: over-smoothed speech spectrograms and poor generalization capability. The PN-FCN is fed noisy speech samples augmented with an estimate of the noise, so it can use additional online noise information to better predict the clean speech. In addition, the PN-FCN uses global variance information, which improves the subjective score in a voice conversion task. Finally, the proposed framework adopts an FCN whose number of parameters is one-seventh that of a deep neural network (DNN). Experimental results on the Valentini-Botinhao dataset demonstrate that the proposed framework improves both the denoising effect and the model training speed.
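
The two ideas named in this abstract, noise-aware input augmentation and global variance equalization, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the assumption that the first few frames of an utterance contain only noise, and the use of a single scalar variance over all frames and bins are illustrative choices made here.

```python
import numpy as np

def noise_aware_input(noisy_feats, n_init_frames=6):
    """Append a per-utterance noise estimate to every frame.

    noisy_feats: array of shape (frames, bins), e.g. log-power spectra.
    Assumption (not from the abstract): the first n_init_frames frames
    are noise-only, so their mean serves as the noise estimate.
    """
    noise_est = noisy_feats[:n_init_frames].mean(axis=0)
    tiled = np.tile(noise_est, (noisy_feats.shape[0], 1))
    # Each frame now carries its own features plus the noise estimate.
    return np.concatenate([noisy_feats, tiled], axis=1)

def gv_equalize(enhanced, ref_gv):
    """Rescale enhanced features so their global variance equals ref_gv.

    ref_gv is a scalar reference variance, assumed to be measured on
    clean training speech. Scaling around the mean widens the
    over-smoothed output without shifting its average level.
    """
    mean = enhanced.mean()
    est_gv = max(enhanced.var(), 1e-12)  # guard against division by zero
    scale = np.sqrt(ref_gv / est_gv)
    return mean + scale * (enhanced - mean)
```

In this sketch the augmented array would be the network input, and `gv_equalize` would be applied to the network output as a post-processing step.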

Alaryngeal Speech Enhancement Using Pattern Recognition Techniques

IEICE Transactions on Information and Systems ◽

10.1093/ietisy/e88-d.7.1618 ◽

2005 ◽

Vol E88-D (7) ◽

pp. 1618-1622 ◽

Author(s):

G. AGUILAR

Keyword(s):

Pattern Recognition ◽

Speech Enhancement ◽

Pattern Recognition Techniques ◽

Alaryngeal Speech

Speech conversion and its application to alaryngeal speech enhancement

Proceedings of Third International Conference on Signal Processing (ICSP'96) ◽

10.1109/icsigp.1996.571190 ◽

2002 ◽

Author(s):

Ning Bi ◽

Yingyong Qi

Keyword(s):

Speech Enhancement ◽

Alaryngeal Speech

Female alaryngeal speech enhancement for improved speaker identification using linear predictive synthesis

The Journal of the Acoustical Society of America ◽

10.1121/1.411689 ◽

1995 ◽

Vol 97 (5) ◽

pp. 3245-3245

Author(s):

Renetta G. Tull ◽

Janet C. Rutledge ◽

Jerry J. Mahler

Keyword(s):

Speech Enhancement ◽

Speaker Identification ◽

Alaryngeal Speech

A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion

10.21437/interspeech.2013-670 ◽

2013 ◽

Author(s):

Takuto Moriguchi ◽

Tomoki Toda ◽

Motoaki Sano ◽

Hiroshi Sato ◽

Graham Neubig ◽

...

Keyword(s):

Real Time ◽

Speech Enhancement ◽

Digital Signal Processor ◽

Digital Signal ◽

Voice Conversion ◽

Signal Processor

Electrolaryngeal speech enhancement based on statistical voice conversion

10.21437/interspeech.2009-439 ◽

2009 ◽

Author(s):

Keigo Nakamura ◽

Tomoki Toda ◽

Hiroshi Saruwatari ◽

Kiyohiro Shikano

Keyword(s):

Speech Enhancement ◽

Voice Conversion

Esophageal Speech Enhancement Based on Statistical Voice Conversion with Gaussian Mixture Models

IEICE Transactions on Information and Systems ◽

10.1587/transinf.e93.d.2472 ◽

2010 ◽

Vol E93-D (9) ◽

pp. 2472-2482 ◽

Author(s):

Hironori DOI ◽

Keigo NAKAMURA ◽

Tomoki TODA ◽

Hiroshi SARUWATARI ◽

Kiyohiro SHIKANO

Keyword(s):

Mixture Models ◽

Speech Enhancement ◽

Gaussian Mixture Models ◽

Gaussian Mixture ◽

Voice Conversion ◽

Esophageal Speech

Application of speech conversion to alaryngeal speech enhancement

IEEE Transactions on Speech and Audio Processing ◽

10.1109/89.554771 ◽

1997 ◽

Vol 5 (2) ◽

pp. 97-105 ◽

Author(s):

Ning Bi ◽

Yingyong Qi

Keyword(s):

Speech Enhancement ◽

Alaryngeal Speech

Alaryngeal Speech Enhancement Based on One-to-Many Eigenvoice Conversion

IEEE/ACM Transactions on Audio Speech and Language Processing ◽

10.1109/taslp.2013.2286917 ◽

2014 ◽

Vol 22 (1) ◽

pp. 172-183 ◽

Author(s):

Hironori Doi ◽

Tomoki Toda ◽

Keigo Nakamura ◽

Hiroshi Saruwatari ◽

Kiyohiro Shikano

Keyword(s):

Speech Enhancement ◽

Alaryngeal Speech
