Robot Audition: Missing Feature Theory Approach and Active Audition

Improvement of robot audition by interfacing sound source separation and automatic speech recognition with Missing Feature Theory

IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004 ◽

10.1109/robot.2004.1308039 ◽

2004 ◽

Cited By ~ 22

Author(s):

S. Yamamoto ◽

K. Nakadai ◽

H. Tsujino ◽

T. Yokoyama ◽

H.G. Okuno

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Sound Source ◽

Source Separation ◽

Sound Source Separation ◽

Robot Audition ◽

Missing Feature Theory ◽

Missing Feature ◽

Feature Theory

Barge-in-able robot audition based on ICA and missing feature theory under semi-blind situation

2008 IEEE/RSJ International Conference on Intelligent Robots and Systems ◽

10.1109/iros.2008.4650799 ◽

2008 ◽

Cited By ~ 6

Author(s):

R. Takeda ◽

K. Nakadai ◽

K. Komatani ◽

T. Ogata ◽

H.G. Okuno

Keyword(s):

Robot Audition ◽

Missing Feature Theory ◽

Missing Feature ◽

Feature Theory

Soft missing-feature mask generation for Robot Audition

Paladyn Journal of Behavioral Robotics ◽

10.2478/s13230-010-0005-1 ◽

2010 ◽

Vol 1 (1) ◽

Author(s):

Toru Takahashi ◽

Kazuhiro Nakadai ◽

Kazunori Komatani ◽

Tetsuya Ogata ◽

Hiroshi G. Okuno

Keyword(s):

Human Robot Interaction ◽

Sigmoid Function ◽

Robot Interaction ◽

Time Frequency ◽

Sound Source Separation ◽

Robot Audition ◽

Frequency Components ◽

Interaction Task ◽

Missing Feature Theory ◽

Missing Feature

Abstract: This paper describes an improvement in automatic speech recognition (ASR) for robot audition by introducing Missing Feature Theory (MFT) based on soft missing feature masks (MFM) to realize natural human-robot interaction. In an everyday environment, a robot’s microphones capture various sounds besides the user’s utterances. Although sound-source separation is an effective way to enhance the user’s utterances, it inevitably produces errors due to reflection and reverberation. MFT is able to cope with these errors. First, MFMs are generated based on the reliability of time-frequency components. Then ASR weighs the time-frequency components according to the MFMs. We propose a new method to automatically generate soft MFMs, consisting of continuous values from 0 to 1 based on a sigmoid function. The proposed MFM generation was implemented for HRP-2 using HARK, our open-sourced robot audition software. Preliminary results show that the soft MFM outperformed a hard (binary) MFM in recognizing three simultaneous utterances. In a human-robot interaction task, the interval limitations between two adjacent loudspeakers were reduced from 60 degrees to 30 degrees by using soft MFMs.
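
The difference between a hard and a soft mask described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation or the HARK API: the function names, the `slope` and `threshold` parameters, the reliability scores, and the mask-weighted log-likelihood are all assumptions made for illustration. The only detail taken from the abstract is that a soft mask is a continuous value between 0 and 1 produced by a sigmoid of a reliability measure, while a hard mask is binary.

```python
import math


def soft_mask(reliability, slope=10.0, threshold=0.5):
    """Soft mask: sigmoid of a reliability score, giving a value in (0, 1).

    `slope` and `threshold` are hypothetical tuning parameters.
    """
    return 1.0 / (1.0 + math.exp(-slope * (reliability - threshold)))


def hard_mask(reliability, threshold=0.5):
    """Hard (binary) mask: 1.0 if the component is judged reliable, else 0.0."""
    return 1.0 if reliability > threshold else 0.0


def masked_log_likelihood(per_feature_loglik, mask):
    """Weigh each feature's log-likelihood by its mask value (assumed scheme)."""
    return sum(m * ll for m, ll in zip(mask, per_feature_loglik))


if __name__ == "__main__":
    # Made-up reliability scores for four time-frequency components.
    reliabilities = [0.05, 0.45, 0.55, 0.95]
    print([hard_mask(r) for r in reliabilities])
    print([round(soft_mask(r), 3) for r in reliabilities])
```

With a hard mask, the components scored 0.45 and 0.55 are treated as fully unreliable and fully reliable respectively; with the sigmoid, both receive intermediate weights, so a small error in the reliability estimate changes the weighting only slightly.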

Speaker verification in noisy environments with combined spectral subtraction and missing feature theory

Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181) ◽

10.1109/icassp.1998.674382 ◽

2002 ◽

Cited By ~ 16

Author(s):

A. Drygajlo ◽

M. El-Maliki

Keyword(s):

Speaker Verification ◽

Spectral Subtraction ◽

Noisy Environments ◽

Missing Feature Theory ◽

Missing Feature ◽

Feature Theory

Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model

2009 IEEE/RSJ International Conference on Intelligent Robots and Systems ◽

10.1109/iros.2009.5354201 ◽

2009 ◽

Cited By ~ 4

Author(s):

Toru Takahashi ◽

Kazuhiro Nakadai ◽

Kazunori Komatani ◽

Tetsuya Ogata ◽

Hiroshi G. Okuno

Keyword(s):

Speech Recognition ◽

Recognition System ◽

Speech Recognition System ◽

Acoustic Model ◽

Missing Feature Theory ◽

Missing Feature ◽

Feature Theory

Missing Feature Theory based Interface Between Sound Source Separation and Automatic Speech Recognition and Applying to Multiple Robots

Journal of the Robotics Society of Japan ◽

10.7210/jrsj.23.743 ◽

2005 ◽

Vol 23 (6) ◽

pp. 743-751

Author(s):

Shunichi Yamamoto ◽

Kazuhiro Nakadai ◽

Hiroshi Tsujino ◽

Hiroshi G. Okuno

Keyword(s):

Speech Recognition ◽

Automatic Speech Recognition ◽

Sound Source ◽

Source Separation ◽

Multiple Robots ◽

Sound Source Separation ◽

Missing Feature Theory ◽

Missing Feature ◽

Feature Theory

Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory

Proceedings of the 2005 IEEE International Conference on Robotics and Automation ◽

10.1109/robot.2005.1570323 ◽

2006 ◽

Cited By ~ 35

Author(s):

S. Yamamoto ◽

J.-M. Valin ◽

K. Nakadai ◽

J. Rouat ◽

F. Michaud ◽

...

Keyword(s):

Speech Recognition ◽

Microphone Array ◽

Source Separation ◽

Missing Feature Theory ◽

Missing Feature ◽

Feature Theory

Hard-mask missing feature theory for robust speaker recognition

IEEE Transactions on Consumer Electronics ◽

10.1109/tce.2011.6018880 ◽

2011 ◽

Vol 57 (3) ◽

pp. 1245-1250 ◽

Cited By ~ 2

Author(s):

Shin-cheol Lim ◽

Sei-jin Jang ◽

Soek-pil Lee ◽

Moo Kim

Keyword(s):

Speaker Recognition ◽

Hard Mask ◽

Robust Speaker Recognition ◽

Missing Feature Theory ◽

Missing Feature ◽

Feature Theory

Advanced missing feature theory with fast score calculation for noise robust speaker identification

Electronics Letters ◽

10.1049/el.2010.0368 ◽

2010 ◽

Vol 46 (14) ◽

pp. 1027 ◽

Cited By ~ 2

Author(s):

J. Jung ◽

K. Kim ◽

M.Y. Kim

Keyword(s):

Speaker Identification ◽

Robust Speaker Identification ◽

Noise Robust ◽

Missing Feature Theory ◽

Missing Feature ◽

Feature Theory

Robust speech recognition using missing feature theory and target speech enhancement based on degenerate unmixing and estimation technique

10.1117/12.883340 ◽

2011 ◽

Cited By ~ 2

Author(s):

Minook Kim ◽

Ji-Seon Kim ◽

Hyung-Min Park

Keyword(s):

Speech Recognition ◽

Speech Enhancement ◽

Robust Speech Recognition ◽

Estimation Technique ◽

Missing Feature Theory ◽

Missing Feature ◽

Feature Theory
