The Use of Word, Phrase and Intent Accuracy as Measures of Connected Speech Recognition Performance

Author(s):  
Tim Barry ◽  
Tom Solz ◽  
John Reising ◽  
Dave Williamson

Eleven subjects participated in a study designed to test the accuracy of a newer-generation connected speech recognition system using a 49-word vocabulary likely to be used in an aircraft cockpit environment. The 49 vocabulary words were used to create 392 test phrases. These phrases were divided into three groups: Complex phrases, which contain more than five words, and two groups of Simple phrases, which contain five words or fewer. The Simple phrases were divided into Simple Alternate and Simple No-Alternate phrases, depending on whether or not a phrase was the only one in the entire vocabulary capable of carrying out a particular action once recognized. Performance of the recognition system was measured with three accuracy statistics: word accuracy, the most commonly reported statistic in speech recognition research; phrase accuracy, which is gaining popularity in connected speech recognition research; and intent accuracy, which is arguably the most relevant statistic for research of this type. Significantly different word, phrase, and intent accuracy results were obtained for the three phrase types.
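
The three statistics can be illustrated with a short sketch. This is not from the paper; the helper names, the test pairs, and the phrase-to-action map are invented for illustration. Word accuracy is derived from word-level edit distance, phrase accuracy is the exact-match rate, and intent accuracy additionally counts a misrecognized phrase as correct when the recognized phrase would trigger the same action (which is how Simple Alternate phrases can score higher on intent than on phrase accuracy).

```python
def word_errors(ref, hyp):
    """Word-level Levenshtein distance between reference and hypothesis."""
    r, h = ref.split(), hyp.split()
    d = [[0] * (len(h) + 1) for _ in range(len(r) + 1)]
    for i in range(len(r) + 1):
        d[i][0] = i
    for j in range(len(h) + 1):
        d[0][j] = j
    for i in range(1, len(r) + 1):
        for j in range(1, len(h) + 1):
            cost = 0 if r[i - 1] == h[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,        # deletion
                          d[i][j - 1] + 1,        # insertion
                          d[i - 1][j - 1] + cost)  # substitution
    return d[len(r)][len(h)]

def word_accuracy(pairs):
    """1 - (total word errors / total reference words) over (ref, hyp) pairs."""
    total = sum(len(ref.split()) for ref, _ in pairs)
    errors = sum(word_errors(ref, hyp) for ref, hyp in pairs)
    return 1.0 - errors / total

def phrase_accuracy(pairs):
    """Fraction of phrases recognized exactly."""
    return sum(ref == hyp for ref, hyp in pairs) / len(pairs)

def intent_accuracy(pairs, action):
    """Fraction of phrases whose recognized form maps to the intended action."""
    return sum(action.get(ref) == action.get(hyp) for ref, hyp in pairs) / len(pairs)
```

For example, if "tune channel two" is recognized as the alternate phrase "tune channel to" and both map to the same action, the pair counts against word and phrase accuracy but not against intent accuracy.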

2011 ◽  
Vol 268-270 ◽  
pp. 82-87
Author(s):  
Zhi Peng Zhao ◽  
Yi Gang Cen ◽  
Xiao Fang Chen

In this paper, we propose a new noisy-speech recognition method based on compressive sensing theory. By applying compressive sensing, our method greatly increases the noise robustness of the speech recognition system, which in turn improves recognition accuracy. In our experiments, the proposed method achieved better recognition performance than a traditional isolated-word recognition method based on the DTW algorithm.
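
The DTW baseline the paper compares against can be sketched as follows. This is a minimal, illustrative implementation of classic dynamic time warping for isolated-word template matching; the function names are invented, and the 1-D feature sequences are a simplification (real systems match multi-dimensional frames such as MFCCs).

```python
def dtw_distance(a, b):
    """DTW alignment cost between two 1-D feature sequences."""
    inf = float("inf")
    n, m = len(a), len(b)
    d = [[inf] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three admissible warping steps.
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[n][m]

def recognize(utterance, templates):
    """Return the vocabulary word whose template has the lowest DTW cost."""
    return min(templates, key=lambda w: dtw_distance(utterance, templates[w]))
```

Because DTW compares the noisy test utterance directly against clean templates, additive noise distorts every local cost; the paper's compressive-sensing front end aims to reduce that distortion before matching.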


2011 ◽  
Vol 1 (1) ◽  
pp. 9-13
Author(s):  
Pavithra M ◽  
Chinnasamy G ◽  
Azha Periasamy

A speech recognition system requires a combination of techniques and algorithms, each of which performs a specific task toward the main goal of the system. Speech recognition performance can be enhanced by selecting a proper acoustic model. In this work, feature extraction and matching are performed by SKPCA with an unsupervised learning algorithm and maximum probability. SKPCA reduces the dimensionality of the model data. It represents a sparse solution to KPCA, because the original data can be reduced according to the weights; i.e., the weights identify the vectors that most influence the maximization. The unsupervised learning algorithm is used to find a suitable representation of the labels, and maximum probability is used to maximize the normalized acoustic likelihood of the most likely state sequences of the training data. The experimental results show the efficiency of the SKPCA technique: the proposed approach with maximum probability yields strong performance in the speech recognition system.


2020 ◽  
Vol 24 ◽  
pp. 233121652093892
Author(s):  
Marc R. Schädler ◽  
David Hülsmeier ◽  
Anna Warzybok ◽  
Birger Kollmeier

The benefit in speech-recognition performance due to the compensation of a hearing loss can vary between listeners, even if unaided performance and hearing thresholds are similar. To accurately predict the individual performance benefit due to a specific hearing device, a prediction model is proposed which takes into account hearing thresholds and a frequency-dependent suprathreshold component of impaired hearing. To test the model, the German matrix sentence test was performed in unaided and individually aided conditions in quiet and in noise by 18 listeners with different degrees of hearing loss. The outcomes were predicted by an individualized automatic speech-recognition system where the individualization parameter for the suprathreshold component of hearing loss was inferred from tone-in-noise detection thresholds. The suprathreshold component was implemented as a frequency-dependent multiplicative noise (mimicking level uncertainty) in the feature-extraction stage of the automatic speech-recognition system. Its inclusion improved the root-mean-square prediction error of individual speech-recognition thresholds (SRTs) from 6.3 dB to 4.2 dB and of individual benefits in SRT due to common compensation strategies from 5.1 dB to 3.4 dB. The outcome predictions are highly correlated with both the corresponding observed SRTs (R² = .94) and the benefits in SRT (R² = .89) and hence might help to better understand, and eventually mitigate, the perceptual consequences of as yet unexplained hearing problems, also discussed in the context of hidden hearing loss.
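
Multiplicative noise on linear-domain features is equivalent to additive noise on log (dB) features, so the level-uncertainty idea can be sketched very simply. This is an illustrative sketch only: the abstract does not give the model's internals, and the parameter name `uncertainty_db` and the per-channel dB feature layout are assumptions.

```python
import random

def apply_level_uncertainty(features, uncertainty_db, rng=None):
    """Perturb log-spectral features with per-channel Gaussian noise.

    features: list of frames, each a list of per-channel log-magnitude (dB) values.
    uncertainty_db: noise standard deviation in dB for each frequency channel
        (larger values mimic a larger suprathreshold deficit in that channel).
    """
    rng = rng or random.Random(0)  # seeded for reproducible sketches
    return [
        [x + rng.gauss(0.0, s) for x, s in zip(frame, uncertainty_db)]
        for frame in features
    ]
```

Feeding such degraded features to a recognizer lowers its scores in a frequency-specific way, which is how a single inferred parameter per listener can shift the predicted SRT.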


2014 ◽  
Vol 623 ◽  
pp. 267-273
Author(s):  
Xin Fei Liu ◽  
Hui Zhou

This paper describes a Chinese small-vocabulary offline speech recognition system based on PocketSphinx, in which the acoustic models are regenerated by adapting the existing Sphinx models and the language model is generated with the online LMTool. An offline speech recognition system that runs on Android smartphones was then built in an Android development environment under Linux. The experimental results show that the system, used to recognize voice commands on a mobile phone, achieves good recognition performance.

