An articulation index based procedure for predicting the speech recognition           performance of hearing‐impaired individuals

Chaslav V. Pavlovic; Gerald A. Studebaker; Robert L. Sherbecoe

doi:10.1121/1.394082

Recognition of Synthetic Speech by Hearing-Impaired Elderly Listeners

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3405.1180 ◽

1991 ◽

Vol 34 (5) ◽

pp. 1180-1184 ◽

Cited By ~ 27

Author(s):

Larry E. Humes ◽

Kathleen J. Nelson ◽

David B. Pisoni

Keyword(s):

Hearing Loss ◽

Speech Recognition ◽

Recognition Performance ◽

The Elderly ◽

Hearing Impaired ◽

The Other ◽

Natural Speech ◽

Synthetic Speech ◽

Closed Set ◽

Open Set

The Modified Rhyme Test (MRT), recorded using natural speech and two forms of synthetic speech, DECtalk and Votrax, was used to measure both open-set and closed-set speech-recognition performance. Performance of hearing-impaired elderly listeners was compared to two groups of young normal-hearing adults, one listening in quiet, and the other listening in a background of spectrally shaped noise designed to simulate the peripheral hearing loss of the elderly. Votrax synthetic speech yielded significant decrements in speech recognition compared to either natural or DECtalk synthetic speech for all three subject groups. There were no differences in performance between natural speech and DECtalk speech for the elderly hearing-impaired listeners or the young listeners with simulated hearing loss. The normal-hearing young adults listening in quiet out-performed both of the other groups, but there were no differences in performance between the young listeners with simulated hearing loss and the elderly hearing-impaired listeners. When the closed-set identification of synthetic speech was compared to its open-set recognition, the hearing-impaired elderly gained as much from the reduction in stimulus/response uncertainty as the two younger groups. Finally, among the elderly hearing-impaired listeners, speech-recognition performance was correlated negatively with hearing sensitivity, but scores were correlated positively among the different talker conditions. Those listeners with the greatest hearing loss had the most difficulty understanding speech and those having the most trouble understanding natural speech also had the greatest difficulty with synthetic speech.

Download Full-text

Some Effects of Training on Speech Recognition by Hearing-Impaired Adults

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.2402.207 ◽

1981 ◽

Vol 24 (2) ◽

pp. 207-216 ◽

Cited By ~ 71

Author(s):

Brian E. Walden ◽

Sue A. Erdman ◽

Allen A. Montgomery ◽

Daniel M. Schwartz ◽

Robert A. Prosek

Keyword(s):

Speech Recognition ◽

Recognition Test ◽

Recognition Performance ◽

Rehabilitation Program ◽

Hearing Impaired ◽

Aural Rehabilitation ◽

Sentence Recognition ◽

Consonant Recognition ◽

The Individual

The purpose of this research was to determine some of the effects of consonant recognition training on the speech recognition performance of hearing-impaired adults. Two groups of ten subjects each received seven hours of either auditory or visual consonant recognition training, in addition to a standard two-week, group-oriented, inpatient aural rehabilitation program. A third group of fifteen subjects received the standard two-week program, but no supplementary individual consonant recognition training. An audiovisual sentence recognition test, as well as tests of auditory and visual consonant recognition, were administered both before and ibltowing training. Subjects in all three groups significantly increased in their audiovisual sentence recognition performance, but subjects receiving the individual consonant recognition training improved significantly more than subjects receiving only the standard two-week program. A significant increase in consonant recognition performance was observed in the two groups receiving the auditory or visual consonant recognition training. The data are discussed from varying statistical and clinical perspectives.

Download Full-text

Speech recognition and the Articulation Index for normal and hearing‐ impaired listeners

The Journal of the Acoustical Society of America ◽

10.1121/1.392269 ◽

1985 ◽

Vol 77 (1) ◽

pp. 281-288 ◽

Cited By ~ 43

Author(s):

Candace A. Kamm ◽

Donald D. Dirks ◽

Theodore S. Bell

Keyword(s):

Speech Recognition ◽

Hearing Impaired ◽

Articulation Index

Download Full-text

Application of the Articulation Index and the Speech Transmission Index to the Recognition of Speech by Normal-Hearing and Hearing-Impaired Listeners

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.2904.447 ◽

1986 ◽

Vol 29 (4) ◽

pp. 447-462 ◽

Cited By ~ 51

Author(s):

Larry E. Humes ◽

Donald D. Dirks ◽

Theodore S. Bell ◽

Christopher Ahlstbom ◽

Gail E. Kincaid

Keyword(s):

Recognition Performance ◽

Normal Hearing ◽

Hearing Impaired ◽

Hybrid Index ◽

The Third ◽

Transmission Index ◽

Speech Transmission ◽

Articulation Index ◽

Reverberant Speech ◽

Prospective Investigation

The present article is divided into four major sections dealing with the application of acoustical indices to the prediction of speech recognition performance. In the first section, two acoustical indices, the Articulation Index (AI) and the Speech Transmission Index (STI), are described. In the next section, the effectiveness of the AI and the STI in describing the performance of normal-hearing and hearing-impaired subjects listening to spectrally distorted (filtered) and temporally distorted (reverberant) speech is examined retrospectively. In the third section, the results of a prospective investigation that examined the recognition of nonsense syllables under conditions of babble competition, filtering and reverberation are described. Finally, in the fourth section, the ability of the acoustical indices to describe the performance of 10 hearing-impaired listeners, 5 listening in quiet and 5 in babble, is examined. It is concluded that both the AI and the STI have significant shortcomings. A hybrid index, designated mSTI, which takes the best features from each procedure, is described and demonstrated to be the best alternative presently available.

Download Full-text

Speech-Recognition Difficulties of the Hearing-Impaired Elderly

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3304.726 ◽

1990 ◽

Vol 33 (4) ◽

pp. 726-735 ◽

Cited By ~ 108

Author(s):

Larry E. Humes ◽

Lisa Roberts

Keyword(s):

Hearing Loss ◽

Speech Recognition ◽

Sensorineural Hearing Loss ◽

Recognition Performance ◽

The Elderly ◽

Normal Hearing ◽

Hearing Impaired ◽

Sensorineural Hearing ◽

Elderly Subjects ◽

Threshold Elevation

The role that sensorineural hearing loss plays in the speech-recognition difficulties of the hearing-impaired elderly is examined. One approach to this issue was to make between-group comparisons of performance for three groups of subjects: (a) young normal-hearing adults; (b) elderly hearing-impaired adults; and (c) young normal-hearing adults with simulated sensorineural hearing loss equivalent to that of the elderly subjects produced by a spectrally shaped masking noise. Another approach to this issue employed correlational analyses to examine the relation between audibility and speech recognition within the group of elderly hearing-impaired subjects. An additional approach was pursued in which an acoustical index incorporating adjustments for threshold elevation was used to examine the role audibility played in the speech-recognition performance of the hearing-impaired elderly. A wide range of listening conditions was sampled in this experiment. The conclusion was that the primary determiner of speech-recognition performance in the elderly hearing-impaired subjects was their threshold elevation.

Download Full-text

Broad binaural fusion impairs segregation of speech based on voice pitch differences in a ‘cocktail party’ environment

10.1101/805309 ◽

2019 ◽

Cited By ~ 1

Author(s):

Yonghee Oh ◽

Curtis L. Hartling ◽

Nirmal Kumar Srinivasan ◽

Morgan Eddolls ◽

Anna C. Diedesch ◽

...

Keyword(s):

Speech Recognition ◽

Hearing Aids ◽

Recognition Performance ◽

Spatial Separation ◽

Hearing Impaired ◽

Cocktail Party ◽

Voice Pitch ◽

Strong Negative Correlation ◽

Improved Performance ◽

Central Auditory Neurons

ABSTRACTIn the normal auditory system, central auditory neurons are sharply tuned to the same frequency ranges for each ear. This precise tuning is mirrored behaviorally as the binaural fusion of tones evoking similar pitches across ears. In contrast, hearing-impaired listeners exhibit abnormally broad tuning of binaural pitch fusion, fusing sounds with pitches differing by up to 3-4 octaves across ears into a single object. Here we present evidence that such broad fusion may similarly impair the segregation and recognition of speech based on voice pitch differences in a ‘cocktail party’ environment. Speech recognition performance in a multi-talker environment was measured in four groups of adult subjects: normal-hearing (NH) listeners and hearing-impaired listeners with bilateral hearing aids (HAs), bimodal cochlear implant (CI) worn with a contralateral HA, or bilateral CIs. Performance was measured as the threshold target-to-masker ratio needed to understand a target talker in the presence of masker talkers either co-located or symmetrically spatially separated from the target. Binaural pitch fusion was also measured. Voice pitch differences between target and masker talkers improved speech recognition performance for the NH, bilateral HA, and bimodal CI groups, but not the bilateral CI group. Spatial separation only improved performance for the NH group, indicating an inability of the hearing-impaired groups to benefit from spatial release from masking. A moderate to strong negative correlation was observed between the benefit from voice pitch differences and the breadth of binaural pitch fusion in all groups except the bilateral CI group in the co-located spatial condition. Hence, tuning of binaural pitch fusion predicts the ability to segregate voices based on pitch when acoustic cues are available. The findings suggest that obligatory binaural fusion, with a concomitant loss of information from individual streams, may occur at a level of processing before auditory object formation and segregation.

Download Full-text

Identification of the Spectrotemporal Modulations That Support Speech Intelligibility in Hearing-Impaired and Normal-Hearing Listeners

Journal of Speech Language and Hearing Research ◽

10.1044/2018_jslhr-h-18-0045 ◽

2019 ◽

Vol 62 (4) ◽

pp. 1051-1067 ◽

Cited By ~ 5

Author(s):

Jonathan H. Venezia ◽

Allison-Graham Martin ◽

Gregory Hickok ◽

Virginia M. Richards

Keyword(s):

Hearing Loss ◽

Speech Recognition ◽

Speech Intelligibility ◽

Large Scale ◽

Recognition Performance ◽

Recognition Task ◽

Small Region ◽

Normal Hearing ◽

Hearing Impaired ◽

Age Related

Purpose Age-related sensorineural hearing loss can dramatically affect speech recognition performance due to reduced audibility and suprathreshold distortion of spectrotemporal information. Normal aging produces changes within the central auditory system that impose further distortions. The goal of this study was to characterize the effects of aging and hearing loss on perceptual representations of speech. Method We asked whether speech intelligibility is supported by different patterns of spectrotemporal modulations (STMs) in older listeners compared to young normal-hearing listeners. We recruited 3 groups of participants: 20 older hearing-impaired (OHI) listeners, 19 age-matched normal-hearing listeners, and 10 young normal-hearing (YNH) listeners. Listeners performed a speech recognition task in which randomly selected regions of the speech STM spectrum were revealed from trial to trial. The overall amount of STM information was varied using an up–down staircase to hold performance at 50% correct. Ordinal regression was used to estimate weights showing which regions of the STM spectrum were associated with good performance (a “classification image” or CImg). Results The results indicated that (a) large-scale CImg patterns did not differ between the 3 groups; (b) weights in a small region of the CImg decreased systematically as hearing loss increased; (c) CImgs were also nonsystematically distorted in OHI listeners, and the magnitude of this distortion predicted speech recognition performance even after accounting for audibility; and (d) YNH listeners performed better overall than the older groups. Conclusion We conclude that OHI/older normal-hearing listeners rely on the same speech STMs as YNH listeners but encode this information less efficiently. Supplemental Material https://doi.org/10.23641/asha.7859981

Download Full-text

Extended High Frequencies Provide Both Spectral and Temporal Information to Improve Speech-in-Speech Recognition

Trends in Hearing ◽

10.1177/2331216520980299 ◽

2020 ◽

Vol 24 ◽

pp. 233121652098029

Author(s):

Allison Trine ◽

Brian B. Monson

Keyword(s):

Speech Recognition ◽

Pure Tone ◽

Recognition Performance ◽

Low Frequency ◽

Exploratory Analysis ◽

Temporal Information ◽

Significant Benefit ◽

Head Orientation ◽

Spectral Structure ◽

High Frequencies

Several studies have demonstrated that extended high frequencies (EHFs; >8 kHz) in speech are not only audible but also have some utility for speech recognition, including for speech-in-speech recognition when maskers are facing away from the listener. However, the contribution of EHF spectral versus temporal information to speech recognition is unknown. Here, we show that access to EHF temporal information improved speech-in-speech recognition relative to speech bandlimited at 8 kHz but that additional access to EHF spectral detail provided an additional small but significant benefit. Results suggest that both EHF spectral structure and the temporal envelope contribute to the observed EHF benefit. Speech recognition performance was quite sensitive to masker head orientation, with a rotation of only 15° providing a highly significant benefit. An exploratory analysis indicated that pure-tone thresholds at EHFs are better predictors of speech recognition performance than low-frequency pure-tone thresholds.

Download Full-text

Working Memory Capacity May Influence Perceived Effort during Aided Speech Recognition in Noise

Journal of the American Academy of Audiology ◽

10.3766/jaaa.23.7.7 ◽

2012 ◽

Vol 23 (08) ◽

pp. 577-589 ◽

Cited By ~ 73

Author(s):

Mary Rudner ◽

Thomas Lunner ◽

Thomas Behrens ◽

Elisabet Sundewall Thorén ◽

Jerker Rönnberg

Keyword(s):

Working Memory ◽

Hearing Loss ◽

Speech Recognition ◽

Hearing Aid ◽

Recognition Performance ◽

Cognitive Capacity ◽

Subjective Ratings ◽

Perceived Effort ◽

Noise Type ◽

Speech Recognition In Noise

Background: Recently there has been interest in using subjective ratings as a measure of perceived effort during speech recognition in noise. Perceived effort may be an indicator of cognitive load. Thus, subjective effort ratings during speech recognition in noise may covary both with signal-to-noise ratio (SNR) and individual cognitive capacity. Purpose: The present study investigated the relation between subjective ratings of the effort involved in listening to speech in noise, speech recognition performance, and individual working memory (WM) capacity in hearing impaired hearing aid users. Research Design: In two experiments, participants with hearing loss rated perceived effort during aided speech perception in noise. Noise type and SNR were manipulated in both experiments, and in the second experiment hearing aid compression release settings were also manipulated. Speech recognition performance was measured along with WM capacity. Study Sample: There were 46 participants in all with bilateral mild to moderate sloping hearing loss. In Experiment 1 there were 16 native Danish speakers (eight women and eight men) with a mean age of 63.5 yr (SD = 12.1) and average pure tone (PT) threshold of 47. 6 dB (SD = 9.8). In Experiment 2 there were 30 native Swedish speakers (19 women and 11 men) with a mean age of 70 yr (SD = 7.8) and average PT threshold of 45.8 dB (SD = 6.6). Data Collection and Analysis: A visual analog scale (VAS) was used for effort rating in both experiments. In Experiment 1, effort was rated at individually adapted SNRs while in Experiment 2 it was rated at fixed SNRs. Speech recognition in noise performance was measured using adaptive procedures in both experiments with Dantale II sentences in Experiment 1 and Hagerman sentences in Experiment 2. WM capacity was measured using a letter-monitoring task in Experiment 1 and the reading span task in Experiment 2. Results: In both experiments, there was a strong and significant relation between rated effort and SNR that was independent of individual WM capacity, whereas the relation between rated effort and noise type seemed to be influenced by individual WM capacity. Experiment 2 showed that hearing aid compression setting influenced rated effort. Conclusions: Subjective ratings of the effort involved in speech recognition in noise reflect SNRs, and individual cognitive capacity seems to influence relative rating of noise type.

Download Full-text

Speech Recognition Performance at Loudness Discomfort Level

Scandinavian Audiology ◽

10.3109/01050398109076187 ◽

1981 ◽

Vol 10 (4) ◽

pp. 239-246 ◽

Cited By ~ 9

Author(s):

D. D. Dirks ◽

C. A. Kamm ◽

J. R. Dubno ◽

T. M. Velde

Keyword(s):

Speech Recognition ◽

Recognition Performance

Download Full-text