Speech Recognition in Multitalker Babble Using Digits, Words, and Sentences

2005 ◽  
Vol 16 (09) ◽  
pp. 726-739 ◽  
Author(s):  
Rachel A. McArdle ◽  
Richard H. Wilson ◽  
Christopher A. Burks

This mixed-model study examined differences in speech recognition performance in multitalker babble between listeners with normal hearing (n = 36) and listeners with hearing loss (n = 72), using stimuli of varying linguistic complexity (digits, words, and sentences). All listeners were administered two trials of two lists of each material in a descending speech-to-babble ratio. For each of the materials, recognition performances by the listeners with normal hearing were significantly better than the performances by the listeners with hearing loss. The mean separation between groups at the 50% point in signal-to-babble ratio on each of the three materials was ~8 dB. The 50% points for digits were obtained at a significantly lower signal-to-babble ratio than those for sentences or words, which were equivalent to each other. There were no interlist differences between the two lists for the digits and words, but there was a significant disparity between QuickSIN™ lists for the listeners with hearing loss. A two-item questionnaire was used to obtain a subjective measurement of speech recognition, which showed moderate correlations with objective measures of speech recognition in noise using digits (r = .641), sentences (r = .572), and words (r = .673).

2021 ◽  
Vol 4 (2) ◽  
pp. 45-50
Author(s):  
Ecem KARTAL ÖZCAN ◽  
Merve ÖZBAL BATUK ◽  
Şule KAYA ◽  
Gonca SENNAROĞLU

Assessment of speech perception in noise in children with hearing aids: Preliminary results

Objective: Noisy environments are a part of the daily life of children, just as they are for adults. Children with hearing loss who wear hearing aids are more susceptible to the negative effects of noise than their normal-hearing peers. This study aims to evaluate the speech recognition in noise performance of hearing aid users and compare it with that of their normal-hearing peers. Material and Method: Five children aged 6-12 years with bilateral moderate to severe symmetrical sensorineural hearing loss who used bilateral behind-the-ear hearing aids were included in the study. Four different conditions of the Turkish HINT-C were applied, and a speech recognition threshold (SRT) was determined for each condition. Results: Regardless of their age, the SRT needed by children with hearing aids to achieve equal performance with their normal-hearing peers was found to be higher for all test conditions. As seen in children with normal hearing in general, the mean noise front score of the children with hearing loss was higher than the mean noise right and noise left scores. Conclusion: The results of this study revealed that children with bilaterally symmetrical moderate to severe hearing loss achieved poor speech recognition scores in environments similar to the classroom environment, compared to their normal-hearing peers. Our results guided appropriate rehabilitation and follow-up. Keywords: noise, speech recognition in noise, hearing loss, hearing aid, pediatric audiology, HINT, HINT-C


2012 ◽  
Vol 23 (08) ◽  
pp. 577-589 ◽  
Author(s):  
Mary Rudner ◽  
Thomas Lunner ◽  
Thomas Behrens ◽  
Elisabet Sundewall Thorén ◽  
Jerker Rönnberg

Background: Recently there has been interest in using subjective ratings as a measure of perceived effort during speech recognition in noise. Perceived effort may be an indicator of cognitive load. Thus, subjective effort ratings during speech recognition in noise may covary both with signal-to-noise ratio (SNR) and individual cognitive capacity. Purpose: The present study investigated the relation between subjective ratings of the effort involved in listening to speech in noise, speech recognition performance, and individual working memory (WM) capacity in hearing impaired hearing aid users. Research Design: In two experiments, participants with hearing loss rated perceived effort during aided speech perception in noise. Noise type and SNR were manipulated in both experiments, and in the second experiment hearing aid compression release settings were also manipulated. Speech recognition performance was measured along with WM capacity. Study Sample: There were 46 participants in all with bilateral mild to moderate sloping hearing loss. In Experiment 1 there were 16 native Danish speakers (eight women and eight men) with a mean age of 63.5 yr (SD = 12.1) and average pure tone (PT) threshold of 47.6 dB (SD = 9.8). In Experiment 2 there were 30 native Swedish speakers (19 women and 11 men) with a mean age of 70 yr (SD = 7.8) and average PT threshold of 45.8 dB (SD = 6.6). Data Collection and Analysis: A visual analog scale (VAS) was used for effort rating in both experiments. In Experiment 1, effort was rated at individually adapted SNRs while in Experiment 2 it was rated at fixed SNRs. Speech recognition in noise performance was measured using adaptive procedures in both experiments with Dantale II sentences in Experiment 1 and Hagerman sentences in Experiment 2. WM capacity was measured using a letter-monitoring task in Experiment 1 and the reading span task in Experiment 2.
Results: In both experiments, there was a strong and significant relation between rated effort and SNR that was independent of individual WM capacity, whereas the relation between rated effort and noise type seemed to be influenced by individual WM capacity. Experiment 2 showed that hearing aid compression setting influenced rated effort. Conclusions: Subjective ratings of the effort involved in speech recognition in noise reflect SNRs, and individual cognitive capacity seems to influence relative rating of noise type.


1997 ◽  
Vol 40 (2) ◽  
pp. 423-431 ◽  
Author(s):  
Sandra Gordon-Salant ◽  
Peter J. Fitzgibbons

The influence of selected cognitive factors on age-related changes in speech recognition was examined by measuring the effects of recall task, speech rate, and availability of contextual cues on recognition performance by young and elderly listeners. Stimuli were low and high context sentences from the R-SPIN test presented at normal and slowed speech rates in noise. Response modes were final word recall and sentence recall. The effects of hearing loss and age were examined by comparing performances of young and elderly listeners with normal hearing and young and elderly listeners with hearing loss. Listeners with hearing loss performed more poorly than listeners with normal hearing in nearly every condition. In addition, elderly listeners exhibited poorer performance than younger listeners on the sentence recall task, but not on the word recall task, indicating that added memory demands have a detrimental effect on elderly listeners' performance. Slowing of speech rate did not have a differential effect on performance of young and elderly listeners. All listeners performed well when stimulus contextual cues were available. Taken together, these results support the notion that the performance of elderly listeners with hearing loss is influenced by a combination of auditory processing factors, memory demands, and speech contextual information.


2014 ◽  
Vol 25 (06) ◽  
pp. 529-540 ◽  
Author(s):  
Erin C. Schafer ◽  
Danielle Bryant ◽  
Katie Sanders ◽  
Nicole Baldus ◽  
Katherine Algier ◽  
...  

Background: Several recent investigations support the use of frequency modulation (FM) systems in children with normal hearing and auditory processing or listening disorders such as those diagnosed with auditory processing disorders, autism spectrum disorders, attention-deficit hyperactivity disorder, Friedreich ataxia, and dyslexia. The American Academy of Audiology (AAA) published suggested procedures, but these guidelines do not cite research evidence to support the validity of the recommended procedures for fitting and verifying nonoccluding open-ear FM systems on children with normal hearing. Documenting the validity of these fitting procedures is critical to maximize the potential FM-system benefit in the abovementioned populations of children with normal hearing and those with auditory-listening problems. Purpose: The primary goal of this investigation was to determine the validity of the AAA real-ear approach to fitting FM systems on children with normal hearing. The secondary goal of this study was to examine speech-recognition performance in noise and loudness ratings without and with FM systems in children with normal hearing sensitivity. Research Design: A two-group, cross-sectional design was used in the present study. Study Sample: Twenty-six typically functioning children, ages 5–12 yr, with normal hearing sensitivity participated in the study. Intervention: Participants used a nonoccluding open-ear FM receiver during laboratory-based testing. Data Collection and Analysis: Participants completed three laboratory tests: (1) real-ear measures, (2) speech recognition performance in noise, and (3) loudness ratings. 
Four real-ear measures were conducted to (1) verify that measured output met prescribed-gain targets across the 1000–4000 Hz frequency range for speech stimuli, (2) confirm that the FM-receiver volume did not exceed predicted uncomfortable loudness levels, and (3 and 4) measure changes to the real-ear unaided response when placing the FM receiver in the child’s ear. After completion of the fitting, speech recognition in noise at a –5 signal-to-noise ratio and loudness ratings at a +5 signal-to-noise ratio were measured in four conditions: (1) no FM system, (2) FM receiver on the right ear, (3) FM receiver on the left ear, and (4) bilateral FM system. Results: The results of this study suggested that the slightly modified AAA real-ear measurement procedures resulted in a valid fitting of one FM system on children with normal hearing. On average, prescriptive targets were met for 1000, 2000, 3000, and 4000 Hz within 3 dB, and maximum output of the FM system never exceeded and was significantly lower than predicted uncomfortable loudness levels for the children. There was a minimal change in the real-ear unaided response when the open-ear FM receiver was placed into the ear. Use of the FM system on one or both ears resulted in significantly better speech recognition in noise relative to a no-FM condition, and the unilateral and bilateral FM receivers resulted in a comfortably loud signal when listening in background noise. Conclusions: Real-ear measures are critical for obtaining an appropriate fit of an FM system on children with normal hearing.


2008 ◽  
Vol 19 (07) ◽  
pp. 548-556 ◽  
Author(s):  
Richard H. Wilson ◽  
Wendy B. Cates

Background: The Speech Recognition in Noise Test (SPRINT) is a word-recognition instrument that presents the 200 Northwestern University Auditory Test No. 6 (NU-6) words binaurally at 50 dB HL in a multitalker babble at a 9 dB signal-to-noise ratio (S/N) (Cord et al, 1992). The SPRINT was developed by and used by the Army as a more valid predictor of communication abilities (than pure-tone thresholds or word-recognition in quiet) for issues involving fitness for duty from a hearing perspective of Army personnel. The Words-in-Noise test (WIN) is a slightly different word-recognition task in a fixed level multitalker babble with 10 NU-6 words presented at each of 7 S/N from 24 to 0 dB S/N in 4 dB decrements (Wilson, 2003; Wilson and McArdle, 2007). For the two instruments, both the babble and the speakers of the words are different. The SPRINT uses all 200 NU-6 words, whereas the WIN uses a maximum of 70 words. Purpose: The purpose was to compare recognition performances by 24 young listeners with normal hearing and 48 older listeners with sensorineural hearing loss on the SPRINT and WIN protocols. Research Design: A quasi-experimental, mixed model design was used. Study Sample: The 24 young listeners with normal hearing (19 to 29 years, mean = 23.3 years) were from the local university and had normal hearing (≤20 dB HL; American National Standards Institute, 2004) at the 250–8000 Hz octave intervals. The 48 older listeners with sensorineural hearing loss (60 to 82 years, mean = 69.9 years) had the following inclusion criteria: (1) a threshold at 500 Hz between 15 and 30 dB HL, (2) a threshold at 1000 Hz between 20 and 40 dB HL, (3) a three-frequency pure-tone average (500, 1000, and 2000 Hz) of ≤40 dB HL, (4) word-recognition scores in quiet ≥40%, and (5) no history of middle ear or retrocochlear pathology as determined by an audiologic evaluation.
Data Collection and Analysis: The speech materials were presented bilaterally in the following order: (1) the SPRINT at 50 dB HL, (2) two half lists of NU-6 words in quiet at 60 dB HL and 80 dB HL, and (3) the two 35-word lists of the WIN materials with the multitalker babble fixed at 60 dB HL. Data collection occurred during a 40–60 minute session. Recognition performances on each stimulus word were analyzed. Results: The listeners with normal hearing obtained 92.5% correct on the SPRINT with a 50% point on the WIN of 2.7 dB S/N. The listeners with hearing loss obtained 65.3% correct on the SPRINT and a WIN 50% point at 12.0 dB S/N. The SPRINT and WIN were significantly correlated (r = −0.81, p < .01), indicating that the SPRINT had good concurrent validity. The high-frequency, pure-tone average (1000, 2000, 4000 Hz) had higher correlations with the SPRINT, WIN, and NU-6 in quiet than did the traditional three-frequency pure-tone average (500, 1000, 2000 Hz). Conclusions: Graphically and numerically the SPRINT and WIN were highly related, which is indicative of good concurrent validity of the SPRINT.
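The abstract above reports 50% points in dB S/N but does not say how they were computed. One estimator often paired with descending-level protocols of this kind is the Spearman-Kärber equation; the sketch below assumes that estimator together with the WIN layout described in the abstract (7 levels from 24 to 0 dB S/N in 4 dB decrements, 10 words per level). The function name, its parameters, and the example counts are hypothetical and are not taken from the study.

```python
def spearman_karber_50(correct_per_level, start_snr_db=24.0,
                       step_db=4.0, words_per_level=10):
    """Estimate the 50% point (dB S/N) with the Spearman-Karber equation.

    Assumption: the levels descend from start_snr_db in equal steps of
    step_db, with the same number of words presented at every level.
    """
    for count in correct_per_level:
        if count < 0 or count > words_per_level:
            raise ValueError("each count must lie between 0 and words_per_level")
    total_correct = sum(correct_per_level)
    # 50% point = start + step/2 - step * (total correct / words per level)
    return start_snr_db + step_db / 2.0 - step_db * total_correct / words_per_level


# Hypothetical listener: words correct at 24, 20, 16, 12, 8, 4, and 0 dB S/N.
example_counts = [10, 10, 9, 7, 4, 1, 0]
example_50_point = spearman_karber_50(example_counts)
```

Under these assumptions a listener who repeats every word correctly receives the lowest possible estimate (−2 dB S/N) and a listener who repeats none receives the highest (26 dB S/N). For a single 35-word list, `words_per_level` would be 5.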


2017 ◽  
Vol 60 (9) ◽  
pp. 2725-2739 ◽  
Author(s):  
Jing Shen ◽  
Pamela E. Souza

Purpose: This study investigated the effect of dynamic pitch in target speech on older and younger listeners' speech recognition in temporally modulated noise. First, we examined whether the benefit from dynamic-pitch cues depends on the temporal modulation of noise. Second, we tested whether older listeners can benefit from dynamic-pitch cues for speech recognition in noise. Last, we explored the individual factors that predict the amount of dynamic-pitch benefit for speech recognition in noise. Method: Younger listeners with normal hearing and older listeners with varying levels of hearing sensitivity participated in the study, in which speech reception thresholds were measured with sentences in nonspeech noise. Results: The younger listeners benefited more from dynamic pitch for speech recognition in temporally modulated noise than unmodulated noise. Older listeners were able to benefit from the dynamic-pitch cues but received less benefit from noise modulation than the younger listeners. For those older listeners with hearing loss, the amount of hearing loss strongly predicted the dynamic-pitch benefit for speech recognition in noise. Conclusions: Dynamic-pitch cues aid speech recognition in noise, particularly when noise has temporal modulation. Hearing loss negatively affects the dynamic-pitch benefit to older listeners with significant hearing loss.


2014 ◽  
Vol 25 (07) ◽  
pp. 688-696 ◽  
Author(s):  
Richard H. Wilson

Background: The abrupt transition of a signal from off to on and vice versa typically produces spectral splatter that can mask other signals that are spectrally removed from the nominal signal frequency. Both the Miller and Licklider (1950) and Cherry (1953) studies of interrupted speech and alternated speech, respectively, acknowledged the generation of extraneous noise by the rapid on and off characteristics of their unshaped signals but noted that for slower interruption rates (e.g., 10 interruptions per second) the masking effects were minimal. Recent studies of interrupted speech have avoided this issue by shaping the rise-fall times with a digital algorithm (e.g., Jin and Nelson, 2010; Wang and Humes, 2010). A second variable in the interrupted speech paradigm is the temporal location or placement of the interruptions (i.e., where in the waveform the interruptions occur). Here the issue is this: what parts of an utterance are necessary to enable intelligibility (e.g., Fogerty and Kewley-Port, 2009)? Interruptions may or may not disturb these necessary cues. Purpose: Here is the prompting question: do shaped and unshaped rise-fall characteristics of the on-segments of interrupted speech produce the same or different recognition performances? A second question arises: are recognition performances on complementary halves of an interrupted signal the same or different? Research Design: This study used a mixed-model design with two within-subject variables (unshaped and shaped rise-fall characteristic, complementary halves) and one between-subjects variable (listener group). Study Sample: A total of 12 young listeners (age range: 19–29 yr) with normal hearing and 12 older listeners (age range: 53–80 yr) with hearing loss for pure tones participated.
Data Collection and Analysis: A total of 95 consonant-vowel nucleus-consonant words were interrupted (10 interruptions per second; 50% duty cycle) by parsing alternate 50 msec segments to separate files, which provided complementary temporal halves of the target word referenced to word onset; the first on-segment of the 0 msec condition started at word onset, whereas the first on-segment of the 50 msec condition started 50 msec after word onset. The interruption routine either applied no shaping of the 4 msec rise-fall times or a cos² shape. Each listener received 25 practice words then a unique randomization of 280 interrupted words (70 words, 2 rise-fall shapes, and 2 interrupt onset conditions). Results: The listeners with normal hearing performed 8–16% better on the various comparable conditions than did the older listeners with hearing loss. The mean performance differences between shaped and unshaped rise-fall characteristics ranged from <1–3% and were not significant. Performance was significantly 10–17% better on the 0 msec condition than on the 50 msec condition. There was no significant interaction between the two main variables, rise-fall shape, and onset time of the interruptions. Conclusions: The rise-fall shape of the onset and offset of the on-segment of the interruption cycle does not affect recognition performance of words. The location of the interruptions in a word can have a significant effect on recognition performance.
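The interruption scheme described above (alternate 50 msec segments, 50% duty cycle, complementary 0 msec and 50 msec onsets, optional cos² shaping over 4 msec) can be illustrated with a per-sample gain envelope. This is a minimal sketch and not the authors' routine: the function names and parameters are hypothetical, and the unshaped condition is assumed here to be an abrupt rectangular gate, which the abstract does not spell out.

```python
import math


def interruption_gate(n_samples, sample_rate, onset_ms=0.0,
                      segment_ms=50.0, ramp_ms=4.0, shaped=True):
    """Return a list of per-sample gains (0.0 to 1.0) for interrupted speech.

    On-segments of segment_ms alternate with silent gaps of equal length.
    onset_ms=0 places the first on-segment at word onset; onset_ms=50 gives
    the complementary half. With shaped=True, each on-segment rises and
    falls along a cos^2 (raised-cosine) ramp of ramp_ms.
    """
    seg = int(round(segment_ms * sample_rate / 1000.0))
    ramp = int(round(ramp_ms * sample_rate / 1000.0))
    offset = int(round(onset_ms * sample_rate / 1000.0))
    gate = []
    for n in range(n_samples):
        k = n - offset
        # Silent before the first on-segment and during every odd segment.
        if k < 0 or (k // seg) % 2 == 1:
            gate.append(0.0)
            continue
        pos = k % seg  # position inside the current on-segment
        if not shaped or ramp == 0:
            gate.append(1.0)  # assumed abrupt (unshaped) gating
        elif pos < ramp:
            gate.append(math.sin(0.5 * math.pi * pos / ramp) ** 2)  # rise
        elif pos >= seg - ramp:
            fall_pos = pos - (seg - ramp)
            gate.append(math.cos(0.5 * math.pi * fall_pos / ramp) ** 2)  # fall
        else:
            gate.append(1.0)
    return gate


def apply_gate(samples, gate):
    """Multiply a waveform (sequence of floats) by the gain envelope."""
    return [x * g for x, g in zip(samples, gate)]
```

With unshaped gating, the 0 msec and 50 msec envelopes sum to 1.0 at every sample, which is what makes the two files complementary halves of the same word.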


1990 ◽  
Vol 33 (4) ◽  
pp. 726-735 ◽  
Author(s):  
Larry E. Humes ◽  
Lisa Roberts

The role that sensorineural hearing loss plays in the speech-recognition difficulties of the hearing-impaired elderly is examined. One approach to this issue was to make between-group comparisons of performance for three groups of subjects: (a) young normal-hearing adults; (b) elderly hearing-impaired adults; and (c) young normal-hearing adults with simulated sensorineural hearing loss equivalent to that of the elderly subjects produced by a spectrally shaped masking noise. Another approach to this issue employed correlational analyses to examine the relation between audibility and speech recognition within the group of elderly hearing-impaired subjects. An additional approach was pursued in which an acoustical index incorporating adjustments for threshold elevation was used to examine the role audibility played in the speech-recognition performance of the hearing-impaired elderly. A wide range of listening conditions was sampled in this experiment. The conclusion was that the primary determiner of speech-recognition performance in the elderly hearing-impaired subjects was their threshold elevation.


2017 ◽  
Vol 60 (8) ◽  
pp. 2310-2320 ◽  
Author(s):  
Christi W. Miller ◽  
Erin K. Stewart ◽  
Yu-Hsiang Wu ◽  
Christopher Bishop ◽  
Ruth A. Bentler ◽  
...  

Purpose: This study evaluated the relationship between working memory (WM) and speech recognition in noise with different noise types as well as in the presence of visual cues. Method: Seventy-six adults with bilateral, mild to moderately severe sensorineural hearing loss (mean age: 69 years) participated. Using a cross-sectional design, 2 measures of WM were taken: a reading span measure, and Word Auditory Recognition and Recall Measure (Smith, Pichora-Fuller, & Alexander, 2016). Speech recognition was measured with the Multi-Modal Lexical Sentence Test for Adults (Kirk et al., 2012) in steady-state noise and 4-talker babble, with and without visual cues. Testing was under unaided conditions. Results: A linear mixed model revealed visual cues and pure-tone average as the only significant predictors of Multi-Modal Lexical Sentence Test outcomes. Neither WM measure nor noise type showed a significant effect. Conclusion: The contribution of WM in explaining unaided speech recognition in noise was negligible and not influenced by noise type or visual cues. We anticipate that with audibility partially restored by hearing aids, the effects of WM will increase. For clinical practice to be affected, more significant effect sizes are needed.

