scholarly journals The categorical neural organization of speech aids its perception in noise

2019 ◽  
Author(s):  
Gavin M. Bidelman ◽  
Lauren C. Bush ◽  
Alex M. Boudreaux

ABSTRACTWe investigated whether the categorical perception (CP) of speech might also provide a mechanism that aids its perception in noise. We varied signal-to-noise ratio (SNR) [clear, 0 dB, -5 dB] while listeners classified an acoustic-phonetic continuum (/u/ to /a/). Noise-related changes in behavioral categorization were only observed at the lowest SNR. Event-related brain potentials (ERPs) differentiated phonetic vs. non-phonetic (category ambiguous) speech by the P2 wave (∼180-320 ms). Paralleling behavior, neural responses to speech with clear phonetic status (i.e., continuum endpoints) were largely invariant to noise, whereas responses to ambiguous tokens declined with decreasing SNR. Results demonstrate that phonetic speech representations are more resistant to degradation than corresponding acoustic representations. Findings suggest the mere process of binning speech sounds into categories provides a robust mechanism to aid perception at the “cocktail party” by fortifying abstract categories from the acoustic signal and making the speech code more resistant to external interferences.

Author(s):  
Yuxia Wang ◽  
Xiaohu Yang ◽  
Hongwei Ding ◽  
Can Xu ◽  
Chang Liu

Purpose The purpose of this study was to examine the aging effects on the categorical perception (CP) of Mandarin lexical Tones 1–4 and Tones 1–2 in noise. It also investigated whether listeners' categorical tone perception in noise correlated with their general tone identification of 20 natural vowel-plus-tone signals in noise. Method Twelve younger and 12 older listeners with normal hearing were recruited in both tone identification and discrimination tasks in a CP paradigm where fundamental frequency contours of target stimuli varied systematically from the flat tone (Tone 1) to the rising/falling tones (Tones 2/4). Both tasks were conducted in quiet and noise with signal-to-noise ratios set at −5 and −10 dB, respectively, and general tone identification of natural speech signals was also tested in noise conditions. Results Compared with younger listeners, older listeners had shallower identification slopes and smaller discrimination peakedness in Tones 1–2/4 perception in all listening conditions, except for Tones 1–4 perception in quiet where no group differences were found. Meanwhile, noise affected Tones 1–2/4 perception: The signal-to-noise ratio condition at −10 dB brought shallower slope in Tones 1–2/4 identification and less peakedness in Tones 1–4 discrimination for both listener groups. Older listeners' CP in noise, the identification slopes in particular, positively correlated with their general tone identification in noise, but such correlations were partially missing for younger listeners. Conclusions Both aging and the presence of speech-shaped noise significantly reduced the CP of Mandarin Tones 1–2/4. Listeners' Mandarin tone recognition may be related to their CP of Mandarin tones.


2013 ◽  
Vol 110 (10) ◽  
pp. 2312-2324 ◽  
Author(s):  
A. Mouraux ◽  
A. L. De Paepe ◽  
E. Marot ◽  
L. Plaghki ◽  
G. D. Iannetti ◽  
...  

It has been hypothesized that the human cortical responses to nociceptive and nonnociceptive somatosensory inputs differ. Supporting this view, somatosensory-evoked potentials (SEPs) elicited by thermal nociceptive stimuli have been suggested to originate from areas 1 and 2 of the contralateral primary somatosensory (S1), operculo-insular, and cingulate cortices, whereas the early components of nonnociceptive SEPs mainly originate from area 3b of S1. However, to avoid producing a burn lesion, and sensitize or fatigue nociceptors, thermonociceptive SEPs are typically obtained by delivering a small number of stimuli with a large and variable interstimulus interval (ISI). In contrast, the early components of nonnociceptive SEPs are usually obtained by applying many stimuli at a rapid rate. Hence, previously reported differences between nociceptive and nonnociceptive SEPs could be due to differences in signal-to-noise ratio and/or differences in the contribution of cognitive processes related, for example, to arousal and attention. Here, using intraepidermal electrical stimulation to selectively activate Aδ-nociceptors at a fast and constant 1-s ISI, we found that the nociceptive SEPs obtained with a long ISI are no longer identified, indicating that these responses are not obligatory for nociception. Furthermore, using a blind source separation, we found that, unlike the obligatory components of nonnociceptive SEPs, the obligatory components of nociceptive SEPs do not receive a significant contribution from a contralateral source possibly originating from S1. Instead, they were best explained by sources compatible with bilateral operculo-insular and/or cingulate locations. Taken together, our results indicate that the obligatory components of nociceptive and nonnociceptive SEPs are fundamentally different.


1983 ◽  
Vol 50 (6) ◽  
pp. 1516-1521 ◽  
Author(s):  
T. L. Langford

The responses of low-frequency, primary-like neurons of the ventral cochlear nucleus were measured as a function of signal-to-noise ratio for gated tonal signals and noise maskers that were varied in bandwidth and duration. For a given signal-to-noise ratio, each of the maskers is known to produce the same amount of masking in psychophysical experiments. The neural discharge rates measured in the present experiment were different, however, for each of the maskers for each signal-to-noise ratio. In contrast, all three maskers produced the same orderly relationship between signal-to-noise ratio and the degree of neural synchrony to the signals. It is concluded that information concerning the presence and relative strength of a low-frequency tonal signal in a background of masking noise is carried by the nervous system as the degree to which responses are synchronized to the phase of the signal.


Author(s):  
David A. Grano ◽  
Kenneth H. Downing

The retrieval of high-resolution information from images of biological crystals depends, in part, on the use of the correct photographic emulsion. We have been investigating the information transfer properties of twelve emulsions with a view toward 1) characterizing the emulsions by a few, measurable quantities, and 2) identifying the “best” emulsion of those we have studied for use in any given experimental situation. Because our interests lie in the examination of crystalline specimens, we've chosen to evaluate an emulsion's signal-to-noise ratio (SNR) as a function of spatial frequency and use this as our critereon for determining the best emulsion.The signal-to-noise ratio in frequency space depends on several factors. First, the signal depends on the speed of the emulsion and its modulation transfer function (MTF). By procedures outlined in, MTF's have been found for all the emulsions tested and can be fit by an analytic expression 1/(1+(S/S0)2). Figure 1 shows the experimental data and fitted curve for an emulsion with a better than average MTF. A single parameter, the spatial frequency at which the transfer falls to 50% (S0), characterizes this curve.


Author(s):  
W. Kunath ◽  
K. Weiss ◽  
E. Zeitler

Bright-field images taken with axial illumination show spurious high contrast patterns which obscure details smaller than 15 ° Hollow-cone illumination (HCI), however, reduces this disturbing granulation by statistical superposition and thus improves the signal-to-noise ratio. In this presentation we report on experiments aimed at selecting the proper amount of tilt and defocus for improvement of the signal-to-noise ratio by means of direct observation of the electron images on a TV monitor.Hollow-cone illumination is implemented in our microscope (single field condenser objective, Cs = .5 mm) by an electronic system which rotates the tilted beam about the optic axis. At low rates of revolution (one turn per second or so) a circular motion of the usual granulation in the image of a carbon support film can be observed on the TV monitor. The size of the granular structures and the radius of their orbits depend on both the conical tilt and defocus.


Author(s):  
W. Baumeister ◽  
R. Rachel ◽  
R. Guckenberger ◽  
R. Hegerl

IntroductionCorrelation averaging (CAV) is meanwhile an established technique in image processing of two-dimensional crystals /1,2/. The basic idea is to detect the real positions of unit cells in a crystalline array by means of correlation functions and to average them by real space superposition of the aligned motifs. The signal-to-noise ratio improves in proportion to the number of motifs included in the average. Unlike filtering in the Fourier domain, CAV corrects for lateral displacements of the unit cells; thus it avoids the loss of resolution entailed by these distortions in the conventional approach. Here we report on some variants of the method, aimed at retrieving a maximum of information from images with very low signal-to-noise ratios (low dose microscopy of unstained or lightly stained specimens) while keeping the procedure economical.


Author(s):  
D. C. Joy ◽  
R. D. Bunn

The information available from an SEM image is limited both by the inherent signal to noise ratio that characterizes the image and as a result of the transformations that it may undergo as it is passed through the amplifying circuits of the instrument. In applications such as Critical Dimension Metrology it is necessary to be able to quantify these limitations in order to be able to assess the likely precision of any measurement made with the microscope.The information capacity of an SEM signal, defined as the minimum number of bits needed to encode the output signal, depends on the signal to noise ratio of the image - which in turn depends on the probe size and source brightness and acquisition time per pixel - and on the efficiency of the specimen in producing the signal that is being observed. A detailed analysis of the secondary electron case shows that the information capacity C (bits/pixel) of the SEM signal channel could be written as :


1979 ◽  
Vol 10 (4) ◽  
pp. 221-230 ◽  
Author(s):  
Veronica Smyth

Three hundred children from five to 12 years of age were required to discriminate simple, familiar, monosyllabic words under two conditions: 1) quiet, and 2) in the presence of background classroom noise. Of the sample, 45.3% made errors in speech discrimination in the presence of background classroom noise. The effect was most marked in children younger than seven years six months. The results are discussed considering the signal-to-noise ratio and the possible effects of unwanted classroom noise on learning processes.


2020 ◽  
Vol 63 (11) ◽  
pp. 3855-3864
Author(s):  
Wanting Huang ◽  
Lena L. N. Wong ◽  
Fei Chen ◽  
Haihong Liu ◽  
Wei Liang

Purpose Fundamental frequency (F0) is the primary acoustic cue for lexical tone perception in tonal languages but is processed in a limited way in cochlear implant (CI) systems. The aim of this study was to evaluate the importance of F0 contours in sentence recognition in Mandarin-speaking children with CIs and find out whether it is similar to/different from that in age-matched normal-hearing (NH) peers. Method Age-appropriate sentences, with F0 contours manipulated to be either natural or flattened, were randomly presented to preschool children with CIs and their age-matched peers with NH under three test conditions: in quiet, in white noise, and with competing sentences at 0 dB signal-to-noise ratio. Results The neutralization of F0 contours resulted in a significant reduction in sentence recognition. While this was seen only in noise conditions among NH children, it was observed throughout all test conditions among children with CIs. Moreover, the F0 contour-induced accuracy reduction ratios (i.e., the reduction in sentence recognition resulting from the neutralization of F0 contours compared to the normal F0 condition) were significantly greater in children with CIs than in NH children in all test conditions. Conclusions F0 contours play a major role in sentence recognition in both quiet and noise among pediatric implantees, and the contribution of the F0 contour is even more salient than that in age-matched NH children. These results also suggest that there may be differences between children with CIs and NH children in how F0 contours are processed.


2020 ◽  
Vol 63 (1) ◽  
pp. 345-356
Author(s):  
Meital Avivi-Reich ◽  
Megan Y. Roberts ◽  
Tina M. Grieco-Calub

Purpose This study tested the effects of background speech babble on novel word learning in preschool children with a multisession paradigm. Method Eight 3-year-old children were exposed to a total of 8 novel word–object pairs across 2 story books presented digitally. Each story contained 4 novel consonant–vowel–consonant nonwords. Children were exposed to both stories, one in quiet and one in the presence of 4-talker babble presented at 0-dB signal-to-noise ratio. After each story, children's learning was tested with a referent selection task and a verbal recall (naming) task. Children were exposed to and tested on the novel word–object pairs on 5 separate days within a 2-week span. Results A significant main effect of session was found for both referent selection and verbal recall. There was also a significant main effect of exposure condition on referent selection performance, with more referents correctly selected for word–object pairs that were presented in quiet compared to pairs presented in speech babble. Finally, children's verbal recall of novel words was statistically better than baseline performance (i.e., 0%) on Sessions 3–5 for words exposed in quiet, but only on Session 5 for words exposed in speech babble. Conclusions These findings suggest that background speech babble at 0-dB signal-to-noise ratio disrupts novel word learning in preschool-age children. As a result, children may need more time and more exposures of a novel word before they can recognize or verbally recall it.


Sign in / Sign up

Export Citation Format

Share Document