Informational masking vs. crowding — A mid-level trade-off between auditory and visual processing

2021 · Vol 11 (1)
Author(s): Min Zhang, Rachel N Denison, Denis G Pelli, Thuy Tien C Le, Antje Ihlefeld

Abstract: In noisy or cluttered environments, sensory cortical mechanisms help combine auditory or visual features into perceived objects. Knowing that individuals vary greatly in their ability to suppress unwanted sensory information, and knowing that the sizes of auditory and visual cortical regions are correlated, we wondered whether there might be a corresponding relation between an individual's abilities to suppress auditory and visual interference. In auditory masking, background sound makes spoken words unrecognizable. When masking arises due to interference at central auditory processing stages, beyond the cochlea, it is called informational masking (IM). A strikingly similar phenomenon in vision, called visual crowding, occurs when nearby clutter makes a target object unrecognizable, despite being resolved at the retina. Here we compare susceptibilities to auditory IM and visual crowding in the same participants. Surprisingly, across participants, we find a negative correlation (R = −0.7) between IM susceptibility and crowding susceptibility: participants who have low susceptibility to IM tend to have high susceptibility to crowding, and vice versa. This reveals a mid-level trade-off between auditory and visual processing.

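To make the headline statistic concrete, here is a minimal sketch of this kind of across-participant correlation analysis in Python. Every name and number below is invented for illustration; the scores are simulated, not the study's data.

```python
# A minimal sketch of a Pearson correlation between per-participant
# susceptibility scores. All values are simulated for illustration.
import numpy as np
from scipy.stats import pearsonr

rng = np.random.default_rng(seed=1)

n = 20  # hypothetical number of participants
# Simulate anticorrelated susceptibilities (arbitrary units).
im_susceptibility = rng.normal(0.0, 1.0, size=n)
crowding_susceptibility = -0.7 * im_susceptibility + rng.normal(0.0, 0.7, size=n)

r, p = pearsonr(im_susceptibility, crowding_susceptibility)
print(f"R = {r:.2f}, p = {p:.4f}")  # expect a clearly negative R
```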


2009 · Vol 21 (2) · pp. 259-274
Author(s): Ingo Hertrich, Klaus Mathiak, Werner Lutzenberger, Hermann Ackermann

Cross-modal fusion phenomena suggest specific interactions of auditory and visual sensory information both within the speech and nonspeech domains. Using whole-head magnetoencephalography, this study recorded M50 and M100 fields evoked by ambiguous acoustic stimuli that were visually disambiguated to perceived /ta/ or /pa/ syllables. As in natural speech, visual motion onset preceded the acoustic signal by 150 msec. Control conditions included visual and acoustic nonspeech signals as well as visual-only and acoustic-only stimuli. (a) Both speech and nonspeech motion yielded a consistent attenuation of the auditory M50 field, suggesting a visually induced “preparatory baseline shift” at the level of the auditory cortex. (b) Within the temporal domain of the auditory M100 field, visual speech and nonspeech motion gave rise to different response patterns (nonspeech: M100 attenuation; visual /pa/: left-hemisphere M100 enhancement; /ta/: no effect). (c) These interactions could be further decomposed using a six-dipole model comprising three pairs of dipoles. One of these pairs (V270) was fitted to motion-induced activity at a latency of 270 msec after motion onset, that is, the time domain of the auditory M100 field, and could be attributed to the posterior insula. This dipole source responded to nonspeech motion and visual /pa/ but was suppressed in the case of visual /ta/. Such a nonlinear interaction might reflect the operation of a binary distinction between the marked phonological feature “labial” and its underspecified competitor “coronal.” Thus, visual processing seems to be shaped by linguistic data structures even prior to its fusion with the auditory information channel.


2021 · Vol 15
Author(s): Ruxandra I. Tivadar, Robert T. Knight, Athina Tzovara

The human brain has the astonishing capacity to integrate streams of sensory information from the environment and to form predictions about future events in an automatic way. Although predictive coding was initially developed for visual processing, the bulk of subsequent research has focused on auditory processing, with the well-known mismatch negativity as possibly the most studied signature of a surprise or prediction error (PE) signal. Auditory PEs are present during various states of consciousness. Intriguingly, their presence and characteristics have been linked with residual levels of consciousness and the return of awareness. In this review we first give an overview of the neural substrates of predictive processes in the auditory modality and their relation to consciousness. Then we focus on different states of consciousness (wakefulness, sleep, anesthesia, coma, meditation, and hypnosis) and on what predictive processing has revealed about brain functioning in such states. We review studies investigating how the neural signatures of auditory predictions are modulated by states of reduced or absent consciousness. As a future outlook, we propose a combination of electrophysiological and computational techniques that will allow investigation of which facets of sensory predictive processes are maintained when consciousness fades away.


2000 · Vol 84 (6) · pp. 2984-2997
Author(s): Per Jenmalm, Seth Dahlstedt, Roland S. Johansson

Most objects that we manipulate have curved surfaces. We analyzed how subjects, during a prototypical manipulatory task, use visual and tactile sensory information to adapt fingertip actions to changes in object curvature. Subjects grasped an elongated object at one end using a precision grip and lifted it while instructed to keep it level. The principal load of the grasp was tangential torque, due to the location of the object's center of mass relative to the horizontal grip axis joining the centers of the opposing grasp surfaces. The curvature strongly influenced the grip forces required to prevent rotational slips. Likewise, the curvature influenced the rotational yield of the grasp that developed under the tangential torque load, owing to the viscoelastic properties of the fingertip pulps. Subjects scaled the grip forces parametrically with object curvature for grasp stability. Moreover, in a curvature-dependent manner, subjects twisted the grasp around the grip axis by a radial flexion of the wrist to keep the desired object orientation despite the rotational yield. To adapt these fingertip actions to object curvature, subjects could use both vision and tactile sensibility integrated with predictive control. During combined blindfolding and digital anesthesia, however, the motor output failed to predict the consequences of the prevailing curvature. Subjects used vision to identify the curvature for efficient feedforward retrieval of grip force requirements before executing the motor commands. Digital anesthesia caused little impairment of grip force control when subjects had vision available, but the adaptation of the twist became delayed. Visual cues about the form of the grasp surface obtained before contact were used to scale the grip force, whereas the scaling of the twist depended on visual cues related to object movement. Thus subjects apparently relied on different visuomotor mechanisms for the adaptation of grip force and grasp kinematics. In contrast, blindfolded subjects used tactile cues about the prevailing curvature, obtained after contact with the object, for feedforward adaptation of both grip force and twist. We conclude that humans use both vision and tactile sensibility for feedforward parametric adaptation of grip forces and grasp kinematics to object curvature. Normal control of the twist action, however, requires digital afferent input, and different visuomotor mechanisms support the control of the grasp twist and the grip force. This differential use of vision may have a bearing on the two-stream model of human visual processing.
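As a rough aid to the load geometry described here, the sketch below computes a tangential torque load from hypothetical object parameters; the mass and center-of-mass offset are invented, not the study's values.

```python
# A minimal sketch of the load geometry: the tangential torque on the grasp
# is the object's weight times the horizontal distance from the grip axis to
# the center of mass. All numbers are hypothetical.
G = 9.81           # gravitational acceleration, m/s^2

mass = 0.4         # object mass, kg (hypothetical)
com_offset = 0.07  # horizontal grip-axis-to-center-of-mass distance, m (hypothetical)

torque = mass * G * com_offset  # tangential torque load, N*m
print(f"tangential torque load = {torque:.3f} N*m")  # ~0.275 N*m
```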


2012 · Vol 2012 · pp. 1-7
Author(s): Joseph P. Pillion

Deficits in central auditory processing may occur in a variety of clinical conditions, including traumatic brain injury, neurodegenerative disease, auditory neuropathy/dyssynchrony syndrome, neurological disorders associated with aging, and aphasia. Deficits in central auditory processing of a more subtle nature have also been studied extensively in neurodevelopmental disorders in children with learning disabilities, ADD, and developmental language disorders. Illustrative cases are reviewed demonstrating the use of an audiological test battery in patients with auditory neuropathy/dyssynchrony syndrome, bilateral lesions to the inferior colliculi, and bilateral lesions to the temporal lobes. Electrophysiological tests of auditory function were utilized to define the locus of dysfunction at neural levels ranging from the auditory nerve to the midbrain and cortex.


Author(s): Anna Rasmus, Aleksandra Błachnio

Background: Language communication, one of the basic forms of building and maintaining interpersonal relationships, deteriorates in older age. One of the probable causes is a decline in auditory functioning, including central auditory processing. The aim of the present study is to evaluate the profile of central auditory processing disorders in the elderly, as well as the relationship between these disorders and the perception of emotional and linguistic prosody. Methods: The Right Hemisphere Language Battery (RHLB-PL) and the Brain-Boy Universal Professional (BUP) were used. Results: There are statistically significant relationships between emotional prosody and: spatial hearing (r(18) = 0.46, p = 0.04); reaction time (r(18) = 0.49, p = 0.03); recognizing the frequency pattern (r(18) = 0.49, p = 0.03); and recognizing the duration pattern (r(18) = 0.45, p = 0.05). There are statistically significant correlations between linguistic prosody and: pitch discrimination (r(18) = 0.5, p = 0.02); recognition of the frequency pattern (r(18) = 0.55, p = 0.01); recognition of the temporal pattern; and emotional prosody (r(18) = 0.58, p = 0.01). Conclusions: The analysis of the disturbed components of central auditory processing in the tested sample showed a reduction in the functions related to frequency differentiation, recognition of the temporal pattern, the process of discriminating between important sounds, and the speed of reaction. De-automation of the basic functions of central auditory processing, which we observe in older age, lowers the perception of both emotional and linguistic prosody, thus reducing the quality of communication in older people.
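A note on the notation: r(18) denotes a Pearson correlation with 18 degrees of freedom, i.e., n = 20 participants, since df = n − 2. The hypothetical helper below shows the standard conversion from a reported r to a two-tailed p-value via the t distribution; only the r values are taken from the abstract.

```python
# Convert a Pearson r with given degrees of freedom into a two-tailed
# p-value using the standard transform t = r * sqrt(df / (1 - r^2)).
import math
from scipy.stats import t as t_dist

def p_from_r(r: float, df: int) -> float:
    """Two-tailed p-value for a Pearson correlation r with df degrees of freedom."""
    t = r * math.sqrt(df / (1.0 - r * r))
    return 2.0 * t_dist.sf(abs(t), df)

print(f"r(18) = 0.46 -> p = {p_from_r(0.46, 18):.3f}")  # ~0.041, matching the reported 0.04
print(f"r(18) = 0.55 -> p = {p_from_r(0.55, 18):.3f}")  # ~0.012, matching the reported 0.01
```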


2021 · pp. 174702182199003
Author(s): Andy J Kim, David S Lee, Brian A Anderson

Previously reward-associated stimuli have consistently been shown to involuntarily capture attention in the visual domain. Although previously reward-associated but currently task-irrelevant sounds have also been shown to interfere with visual processing, it remains unclear whether such stimuli can interfere with the processing of task-relevant auditory information. To address this question, we modified a dichotic listening task to measure interference from task-irrelevant but previously reward-associated sounds. In a training phase, participants were simultaneously presented with a spoken letter and number in different auditory streams and learned to associate the correct identification of each of three letters with high, low, and no monetary reward, respectively. In a subsequent test phase, participants were again presented with the same auditory stimuli but were instead instructed to report the number while ignoring spoken letters. In both the training and test phases, response time measures demonstrated that attention was biased in favour of the auditory stimulus associated with high value. Our findings demonstrate that attention can be biased towards learned reward cues in the auditory domain, interfering with goal-directed auditory processing.

