Auditory scene analysis by songbirds: Stream segregation of birdsong by European starlings (Sturnus vulgaris).

1997 ◽ Vol 111 (1) ◽ pp. 3-13 ◽ Author(s): Stewart H. Hulse, Scott A. MacDougall-Shackleton, Amy B. Wisniewski

1998 ◽ Vol 103 (6) ◽ pp. 3581-3587 ◽ Author(s): Scott A. MacDougall-Shackleton, Stewart H. Hulse, Timothy Q. Gentner, Wesley White

2012 ◽ Vol 107 (9) ◽ pp. 2366-2382 ◽ Author(s): Yonatan I. Fishman, Christophe Micheyl, Mitchell Steinschneider

The ability to detect and track relevant acoustic signals embedded in a background of other sounds is crucial for hearing in complex acoustic environments. This ability is exemplified by a perceptual phenomenon known as “rhythmic masking release” (RMR). To demonstrate RMR, a sequence of tones forming a target rhythm is intermingled with physically identical “Distracter” sounds that perceptually mask the rhythm. The rhythm can be “released from masking” by adding “Flanker” tones in adjacent frequency channels that are synchronous with the Distracters. RMR represents a special case of auditory stream segregation, whereby the target rhythm is perceptually segregated from the background of Distracters when the Distracters are accompanied by the synchronous Flankers. The neural basis of RMR is unknown. Previous studies suggest the involvement of primary auditory cortex (A1) in the perceptual organization of sound patterns. Here, we recorded neural responses to RMR sequences in A1 of awake monkeys to identify neural correlates and potential mechanisms of RMR. We also tested whether two current models of stream segregation, when applied to these responses, could account for the perceptual organization of RMR sequences. Results suggest a key role for suppression of Distracter-evoked responses by the simultaneous Flankers in the perceptual restoration of the target rhythm in RMR. Furthermore, predictions of stream segregation models paralleled the psychoacoustics of RMR in humans. These findings reinforce the view that preattentive or “primitive” aspects of auditory scene analysis may be explained by relatively basic neural mechanisms at the cortical level.
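The RMR stimulus construction described above (a target rhythm, physically identical Distracters, and Flankers in adjacent frequency channels synchronous with the Distracters) can be sketched in a few lines of numpy. This is a minimal illustrative sketch only: the frequencies, durations, and sample rate below are assumptions for demonstration, not the parameters used in the study.

```python
import numpy as np

FS = 16000  # sample rate in Hz; all parameter values here are illustrative

def tone(freq, dur, fs=FS):
    """Pure tone with 5-ms raised-cosine onset/offset ramps."""
    t = np.arange(int(dur * fs)) / fs
    y = np.sin(2 * np.pi * freq * t)
    n_ramp = int(0.005 * fs)
    ramp = 0.5 * (1 - np.cos(np.pi * np.arange(n_ramp) / n_ramp))
    env = np.ones_like(y)
    env[:n_ramp] = ramp
    env[-n_ramp:] = ramp[::-1]
    return y * env

def rmr_sequence(target_onsets, distracter_onsets, total_dur,
                 f_target=1000.0, flanker_freqs=(500.0, 2000.0),
                 tone_dur=0.05, with_flankers=True):
    """Target rhythm mixed with physically identical Distracters; Flankers
    (adjacent-channel tones synchronous with the Distracters) optionally
    release the rhythm from masking."""
    out = np.zeros(int(total_dur * FS))

    def add(onset, freq):
        i = int(onset * FS)
        seg = tone(freq, tone_dur)
        out[i:i + len(seg)] += seg

    for t0 in target_onsets:
        add(t0, f_target)              # the target rhythm
    for t0 in distracter_onsets:
        add(t0, f_target)              # Distracters: same frequency as targets
        if with_flankers:
            for f in flanker_freqs:    # Flankers tag the Distracters
                add(t0, f)
    return out
```

With `with_flankers=False` the Distracters are acoustically indistinguishable from the rhythm tones (the masked condition); adding the Flankers changes only the Distracter time slots, which is the manipulation that perceptually restores the rhythm.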


2017 ◽ Vol 60 (10) ◽ pp. 2989-3000 ◽ Author(s): Elyse S. Sussman

Purpose: This review article provides a new perspective on the role of attention in auditory scene analysis.

Method: A framework for understanding how attention interacts with stimulus-driven processes to facilitate task goals is presented. Previously reported data obtained through behavioral and electrophysiological measures in adults with normal hearing are summarized to demonstrate attention effects on auditory perception, from passive processes that organize unattended input to attention effects that act at different levels of the system. The data show that attention can sharpen stream organization toward behavioral goals, identify auditory events obscured by noise, and limit passive processing capacity.

Conclusions: A model of attention is provided that illustrates how the auditory system performs multilevel analyses involving interactions between stimulus-driven input and top-down processes. Overall, these studies show that (a) stream segregation occurs automatically and sets the basis for auditory event formation; (b) attention interacts with automatic processing to facilitate task goals; and (c) information about unattended sounds is not lost when one organization is selected over another. These results support a neural model in which multiple sound organizations can be held in memory and accessed simultaneously through a balance of automatic and task-specific processes, allowing flexibility for navigating noisy environments with competing sound sources.

Presentation Video: http://cred.pubs.asha.org/article.aspx?articleid=2601618


2009 ◽ Vol 101 (6) ◽ pp. 3212-3225 ◽ Author(s): Naoya Itatani, Georg M. Klump

Streaming in auditory scene analysis refers to the perceptual grouping of multiple interleaved sounds with similar characteristics, while sounds with different characteristics are segregated. In human perception, auditory streaming occurs on the basis of temporal features of sounds such as the rate of amplitude modulation. We present results from multiunit recordings in the auditory forebrain of awake European starlings (Sturnus vulgaris) on the representation of sinusoidally amplitude-modulated (SAM) tones, to investigate the effect of temporal envelope structure on neural stream segregation. Different types of rate modulation transfer functions in response to SAM tones were observed. The strongest responses were found for modulation frequencies (fmod) <160 Hz. The streaming stimulus consisted of sequences of alternating SAM tones with the same carrier frequency but differing in fmod (ABA-ABA-ABA-…). A signals had a modulation frequency evoking a large excitation, whereas the fmod of B signals was up to 4 octaves higher. Synchrony of B-signal responses to the modulation decreased as fmod increased, and the spike rate in response to B signals likewise dropped. Faster signal repetition resulted in fewer spikes, suggesting a contribution of forward suppression that is likely due to the two signals' similar spectral energy rather than to the temporal pattern of modulation. These two effects are additive and may provide the basis for a more separated representation of A and B signals by two populations of neurons, which can be viewed as a neuronal correlate of segregated streams.
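The streaming stimulus described above (ABA- triplets of SAM tones sharing one carrier frequency, with the B tone's modulation frequency several octaves above the A tone's) can likewise be sketched in numpy. Carrier frequency, fmod values, tone durations, and sample rate below are illustrative assumptions, not the values used with the starlings.

```python
import numpy as np

FS = 32000  # sample rate in Hz; parameter values are illustrative

def sam_tone(f_carrier, f_mod, dur, fs=FS):
    """Sinusoidally amplitude-modulated (SAM) tone, 100% modulation depth."""
    t = np.arange(int(dur * fs)) / fs
    envelope = 0.5 * (1 + np.cos(2 * np.pi * f_mod * t))  # modulator
    return envelope * np.sin(2 * np.pi * f_carrier * t)   # carrier

def aba_sequence(f_carrier=2000.0, fmod_a=30.0, octave_sep=2.0,
                 tone_dur=0.1, gap=0.02, n_triplets=4):
    """ABA- triplets: A and B share the carrier; B's fmod sits a fixed
    number of octaves above A's (the study used separations up to 4 octaves)."""
    fmod_b = fmod_a * 2.0 ** octave_sep
    a = sam_tone(f_carrier, fmod_a, tone_dur)
    b = sam_tone(f_carrier, fmod_b, tone_dur)
    silence = np.zeros(int(gap * FS))
    rest = np.zeros(len(a) + len(silence))  # the "-" slot of ABA-
    triplet = np.concatenate([a, silence, b, silence, a, silence, rest])
    return np.tile(triplet, n_triplets)
```

Because A and B differ only in envelope, not in carrier, any neural segregation of the two streams must rest on the temporal modulation cue (and on spike-rate differences) rather than on spectral separation of the carriers, which is the point of the design.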


2014 ◽ Vol 78 (3) ◽ pp. 361-378 ◽ Author(s): Mona Isabel Spielmann, Erich Schröger, Sonja A. Kotz, Alexandra Bendixen
