scholarly journals Psychophysical evidence for auditory motion parallax

2018 ◽  
Vol 115 (16) ◽  
pp. 4264-4269 ◽  
Author(s):  
Daria Genzel ◽  
Michael Schutte ◽  
W. Owen Brimijoin ◽  
Paul R. MacNeilage ◽  
Lutz Wiegrebe

Distance is important: From an ecological perspective, knowledge about the distance to either prey or predator is vital. However, the distance of an unknown sound source is particularly difficult to assess, especially in anechoic environments. In vision, changes in perspective resulting from observer motion produce a reliable, consistent, and unambiguous impression of depth known as motion parallax. Here we demonstrate with formal psychophysics that humans can exploit auditory motion parallax, i.e., the change in the dynamic binaural cues elicited by self-motion, to assess the relative depths of two sound sources. Our data show that sensitivity to relative depth is best when subjects move actively; performance deteriorates when subjects are moved by a motion platform or when the sound sources themselves move. This is true even though the dynamic binaural cues elicited by these three types of motion are identical. Our data demonstrate a perceptual strategy to segregate intermittent sound sources in depth and highlight the tight interaction between self-motion and binaural processing that allows assessment of the spatial layout of complex acoustic scenes.

2021 ◽  
Author(s):  
HyungGoo Kim ◽  
Dora Angelaki ◽  
Gregory DeAngelis

Detecting objects that move in a scene is a fundamental computation performed by the visual system. This computation is greatly complicated by observer motion, which causes most objects to move across the retinal image. How the visual system detects scene-relative object motion during self-motion is poorly understood. Human behavioral studies suggest that the visual system may identify local conflicts between motion parallax and binocular disparity cues to depth, and may use these signals to detect moving objects. We describe a novel mechanism for performing this computation based on neurons in macaque area MT with incongruent depth tuning for binocular disparity and motion parallax cues. Neurons with incongruent tuning respond selectively to scene-relative object motion and their responses are predictive of perceptual decisions when animals are trained to detect a moving object during selfmotion. This finding establishes a novel functional role for neurons with incongruent tuning for multiple depth cues.


Perception ◽  
1997 ◽  
Vol 26 (1_suppl) ◽  
pp. 333-333
Author(s):  
D Runde ◽  
D Ruschin ◽  
E Feddersen

An observer moving in a natural environment is usually able to separate the constant changes of his retinal images in such a way that he perceives the environment and the changes of his observation point independently. The necessary and sufficient conditions to perceive a stable environment in spite of the retinal change produced by self-motion are, however, as yet unknown. We found that under certain conditions a scene that changes during observer motion can appear more stable than a rigid one. In our experiment a scene consisting of a number of LEDs distributed in a dark room was visible through a window. A mechanical device controlled by a head-tracker was used to move the LEDs during head motion to either reduce or enhance motion parallax by a predefined gain factor. The subjects rated the scene with respect to different attributes including apparent deformation and degree of motion perceived. They were also asked to adjust the parallax gain to the value of greatest apparent stability of the scene. Monocular as well as binocular trials were conducted and different fixation points were employed. The result was a general tendency in all conditions to perceive scene motion when the scene was in fact rigid and to perceive the greatest stability when the scene was distorted in such a way as to produce reduced motion parallax.


2018 ◽  
Vol 120 (6) ◽  
pp. 2939-2952 ◽  
Author(s):  
Samira Anderson ◽  
Robert Ellis ◽  
Julie Mehta ◽  
Matthew J. Goupell

The effects of aging and stimulus configuration on binaural masking level differences (BMLDs) were measured behaviorally and electrophysiologically, using the frequency-following response (FFR) to target brainstem/midbrain encoding. The tests were performed in 15 younger normal-hearing (<30 yr) and 15 older normal-hearing (>60 yr) participants. The stimuli consisted of a 500-Hz target tone embedded in a narrowband (50-Hz bandwidth) or wideband (1,500-Hz bandwidth) noise masker. The interaural phase conditions included NoSo (tone and noise presented interaurally in-phase), NoSπ (noise presented interaurally in-phase and tone presented out-of-phase), and NπSo (noise presented interaurally out-of-phase and tone presented in-phase) configurations. In the behavioral experiment, aging reduced the magnitude of the BMLD. The magnitude of the BMLD was smaller for the NoSo–NπSo threshold difference compared with the NoSo–NoSπ threshold difference, and it was also smaller in narrowband compared with wideband conditions, consistent with previous measurements. In the electrophysiology experiment, older participants had reduced FFR magnitudes and smaller differences between configurations. There were significant changes in FFR magnitude between the NoSo to NoSπ configurations but not between the NoSo to NπSo configurations. The age-related reduction in FFR magnitudes suggests a temporal processing deficit, but no correlation was found between FFR magnitudes and behavioral BMLDs. Therefore, independent mechanisms may be contributing to the behavioral and neural deficits. Specifically, older participants had higher behavioral thresholds than younger participants for the NoSπ and NπSo configurations but had equivalent thresholds for the NoSo configuration. However, FFR magnitudes were reduced in older participants across all configurations. NEW & NOTEWORTHY Behavioral and electrophysiological testing reveal an aging effect for stimuli presented in wideband and narrowband noise conditions, such that behavioral binaural masking level differences and subcortical spectral magnitudes are reduced in older compared with younger participants. These deficits in binaural processing may limit the older participant's ability to use spatial cues to understand speech in environments containing competing sound sources.


Perception ◽  
1998 ◽  
Vol 27 (8) ◽  
pp. 937-949 ◽  
Author(s):  
Takanao Yajima ◽  
Hiroyasu Ujike ◽  
Keiji Uchikawa

The two main questions addressed in this study were (a) what effect does yoking the relative expansion and contraction (EC) of retinal images to forward and backward head movements have on the resultant magnitude and stability of perceived depth, and (b) how does this relative EC image motion interact with the depth cues of motion parallax? Relative EC image motion was produced by moving a small CCD camera toward and away from the stimulus, two random-dot surfaces separated in depth, in synchrony with the observers' forward and backward head movements. Observers viewed the stimuli monocularly, on a helmet-mounted display, while moving their heads at various velocities, including zero velocity. The results showed that (a) the magnitude of perceived depth was smaller with smaller head velocities (<10 cm s−1), including the zero-head-velocity condition, than with a larger velocity (10 cm s−1), and (b) perceived depth, when motion parallax and the EC image motion cues were simultaneously presented, is equal to the greater of the two possible perceived depths produced from either of these two cues alone. The results suggested the role of nonvisual information of self-motion on perceiving depth.


2000 ◽  
Vol 83 (4) ◽  
pp. 2300-2314 ◽  
Author(s):  
U. Koch ◽  
B. Grothe

To date, most physiological studies that investigated binaural auditory processing have addressed the topic rather exclusively in the context of sound localization. However, there is strong psychophysical evidence that binaural processing serves more than only sound localization. This raises the question of how binaural processing of spatial cues interacts with cues important for feature detection. The temporal structure of a sound is one such feature important for sound recognition. As a first approach, we investigated the influence of binaural cues on temporal processing in the mammalian auditory system. Here, we present evidence that binaural cues, namely interaural intensity differences (IIDs), have profound effects on filter properties for stimulus periodicity of auditory midbrain neurons in the echolocating big brown bat, Eptesicus fuscus. Our data indicate that these effects are partially due to changes in strength and timing of binaural inhibitory inputs. We measured filter characteristics for the periodicity (modulation frequency) of sinusoidally frequency modulated sounds (SFM) under different binaural conditions. As criteria, we used 50% filter cutoff frequencies of modulation transfer functions based on discharge rate as well as synchronicity of discharge to the sound envelope. The binaural conditions were contralateral stimulation only, equal stimulation at both ears (IID = 0 dB), and more intense at the ipsilateral ear (IID = −20, −30 dB). In 32% of neurons, the range of modulation frequencies the neurons responded to changed considerably comparing monaural and binaural (IID =0) stimulation. Moreover, in ∼50% of neurons the range of modulation frequencies was narrower when the ipsilateral ear was favored (IID = −20) compared with equal stimulation at both ears (IID = 0). In ∼10% of the neurons synchronization differed when comparing different binaural cues. Blockade of the GABAergic or glycinergic inputs to the cells recorded from revealed that inhibitory inputs were at least partially responsible for the observed changes in SFM filtering. In 25% of the neurons, drug application abolished those changes. Experiments using electronically introduced interaural time differences showed that the strength of ipsilaterally evoked inhibition increased with increasing modulation frequencies in one third of the cells tested. Thus glycinergic and GABAergic inhibition is at least one source responsible for the observed interdependence of temporal structure of a sound and spatial cues.


Perception ◽  
1979 ◽  
Vol 8 (2) ◽  
pp. 125-134 ◽  
Author(s):  
Brian Rogers ◽  
Maureen Graham

The perspective transformations of the retinal image, produced by either the movement of an observer or the movement of objects in the visual world, were found to produce a reliable, consistent, and unambiguous impression of relative depth in the absence of all other cues to depth and distance. The stimulus displays consisted of computer-generated random-dot patterns that could be transformed by each movement of the observer or the display oscilloscope to simulate the relative movement information produced by a three-dimensional surface. Using a stereoscopic matching task, the second experiment showed that the perceived depth from parallax transformations is in close agreement with the degree of relative image displacement, as well as producing a compelling impression of three-dimensionality not unlike that found with random-dot stereograms.


Acta Acustica ◽  
2021 ◽  
Vol 5 ◽  
pp. 10
Author(s):  
Johannes M. Arend ◽  
Heinrich R. Liesefeld ◽  
Christoph Pörschmann

Nearby sound sources provide distinct binaural cues, mainly in the form of interaural level differences, which vary with respect to distance and azimuth. However, there is a long-standing controversy regarding whether humans can actually utilize binaural cues for distance estimation of nearby sources. Therefore, we conducted three experiments using non-individual binaural synthesis. In Experiment 1, subjects had to estimate the relative distance of loudness-normalized and non-normalized nearby sources in static and dynamic binaural rendering in a multi-stimulus comparison task under anechoic conditions. Loudness normalization was used as a plausible method to compensate for noticeable intensity differences between stimuli. With the employed loudness normalization, nominal distance did not significantly affect distance ratings for most conditions despite the presence of non-individual binaural distance cues. In Experiment 2, subjects had to judge the relative distance between loudness-normalized sources in dynamic binaural rendering in a forced-choice task. Below chance performance in this more sensitive task revealed that the employed loudness normalization strongly affected distance estimation. As this finding indicated a general issue with loudness normalization for studies on relative distance estimation, Experiment 3 directly tested the validity of loudness normalization and a frequently used amplitude normalization. Results showed that both normalization methods lead to remaining (incorrect) intensity cues, which subjects most likely used for relative distance estimation. The experiments revealed that both examined normalization methods have consequential drawbacks. These drawbacks might in parts explain conflicting findings regarding the effectiveness of binaural cues for relative distance estimation in the literature.


2008 ◽  
Vol 98 (4) ◽  
pp. 273-293 ◽  
Author(s):  
Douglas A. Hanes ◽  
Julia Keller ◽  
Gin McCollum

Perception ◽  
2018 ◽  
Vol 47 (12) ◽  
pp. 1179-1195 ◽  
Author(s):  
Julián Villegas ◽  
Naoki Fukasawa

Changes in frequency such as those found in Risset tones have been associated with moving sound sources in the vertical plane (Pratt effect) and the horizontal plane (Doppler illusion). We investigated the reported origin and motion of unspatialized Risset tones presented monotically and diotically, and Risset tones simulated to be in the sagittal or coronal plane, approaching or receding, from above or horizontally. Independent of the artificial spatialization used (none, spatializing frequency components collectively or individually, elevated or not), upward glissandi were more likely to be judged as approaching than receding, and downward glissandi as receding than approaching, in most cases from the horizon. Glissandi associations with horizontal movements were more common in stimuli simulated on the sagittal plane than in stimuli simulated on the coronal plane. These findings suggest that the Doppler illusion is stronger than the Pratt effect, at least for Risset tones presented over headphones and simulated to be in the sagittal plane. These findings may contribute to better understanding of the association between auditory motion perception and changes in frequency.


Sign in / Sign up

Export Citation Format

Share Document