Data-driven functional clustering reveals dominance of face, place, and body selectivity in the ventral visual pathway

2012
Vol 108 (8)
pp. 2306-2322
Author(s):
Edward Vul
Danial Lashkari
Po-Jang Hsieh
Polina Golland
Nancy Kanwisher

Regions selective for faces, places, and bodies feature prominently in the literature on the human ventral visual pathway. Are selectivities for these categories in fact the most robust response profiles in this pathway, or is their prominence an artifact of biased sampling of the hypothesis space in prior work? Here we use a data-driven structure discovery method that avoids the assumptions built into most prior work by 1) giving equal consideration to all possible response profiles over the conditions tested, 2) relaxing implicit anatomical constraints (that important functional profiles should manifest themselves in spatially contiguous voxels arising in similar locations across subjects), and 3) testing for dominant response profiles over images, rather than categories, thus enabling us to discover, rather than presume, the categories respected by the brain. Even with these assumptions relaxed, face, place, and body selectivity emerge as dominant in the ventral stream.
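The structure-discovery logic can be sketched in a few lines: treat each voxel's responses to individual images as a profile vector, normalize away overall responsiveness, and let an unsupervised algorithm propose the dominant profiles. The sketch below substitutes k-means for the paper's probabilistic mixture model; all shapes, counts, and variable names are illustrative stand-ins, not the study's data.

```python
# Minimal sketch of data-driven profile discovery: cluster voxel response
# profiles over individual images, substituting k-means for the paper's
# probabilistic mixture model. All shapes and names are illustrative.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
n_voxels, n_images = 5000, 69                          # hypothetical counts
responses = rng.standard_normal((n_voxels, n_images))  # stand-in response estimates

# Normalize each voxel's profile so clustering reflects the *shape* of the
# profile (selectivity), not overall responsiveness.
profiles = responses - responses.mean(axis=1, keepdims=True)
profiles /= np.linalg.norm(profiles, axis=1, keepdims=True)

km = KMeans(n_clusters=10, n_init=10, random_state=0).fit(profiles)

# Each cluster center is a candidate dominant response profile over images;
# the images that drive a center reveal the category it respects.
for k, center in enumerate(km.cluster_centers_):
    print(f"profile {k}: top image indices {np.argsort(center)[::-1][:5]}")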

2020
Author(s):
Haider Al-Tahan
Yalda Mohsenzadeh

Abstract
While vision evokes a dense network of feedforward and feedback neural processes in the brain, visual processes are primarily modeled with feedforward hierarchical neural networks, leaving the computational role of feedback processes poorly understood. Here, we developed a generative autoencoder neural network model and adversarially trained it on a categorically diverse data set of images. We hypothesized that the feedback processes in the ventral visual pathway can be represented by reconstruction of the visual information performed by the generative model. We compared representational similarity of the activity patterns in the proposed model with temporal (magnetoencephalography) and spatial (functional magnetic resonance imaging) visual brain responses. The proposed generative model identified two segregated neural dynamics in the visual brain: a temporal hierarchy of processes transforming low-level visual information into high-level semantics in the feedforward sweep, and a temporally later dynamic of inverse processes reconstructing low-level visual information from a high-level latent representation in the feedback sweep. Our results add to previous studies on neural feedback processes by presenting new insight into the algorithmic function of, and the information carried by, the feedback processes in the ventral visual pathway.

Author summary
It has been shown that the ventral visual cortex consists of a dense network of regions with feedforward and feedback connections. The feedforward path processes visual inputs along a hierarchy of cortical areas that starts in early visual cortex (an area tuned to low-level features, e.g., edges/corners) and ends in inferior temporal cortex (an area that responds to higher-level categorical contents, e.g., faces/objects). The feedback connections, in turn, modulate neuronal responses in this hierarchy by broadcasting information from higher to lower areas. In recent years, deep neural network models trained on object recognition tasks have achieved human-level performance and shown activation patterns similar to those of the visual brain. In this work, we developed a generative neural network model that consists of encoding and decoding sub-networks. By comparing this computational model with the human brain's temporal (magnetoencephalography) and spatial (functional magnetic resonance imaging) response patterns, we found that the encoder processes resemble the brain's feedforward processing dynamics and the decoder shares similarity with the brain's feedback processing dynamics. These results provide an algorithmic insight into the spatiotemporal dynamics of feedforward and feedback processes in biological vision.
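The model-brain comparison described here is representational similarity analysis: build a representational dissimilarity matrix (RDM) from a model layer's activations, build one from the brain responses at each MEG time point, and correlate the two. A minimal sketch with random stand-in data; sensor counts, layer width, and stimulus-set size are all hypothetical.

```python
# Sketch of the representational similarity comparison: correlate a model
# layer's representational dissimilarity matrix (RDM) with a brain RDM at
# each MEG time point. Random stand-in data; all sizes are hypothetical.
import numpy as np
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr

rng = np.random.default_rng(1)
n_stim = 30
layer_act = rng.standard_normal((n_stim, 512))   # one model layer's activations
meg = rng.standard_normal((n_stim, 306, 120))    # stimuli x sensors x time

model_rdm = pdist(layer_act, metric="correlation")   # condensed RDM

# One Spearman rho per time point: when does the brain look like this layer?
time_course = np.array([
    spearmanr(model_rdm, pdist(meg[:, :, t], metric="correlation")).correlation
    for t in range(meg.shape[2])
])
print("peak model-brain similarity at sample", time_course.argmax())
```

Running this for each encoder and each decoder layer, and comparing the latencies of the resulting peaks, is what would separate a feedforward from a feedback dynamic in the analysis described above.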


2020
Author(s):
Yaoda Xu
Maryam Vaziri-Pashkam

Abstract
Any given visual object input is characterized by multiple visual features, such as identity, position, and size. Despite the usefulness of identity and nonidentity features in vision and their joint coding throughout the primate ventral visual processing pathway, they have so far been studied relatively independently. Here we document the relative coding strength of object identity and nonidentity features in a brain region and how this changes across the human ventral visual pathway. We examined a total of four nonidentity features, including two Euclidean features (position and size) and two non-Euclidean features (image statistics and the spatial frequency content of an image). Overall, identity representation increased and nonidentity feature representation decreased along the ventral visual pathway, with identity outweighing the non-Euclidean features, but not the Euclidean ones, at higher levels of visual processing. A similar analysis was performed in 14 convolutional neural networks (CNNs) pretrained to perform object categorization, with varying architecture, depth, and with/without recurrent processing. While the relative coding strength of object identity and nonidentity features in lower CNN layers matched well with that in early human visual areas, the match between higher CNN layers and higher human visual regions was limited. Similar results were obtained regardless of whether a CNN was trained with real-world or stylized object images that emphasized shape representation. Together, by measuring the relative coding strength of object identity and nonidentity features, our approach provides a new tool for characterizing feature coding in the human brain and the correspondence between the brain and CNNs.

Significance statement
This study documented the relative coding strength of object identity compared with four types of nonidentity features along the human ventral visual processing pathway and compared brain responses with those of 14 CNNs pretrained to perform object categorization. Overall, identity representation increased and nonidentity feature representation decreased along the ventral visual pathway, with the coding strength of the different nonidentity features diverging at higher levels of visual processing. While feature coding in lower CNN layers matched well with that of early human visual areas, the match between higher CNN layers and higher human visual regions was limited. Our approach provides a new tool for characterizing feature coding in the human brain and the correspondence between the brain and CNNs.
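Relative coding strength of this kind can be approximated by decoding each feature from the same response patterns and comparing accuracies. Below is a hedged sketch with random stand-in data, using a linear SVM as the decoder and position as the example nonidentity feature; none of these particulars are from the study.

```python
# Hedged sketch of relative coding strength: decode object identity and one
# nonidentity feature (here, position) from the same patterns with a linear
# classifier, then compare accuracies. Data are random stand-ins.
import numpy as np
from sklearn.svm import LinearSVC
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(2)
n_trials, n_units = 400, 200
patterns = rng.standard_normal((n_trials, n_units))  # voxel or CNN-layer patterns
identity = rng.integers(0, 8, n_trials)              # 8 hypothetical identities
position = rng.integers(0, 2, n_trials)              # e.g., left vs. right

def decode(labels):
    clf = LinearSVC(max_iter=10000)
    return cross_val_score(clf, patterns, labels, cv=5).mean()

# A larger identity-minus-position margin at a given region or layer indicates
# relatively stronger identity coding at that stage of processing.
print(f"identity: {decode(identity):.2f}  position: {decode(position):.2f}")
```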


2016
Vol 371 (1697)
pp. 20150259
Author(s):
Bram-Ernst Verhoef
Rufin Vogels
Peter Janssen

One of the most powerful forms of depth perception capitalizes on the small relative displacements, or binocular disparities, in the images projected onto each eye. The brain employs these disparities to facilitate various computations, including sensori-motor transformations (reaching, grasping), scene segmentation and object recognition. In accordance with these different functions, disparity activates a large number of regions in the brain of both humans and monkeys. Here, we review how disparity processing evolves along different regions of the ventral visual pathway of macaques, emphasizing research based on both correlational and causal techniques. We will discuss the progression in the ventral pathway from a basic absolute disparity representation to a more complex three-dimensional shape code. We will show that, in the course of this evolution, the underlying neuronal activity becomes progressively more bound to the global perceptual experience. We argue that these observations most probably extend beyond disparity processing per se, and pertain to object processing in the ventral pathway in general. We conclude by posing some important unresolved questions whose answers may significantly advance the field, and broaden its scope. This article is part of the themed issue ‘Vision in our three-dimensional world’.
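As a concrete anchor for the geometry this review builds on (textbook small-angle approximation, not a formula from the article): the absolute disparity of a point at distance Z, while fixating at distance F with interocular separation I, is roughly I(1/F − 1/Z).

```python
# Textbook small-angle geometry (not from the article): absolute disparity of
# a point at distance Z, fixating at distance F, with interocular separation I.
import numpy as np

I = 0.065                       # interocular separation in meters (typical adult)
F = 1.0                         # fixation distance in meters
Z = np.array([0.5, 1.0, 2.0])   # object distances in meters

# Radians; positive = uncrossed (beyond fixation), negative = crossed (nearer).
eta = I * (1.0 / F - 1.0 / Z)
print(np.degrees(eta) * 60)     # disparity in arcminutes
```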


2019
Vol 9 (1)
Author(s):
J. Brendan Ritchie
Hans Op de Beeck

Abstract
A large number of neuroimaging studies have shown that information about object category can be decoded from regions of the ventral visual pathway. One question is how this information might be functionally exploited in the brain. In an attempt to help answer this question, some studies have adopted a neural distance-to-bound approach and shown that distance to a classifier decision boundary through neural activation space can be used to predict reaction times (RTs) on animacy categorization tasks. However, these experiments have not controlled for possible visual confounds, such as shape, in their stimulus design. In the present study we sought to determine whether, when animacy and shape properties are orthogonal, neural distance in low- and high-level visual cortex would predict categorization RTs, and whether a combination of animacy and shape distance might predict RTs when categories crisscrossed the two stimulus dimensions and so were not linearly separable. In line with previous results, we found that RTs correlated with neural distance, but only for animate stimuli, with similar, though weaker, asymmetric effects for the shape and crisscrossing tasks. Taken together, these results suggest there is potential to expand the neural distance-to-bound approach to other divisions beyond animacy and object category.
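In code, the distance-to-bound analysis reduces to three steps: fit a linear classifier on the activation patterns, read off each stimulus's distance from the decision hyperplane, and correlate those distances with behavioral RTs. A minimal sketch under random stand-in data; stimulus counts, voxel counts, and RT values are all hypothetical.

```python
# Minimal sketch of the neural distance-to-bound analysis: fit a linear
# classifier on activation patterns, take each stimulus's distance from the
# decision hyperplane, and correlate it with categorization RT. Random
# stand-in data; stimulus, voxel, and RT values are hypothetical.
import numpy as np
from sklearn.svm import LinearSVC
from scipy.stats import spearmanr

rng = np.random.default_rng(3)
n_stim, n_voxels = 48, 300
patterns = rng.standard_normal((n_stim, n_voxels))  # one pattern per stimulus
animacy = rng.integers(0, 2, n_stim)                # 0 = inanimate, 1 = animate
rts = rng.uniform(0.4, 0.9, n_stim)                 # mean RT per stimulus (s)

clf = LinearSVC(max_iter=10000).fit(patterns, animacy)
dist = np.abs(clf.decision_function(patterns))      # distance to the boundary

# The approach predicts that stimuli farther from the boundary are easier and
# therefore faster: a negative distance-RT correlation.
rho, p = spearmanr(dist, rts)
print(f"rho = {rho:.2f}, p = {p:.3f}")
```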


2021
Vol 11 (1)
Author(s):
Yunjun Nam
Takayuki Sato
Go Uchida
Ekaterina Malakhova
Shimon Ullman
...

Abstract
Humans recognize individual faces regardless of variation in facial view. The view-tuned face neurons in the inferior temporal (IT) cortex are regarded as the neural substrate for view-invariant face recognition. This study approximated the visual features encoded by these neurons as combinations of local orientations and colors derived from natural image fragments. The resultant features reproduced the preference of these neurons for particular facial views. We also found that faces of one identity were separable from faces of other identities in a space where each axis represented one of these features. These results suggest that view-invariant face representation is established by combining view-sensitive visual features. The face representation with these features suggests that, with respect to view-invariant face representation, the seemingly complex and deeply layered ventral visual pathway can be approximated by a shallow network comprising layers of low-level processing for local orientations and colors (V1/V2-level) and layers that detect particular sets of low-level elements derived from natural image fragments (IT-level).
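The two-stage shallow network suggested here can be caricatured in a few lines: a first stage computing V1/V2-like local orientation energy, and a second stage scoring stored natural-image fragments against those maps. The sketch below is a loose illustration under invented filters, fragments, and input (color channels omitted for brevity), not the study's actual feature set.

```python
# Loose caricature of the two-stage shallow network: stage 1 computes
# V1/V2-like local orientation energy; stage 2 scores stored natural-image
# fragments against those maps (IT-level). Filters, fragments, and the input
# image are invented stand-ins, not the study's features.
import numpy as np
from scipy.signal import correlate2d

rng = np.random.default_rng(4)
image = rng.random((64, 64))                 # stand-in grayscale face image

# Stage 1: two crude orientation bands from derivative filters.
kx = np.array([[-1.0, 0.0, 1.0]])            # horizontal gradient
gx = correlate2d(image, kx, mode="same")
gy = correlate2d(image, kx.T, mode="same")
energy = np.stack([np.abs(gx), np.abs(gy)])  # (2, 64, 64) orientation maps

# Stage 2: each fragment is a template over orientation space; its feature
# value is the best match anywhere in the image.
fragments = rng.random((20, 2, 8, 8))        # 20 hypothetical fragment templates

def fragment_score(frag):
    maps = sum(correlate2d(energy[c], frag[c], mode="valid") for c in range(2))
    return maps.max()

feature_vector = np.array([fragment_score(f) for f in fragments])
# In such a feature space, faces of one identity would occupy a separable
# region across views, as the abstract describes.
print(feature_vector.round(2))
```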


2019
Vol 31 (6)
pp. 821-836
Author(s):
Elliot Collins
Erez Freud
Jana M. Kainerstorfer
Jiaming Cao
Marlene Behrmann

Although shape perception is primarily considered a function of the ventral visual pathway, previous research has shown that both dorsal and ventral pathways represent shape information. Here, we examine whether the shape-selective electrophysiological signals observed in dorsal cortex are a product of the connectivity to ventral cortex or are independently computed. We conducted multiple EEG studies in which we manipulated the input parameters of the stimuli so as to bias processing to either the dorsal or ventral visual pathway. Participants viewed displays of common objects with shape information parametrically degraded across five levels. We measured shape sensitivity by regressing the amplitude of the evoked signal against the degree of stimulus scrambling. Experiment 1, which included grayscale versions of the stimuli, served as a benchmark establishing the temporal pattern of shape processing during typical object perception. These stimuli evoked broad and sustained patterns of shape sensitivity beginning as early as 50 msec after stimulus onset. In Experiments 2 and 3, we calibrated the stimuli such that visual information was delivered primarily through parvocellular inputs, which mainly project to the ventral pathway, or through koniocellular inputs, which mainly project to the dorsal pathway. In the second and third experiments, shape sensitivity was observed, but in distinct spatio-temporal configurations from each other and from that elicited by grayscale inputs. Of particular interest, in the koniocellular condition, shape selectivity emerged earlier than in the parvocellular condition. These findings support the conclusion of distinct dorsal pathway computations of object shape, independent from the ventral pathway.
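The regression-based index of shape sensitivity is simple to state in code: for every sensor and time point, estimate the slope of evoked amplitude against scrambling level; where the slope is reliably nonzero, the signal tracks shape. A sketch with random stand-in data; trial, sensor, and sample counts are hypothetical.

```python
# Sketch of the shape-sensitivity index: at each sensor and time point,
# regress evoked amplitude on the stimulus scrambling level. Random
# stand-in data; trial, sensor, and sample counts are hypothetical.
import numpy as np

rng = np.random.default_rng(5)
n_trials, n_chan, n_time = 500, 64, 200
eeg = rng.standard_normal((n_trials, n_chan, n_time))    # evoked amplitudes
scrambling = rng.integers(1, 6, n_trials).astype(float)  # 5 degradation levels

# Closed-form least-squares slope per (channel, time) cell.
x = scrambling - scrambling.mean()
slopes = np.einsum("i,ijk->jk", x, eeg - eeg.mean(axis=0)) / (x @ x)

# Comparing slope latencies across parvocellular- vs. koniocellular-biased
# stimuli is what distinguishes ventral from dorsal shape computations here.
peak = np.unravel_index(np.abs(slopes).argmax(), slopes.shape)
print("strongest shape sensitivity at (channel, sample) =", peak)
```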

