Mapping visual symbols onto spoken language along the ventral visual stream

2019 ◽  
Vol 116 (36) ◽  
pp. 17723-17728 ◽  
Author(s):  
J. S. H. Taylor ◽  
Matthew H. Davis ◽  
Kathleen Rastle

Reading involves transforming arbitrary visual symbols into sounds and meanings. This study interrogated the neural representations in ventral occipitotemporal cortex (vOT) that support this transformation process. Twenty-four adults learned to read 2 sets of 24 novel words that shared phonemes and semantic categories but were written in different artificial orthographies. Following 2 wk of training, participants read the trained words while neural activity was measured with functional MRI. Representational similarity analysis on item pairs from the same orthography revealed that right vOT and posterior regions of left vOT were sensitive to basic visual similarity. Left vOT encoded letter identity and representations became more invariant to position along a posterior-to-anterior hierarchy. Item pairs that shared sounds or meanings, but were written in different orthographies with no letters in common, evoked similar neural patterns in anterior left vOT. These results reveal a hierarchical, posterior-to-anterior gradient in vOT, in which representations of letters become increasingly invariant to position and are transformed to convey spoken language information.
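The representational similarity analysis described above can be sketched as follows. This is a minimal illustration on synthetic data; the item count matches the study's 24 trained words per orthography, but the voxel count, patterns, and model RDM are hypothetical placeholders, not the study's actual data or design:

```python
import numpy as np
from scipy.spatial.distance import pdist
from scipy.stats import spearmanr

rng = np.random.default_rng(0)

# Hypothetical data: 24 trained items x 100 vOT voxels
patterns = rng.normal(size=(24, 100))

# Neural RDM: correlation distance between every pair of item patterns
neural_rdm = pdist(patterns, metric="correlation")

# Hypothetical model RDM: 0 where two items share a phoneme or semantic
# category, 1 otherwise (here drawn at random for illustration)
model_rdm = rng.integers(0, 2, size=neural_rdm.shape).astype(float)

# RSA statistic: rank correlation between neural and model dissimilarities
rho, p = spearmanr(neural_rdm, model_rdm)
print(f"neural-model Spearman rho = {rho:.3f}")
```

In this framing, a region whose neural RDM correlates with a sound- or meaning-based model RDM, despite the items sharing no letters, is carrying spoken-language information.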

2021 ◽  
Author(s):  
Moritz Wurm ◽  
Alfonso Caramazza

The ventral visual stream is conceived as a pathway for object recognition. However, we also recognize the actions an object can be involved in. Here, we show that action recognition relies on a pathway in lateral occipitotemporal cortex, partially overlapping and topographically aligned with object representations that are precursors for action recognition. By contrast, object features that are more relevant for object recognition, such as color and texture, are restricted to medial areas of the ventral stream. We argue that the ventral stream bifurcates into lateral and medial pathways for action and object recognition, respectively. This account explains a number of observed phenomena, such as the duplication of object domains and the specific representational profiles in lateral and medial areas.


2017 ◽  
Author(s):  
Radoslaw M. Cichy ◽  
Nikolaus Kriegeskorte ◽  
Kamila M. Jozwik ◽  
Jasper J.F. van den Bosch ◽  
Ian Charest

Vision involves complex neuronal dynamics that link the sensory stream to behaviour. To capture the richness and complexity of the visual world and the behaviour it entails, we used an ecologically valid task with a rich set of real-world object images. We investigated how human brain activity, resolved in space with functional MRI and in time with magnetoencephalography, links the sensory stream to behavioural responses. We found that behaviour-related brain activity emerged rapidly in the ventral visual pathway within 200 ms of stimulus onset. The link between stimuli, brain activity, and behaviour could not be accounted for by either category membership or visual features (as provided by an artificial deep neural network model). Our results identify behaviourally relevant brain activity during object vision, and suggest that object representations guiding behaviour are complex and can be explained neither by visual features nor by semantic categories alone. Our findings support the view that visual representations in the ventral visual stream need to be understood in terms of their relevance to behaviour, and highlight the importance of complex behavioural assessment for human brain mapping.


2016 ◽  
Author(s):  
Edden M. Gerber ◽  
Tal Golan ◽  
Robert T. Knight ◽  
Leon Y. Deouell

Neural populations that encode sensory experience should be persistently active for as long as the experience persists. However, research into visual neural activity has focused almost exclusively on onset-driven responses that cannot account for sustained perception. We used intracranial recordings in humans to determine the degree to which the presence of a visual stimulus is persistently encoded by neural activity. The correspondence between stimulus duration and response duration was strongest in early visual cortex and gradually diminished along the visual hierarchy, such that it was weakest in inferior-temporal category-selective regions. A similar posterior-anterior gradient was found within inferior temporal face-selective regions, with posterior but not anterior sites showing persistent face-selective activity. The results suggest that regions that appear uniform in terms of their category selectivity are dissociated by how they temporally represent a stimulus in support of real-time experience, and delineate a large-scale organizing principle of the ventral visual stream.


2018 ◽  
Author(s):  
Diana C. Dima ◽  
Krish D. Singh

Humans can rapidly extract information from faces even in challenging viewing conditions, yet the neural representations supporting this ability are still not well understood. Here, we manipulated the presentation duration of backward-masked facial expressions and used magnetoencephalography (MEG) to investigate the computations underpinning rapid face processing. Multivariate analyses revealed two stages in face perception, with the ventral visual stream encoding facial features prior to facial configuration. When presentation time was reduced, the emergence of sustained featural and configural representations was delayed. Importantly, these representations explained behaviour during an expression recognition task. Together, these results describe the adaptable system linking visual features, brain and behaviour during face perception.


2014 ◽  
Vol 111 (1) ◽  
pp. 91-102 ◽  
Author(s):  
Leyla Isik ◽  
Ethan M. Meyers ◽  
Joel Z. Leibo ◽  
Tomaso Poggio

The human visual system can rapidly recognize objects despite transformations that alter their appearance. The precise timing of when the brain computes neural representations that are invariant to particular transformations, however, has not been mapped in humans. Here we employ magnetoencephalography decoding analysis to measure the dynamics of size- and position-invariant visual information development in the ventral visual stream. With this method we can read out the identity of objects beginning as early as 60 ms. Size- and position-invariant visual information appear around 125 ms and 150 ms, respectively, and both develop in stages, with invariance to smaller transformations arising before invariance to larger transformations. Additionally, the magnetoencephalography sensor activity localizes to neural sources that are in the most posterior occipital regions at the early decoding times and then move temporally as invariant information develops. These results provide previously unknown latencies for key stages of human invariant object recognition, as well as new and compelling evidence for a feed-forward hierarchical model of invariant object recognition where invariance increases at each successive visual area along the ventral stream.
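The time-resolved decoding approach above can be sketched as follows. The data are synthetic, and the trial, sensor, and timepoint counts, the injected effect, and the 60% threshold are illustrative assumptions, not the study's parameters:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(1)

# Hypothetical MEG data: trials x sensors x timepoints, two object identities
n_trials, n_sensors, n_times = 80, 30, 50
X = rng.normal(size=(n_trials, n_sensors, n_times))
y = rng.integers(0, 2, size=n_trials)

# Inject a weak class difference from "stimulus onset" (timepoint 20) onward
X[y == 1, :, 20:] += 0.5

# Time-resolved decoding: cross-validate a classifier at each timepoint
accuracy = np.array([
    cross_val_score(LogisticRegression(max_iter=1000), X[:, :, t], y, cv=5).mean()
    for t in range(n_times)
])

# First timepoint where decoding clears an (arbitrary) 60% threshold
onset = int(np.argmax(accuracy > 0.6))
print(f"decoding first exceeds 60% at timepoint {onset}")
```

The latency at which the accuracy curve first rises above chance is the kind of quantity the study reports (e.g., identity at ~60 ms, invariant information at ~125–150 ms).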


2019 ◽  
Author(s):  
Sushrut Thorat

A mediolateral gradation in neural responses for images spanning animals to artificial objects is observed in the ventral temporal cortex (VTC). Which information streams drive this organisation is an ongoing debate. Recently, in Proklova et al. (2016), the visual shape and category (“animacy”) dimensions in a set of stimuli were dissociated using a behavioural measure of visual feature information. fMRI responses revealed a neural cluster (extra-visual animacy cluster - xVAC) which encoded category information unexplained by visual feature information, suggesting extra-visual contributions to the organisation in the ventral visual stream. We reassess these findings using Convolutional Neural Networks (CNNs) as models for the ventral visual stream. The visual features developed in the CNN layers can categorise the shape-matched stimuli from Proklova et al. (2016), in contrast to the behavioural measures used in the study. The category organisations in xVAC and VTC are explained to a large degree by the CNN visual feature differences, casting doubt on the suggestion that visual feature differences cannot account for the animacy organisation. To inform the debate further, we designed a set of stimuli with animal images to dissociate the animacy organisation driven by the CNN visual features from the degree of familiarity and agency (thoughtfulness and feelings). Preliminary results from a new fMRI experiment designed to understand the contribution of these non-visual features are presented.
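The test of whether CNN visual features can categorise shape-matched stimuli reduces to fitting a linear readout on layer activations. A minimal sketch, in which the feature matrix is a random placeholder standing in for activations extracted from an actual CNN layer (all sizes are hypothetical):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(2)

# Placeholder for CNN-layer activations of shape-matched animate and
# inanimate stimuli: items x features (e.g. a late convolutional layer)
n_items, n_features = 48, 512
features = rng.normal(size=(n_items, n_features))
animacy = np.repeat([0, 1], n_items // 2)

# If a linear readout of these features separates animate from inanimate
# above chance, visual features alone can carry the animacy distinction
acc = cross_val_score(LogisticRegression(max_iter=1000),
                      features, animacy, cv=6).mean()
print(f"cross-validated animacy decoding from CNN features: {acc:.2f}")
```

With real CNN activations, above-chance accuracy here is what undercuts the claim that the animacy organisation must reflect extra-visual information.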


NeuroImage ◽  
2016 ◽  
Vol 128 ◽  
pp. 316-327 ◽  
Author(s):  
Marianna Boros ◽  
Jean-Luc Anton ◽  
Catherine Pech-Georgel ◽  
Jonathan Grainger ◽  
Marcin Szwed ◽  
...  

1998 ◽  
Vol 353 (1377) ◽  
pp. 1801-1818 ◽  
Author(s):  
N. K. Logothetis

Figures that can be seen in more than one way are invaluable tools for the study of the neural basis of visual awareness, because such stimuli permit the dissociation of the neural responses that underlie what we perceive at any given time from those forming the sensory representation of a visual pattern. To study the former type of responses, monkeys were subjected to binocular rivalry, and the response of neurons in a number of different visual areas was studied while the animals reported their alternating percepts by pulling levers. Perception–related modulations of neural activity were found to occur to different extents in different cortical visual areas. The cells that were affected by suppression were almost exclusively binocular, and their proportion was found to increase in the higher processing stages of the visual system. The strongest correlations between neural activity and perception were observed in the visual areas of the temporal lobe. A strikingly large number of neurons in the early visual areas remained active during the perceptual suppression of the stimulus, a finding suggesting that conscious visual perception might be mediated by only a subset of the cells exhibiting stimulus selective responses. These physiological findings, together with a number of recent psychophysical studies, offer a new explanation of the phenomenon of binocular rivalry. Indeed, rivalry has long been considered to be closely linked with binocular fusion and stereopsis, and the sequences of dominance and suppression have been viewed as the result of competition between the two monocular channels. The physiological data presented here are incompatible with this interpretation. Rather than reflecting interocular competition, the rivalry is most probably between the two different central neural representations generated by the dichoptically presented stimuli. 
The mechanisms of rivalry are probably the same as, or very similar to, those underlying multistable perception in general, and further physiological studies might reveal much about the neural mechanisms of our perceptual organization.


2018 ◽  
Author(s):  
Simona Monaco ◽  
Giulia Malfatti ◽  
Alessandro Zendron ◽  
Elisa Pellencin ◽  
Luca Turella

Predictions of upcoming movements are based on several types of neural signals that span the visual, somatosensory, motor and cognitive systems. Thus far, pre-movement signals have been investigated while participants viewed the object to be acted upon. Here, we studied the contribution of information other than vision to the classification of preparatory signals for action, even in the absence of online visual information. We used functional magnetic resonance imaging (fMRI) and multivoxel pattern analysis (MVPA) to test whether the neural signals evoked by visual, memory-based and somato-motor information can be reliably used to predict upcoming actions in areas of the dorsal and ventral visual stream during the preparatory phase preceding the action, while participants were lying still. Nineteen human participants (nine women) performed one of two actions towards an object with their eyes open or closed. Despite the well-known role of ventral stream areas in visual recognition tasks and the specialization of dorsal stream areas in somato-motor processes, we decoded action intention in areas of both streams based on visual, memory-based and somato-motor signals. Interestingly, we could reliably decode action intention in the absence of visual information based on neural activity evoked when visual information was available, and vice versa. Our results show a similar visual, memory and somato-motor representation of action planning in dorsal and ventral visual stream areas that allows predicting action intention across domains, regardless of the availability of visual information.
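The cross-condition decoding logic (train a classifier on one viewing condition, test it on the other) can be sketched as follows. The patterns are synthetic and all sizes, labels, and the shared "action code" are hypothetical assumptions for illustration:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(3)

# Hypothetical preparatory-phase voxel patterns: trials x voxels,
# labels = intended action (0 = grasp, 1 = reach)
n_trials, n_voxels = 60, 200
actions = rng.integers(0, 2, size=n_trials)

# A condition-general action code: the same pattern appears in both
# viewing conditions whenever action 1 is intended
signal = np.outer(actions, rng.normal(size=n_voxels))
eyes_open = signal + rng.normal(scale=2.0, size=(n_trials, n_voxels))
eyes_closed = signal + rng.normal(scale=2.0, size=(n_trials, n_voxels))

# Cross-condition decoding: train on eyes-open trials, test on eyes-closed.
# Above-chance transfer implies a representation of action intention that
# generalizes across the availability of visual information.
clf = LogisticRegression(max_iter=1000).fit(eyes_open, actions)
transfer_acc = clf.score(eyes_closed, actions)
print(f"open -> closed transfer accuracy: {transfer_acc:.2f}")
```

Running the transfer in both directions (and within each condition as a baseline) mirrors the study's claim of a shared representation across domains.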

