scholarly journals Global shape processing: Which parts form the whole?

2010 ◽  
Vol 10 (6) ◽  
pp. 16-16 ◽  
Author(s):  
J. Bell ◽  
S. Hancock ◽  
F. A. A. Kingdom ◽  
J. W. Peirce
2019 ◽  
Author(s):  
Adrien Doerig ◽  
Lynn Schmittwilken ◽  
Bilge Sayim ◽  
Mauro Manassi ◽  
Michael H. Herzog

AbstractClassically, visual processing is described as a cascade of local feedforward computations. Feedforward Convolutional Neural Networks (ffCNNs) have shown how powerful such models can be. However, using visual crowding as a well-controlled challenge, we previously showed that no classic model of vision, including ffCNNs, can explain human global shape processing (1). Here, we show that Capsule Neural Networks (CapsNets; 2), combining ffCNNs with recurrent grouping and segmentation, solve this challenge. We also show that ffCNNs and standard recurrent CNNs do not, suggesting that the grouping and segmentation capabilities of CapsNets are crucial. Furthermore, we provide psychophysical evidence that grouping and segmentation are implemented recurrently in humans, and show that CapsNets reproduce these results well. We discuss why recurrence seems needed to implement grouping and segmentation efficiently. Together, we provide mutually reinforcing psychophysical and computational evidence that a recurrent grouping and segmentation process is essential to understand the visual system and create better models that harness global shape computations.Author SummaryFeedforward Convolutional Neural Networks (ffCNNs) have revolutionized computer vision and are deeply transforming neuroscience. However, ffCNNs only roughly mimic human vision. There is a rapidly expanding body of literature investigating differences between humans and ffCNNs. Several findings suggest that, unlike humans, ffCNNs rely mostly on local visual features. Furthermore, ffCNNs lack recurrent connections, which abound in the brain. Here, we use visual crowding, a well-known psychophysical phenomenon, to investigate recurrent computations in global shape processing. Previously, we showed that no model based on the classic feedforward framework of vision can explain global effects in crowding. Here, we show that Capsule Neural Networks (CapsNets), combining ffCNNs with recurrent grouping and segmentation, solve this challenge. ffCNNs and recurrent CNNs with lateral and top-down recurrent connections do not, suggesting that grouping and segmentation are crucial for human-like global computations. Based on these results, we hypothesize that one computational function of recurrence is to efficiently implement grouping and segmentation. We provide psychophysical evidence that, indeed, grouping and segmentation is based on time consuming recurrent processes in the human brain. CapsNets reproduce these results too. Together, we provide mutually reinforcing computational and psychophysical evidence that a recurrent grouping and segmentation process is essential to understand the visual system and create better models that harness global shape computations.


2015 ◽  
Vol 15 (13) ◽  
pp. 18 ◽  
Author(s):  
Vanessa K. Bowden ◽  
J. Edwin Dickinson ◽  
Allison M. Fox ◽  
David R. Badcock

2011 ◽  
Vol 51 (15) ◽  
pp. 1760-1766 ◽  
Author(s):  
Jason Bell ◽  
Elena Gheorghiu ◽  
Robert F. Hess ◽  
Frederick A.A. Kingdom

2013 ◽  
Vol 54 (2) ◽  
pp. 1160 ◽  
Author(s):  
Doreen Wagner ◽  
Velitchko Manahilov ◽  
Gael E. Gordon ◽  
Gunter Loffler

2014 ◽  
Vol 14 (11) ◽  
pp. 12-12
Author(s):  
J. Bell ◽  
M. Forsyth ◽  
D. R. Badcock ◽  
F. A. A. Kingdom

2009 ◽  
Vol 80 (1) ◽  
pp. 162-177 ◽  
Author(s):  
K. Suzanne Scherf ◽  
Marlene Behrmann ◽  
Ruth Kimchi ◽  
Beatriz Luna

2008 ◽  
Vol 1 (2) ◽  
pp. 114-129 ◽  
Author(s):  
K. Suzanne Scherf ◽  
Beatriz Luna ◽  
Ruth Kimchi ◽  
Nancy Minshew ◽  
Marlene Behrmann

2019 ◽  
Vol 31 (6) ◽  
pp. 821-836 ◽  
Author(s):  
Elliot Collins ◽  
Erez Freud ◽  
Jana M. Kainerstorfer ◽  
Jiaming Cao ◽  
Marlene Behrmann

Although shape perception is primarily considered a function of the ventral visual pathway, previous research has shown that both dorsal and ventral pathways represent shape information. Here, we examine whether the shape-selective electrophysiological signals observed in dorsal cortex are a product of the connectivity to ventral cortex or are independently computed. We conducted multiple EEG studies in which we manipulated the input parameters of the stimuli so as to bias processing to either the dorsal or ventral visual pathway. Participants viewed displays of common objects with shape information parametrically degraded across five levels. We measured shape sensitivity by regressing the amplitude of the evoked signal against the degree of stimulus scrambling. Experiment 1, which included grayscale versions of the stimuli, served as a benchmark establishing the temporal pattern of shape processing during typical object perception. These stimuli evoked broad and sustained patterns of shape sensitivity beginning as early as 50 msec after stimulus onset. In Experiments 2 and 3, we calibrated the stimuli such that visual information was delivered primarily through parvocellular inputs, which mainly project to the ventral pathway, or through koniocellular inputs, which mainly project to the dorsal pathway. In the second and third experiments, shape sensitivity was observed, but in distinct spatio-temporal configurations from each other and from that elicited by grayscale inputs. Of particular interest, in the koniocellular condition, shape selectivity emerged earlier than in the parvocellular condition. These findings support the conclusion of distinct dorsal pathway computations of object shape, independent from the ventral pathway.


Sign in / Sign up

Export Citation Format

Share Document