View-dependent and view-independent properties in human object recognition

AbstractPurely parallel neural networks can model object recognition in brief displays – the same conditions under which illusory conjunctions (the incorrect combination of features into perceived objects in a stimulus array) have been demonstrated empirically (Treisman 1986; Treisman & Gelade 1980). Correcting errors of illusory conjunction is the “tag-assignment” problem for a purely parallel processor: the problem of assigning a spatial tag to nonspatial features, feature combinations, and objects. This problem must be solved to model human object recognition over a longer time scale. Our model simulates both the parallel processes that may underlie illusory conjunctions and the serial processes that may solve the tag-assignment problem in normal perception. One component of the model extracts pooled features and another provides attentional tags that correct illusory conjunctions. Our approach addresses two questions: (i) How can objects be identified from simultaneously attended features in a parallel, distributed representation? (ii) How can the spatial selectional requirements of such an attentional process be met by a separation of pathways for spatial and nonspatial processing? Our analysis of these questions yields a neurally plausible simulation of tag assignment based on synchronizing feature processing activity in a spatial focus of attention.

Download Full-text

Human Object Recognition: Appearance vs. Shape

Shape Perception in Human and Computer Vision ◽

10.1007/978-1-4471-5195-1_26 ◽

2013 ◽

pp. 387-397 ◽

Cited By ~ 2

Author(s):

Irving Biederman

Keyword(s):

Object Recognition ◽

Human Object

Download Full-text

Temporal Correlations in Presentation Order during Learning Affects Human Object Recognition

Perception ◽

10.1068/v970077 ◽

1997 ◽

Vol 26 (1_suppl) ◽

pp. 33-33

Author(s):

G M Wallis ◽

H H Bülthoff

Keyword(s):

Neural Network ◽

Object Recognition ◽

Recognition Performance ◽

Presentation Order ◽

Temporal Association ◽

Spatial Correlations ◽

Temporal Correlations ◽

Human Object ◽

Network Simulations ◽

Recognition Learning

The view-based approach to object recognition supposes that objects are stored as a series of associated views. Although representation of these views as combinations of 2-D features allows generalisation to similar views, it remains unclear how very different views might be associated together to allow recognition from any viewpoint. One cue present in the real world other than spatial similarity, is that we usually experience different objects in temporally constrained, coherent order, and not as randomly ordered snapshots. In a series of recent neural-network simulations, Wallis and Baddeley (1997 Neural Computation9 883 – 894) describe how the association of views on the basis of temporal as well as spatial correlations is both theoretically advantageous and biologically plausible. We describe an experiment aimed at testing their hypothesis in human object-recognition learning. We investigated recognition performance of faces previously presented in sequences. These sequences consisted of five views of five different people's faces, presented in orderly sequence from left to right profile in 45° steps. According to the temporal-association hypothesis, the visual system should associate the images together and represent them as different views of the same person's face, although in truth they are images of different people's faces. In a same/different task, subjects were asked to say whether two faces seen from different viewpoints were views of the same person or not. In accordance with theory, discrimination errors increased for those faces seen earlier in the same sequence as compared with those faces which were not ( p<0.05).

Download Full-text

Derivation of an Optimum and Allowable Range of Pan and Tilt Angles in External Sideway Views for Grasping and Placing Tasks in Unmanned Construction Based on Human Object Recognition

2019 IEEE/SICE International Symposium on System Integration (SII) ◽

10.1109/sii.2019.8700335 ◽

2019 ◽

Cited By ~ 1

Author(s):

Ryuya Sato ◽

Mitsuhiro Kamezaki ◽

Satoshi Niuchi ◽

Shigeki Sugano ◽

Hiroyasu Iwata

Keyword(s):

Object Recognition ◽

Human Object ◽

Allowable Range ◽

Grasping And Placing

Download Full-text

Resolving human object recognition in space and time

Nature Neuroscience ◽

10.1038/nn.3635 ◽

2014 ◽

Vol 17 (3) ◽

pp. 455-462 ◽

Cited By ~ 330

Author(s):

Radoslaw Martin Cichy ◽

Dimitrios Pantazis ◽

Aude Oliva

Keyword(s):

Object Recognition ◽

Space And Time ◽

Human Object

Download Full-text

The Role of Attention on Viewpoint-Invariant Object Recognition

Perception ◽

10.1068/v96l1106 ◽

1996 ◽

Vol 25 (1_suppl) ◽

pp. 148-148

Author(s):

B J Stankiewicz ◽

J E Hummel

Keyword(s):

Object Recognition ◽

Shape Representation ◽

Invariant Representation ◽

Shape Constancy ◽

Priming Paradigm ◽

Human Object ◽

Invariant Object Recognition ◽

Series Of Experiments ◽

Invariant Representations

Researchers in the field of visual perception have dedicated a great deal of effort to understanding how humans recognise known objects from novel viewpoints (often referred to as shape constancy). This research has produced a variety of theories—some that emphasise the use of invariant representations, others that emphasise alignment processes used in conjunction with viewpoint-specific representations. Although researchers disagree on the specifics of the representations and processes used during human object recognition, most agree that achieving shape constancy is computationally expensive—that is, it requires work. If it is assumed that attention provides the necessary resources for these computations, these theories suggest that recognition with attention should be qualitatively different from recognition without attention. Specifically, recognition with attention should be more invariant with viewpoint than recognition without attention. We recently reported a series of experiments, in which we used a response-time priming paradigm in which attention and viewpoint were manipulated, that showed attention is necessary for generating a representation of shape that is invariant with left-right reflection. We are now reporting new experiments showing that shape representation activated without attention is not completely view-specific. These experiments demonstrate that the automatic shape representation is invariant with the size and location of an image in the visual field. The results are reported in the context of a recent model proposed by Hummel and Stankiewicz ( Attention and Performance16 in press), as well as in the context of other models of human object recognition that make explicit predictions about the role of attention in generating a viewpoint-invariant representation of object shape.

Download Full-text