Translation-, Rotation-, Scale-, and Distortion-Invariant Object Recognition Through Self-Organization

1997 ◽  
Vol 08 (02) ◽  
pp. 173-179 ◽  
Author(s):  
Soheil Shams

The task of visual object recognition is often complicated by the fact that a single 3-D object can undergo a number of transformations which can substantially alter its projection onto a 2-D surface, such as the retina. Such transformations include translation of the object in the visual field, changes in the size of the object, its orientation in the 2-D plane and the viewing perspective. For a general pattern recognition system to detect and recognize an object after such transformations, it must be able to associate widely differing patterns with the same object label. In this paper, a novel self-organizing model, called the Multiple Elastic Modules (MEM), is presented which attempts to solve this problem by searching a multi-dimensional space, where each axis is defined by one of the transformations (e.g scale, translation, rotation, etc.). A particular object of a specific size, orientation and spatial location is mapped onto a single point in this space. Of course, distortions and minor variations in an object's image will expand this point to a small localized area in this multi-dimensional space. Such a powerful representation scheme comes at a cost of high computational demand due to the combinatorially large search space. The MEM approach to solving this problem efficiently partitions the solution space to search the most promising areas for the correct match. Simulation results are presented on detecting a stick-figure object under translation, distortion, scale, and rotation transformations in a cluttered background.

2009 ◽  
Vol 21 (7) ◽  
pp. 1952-1989 ◽  
Author(s):  
Günter Westphal ◽  
Rolf P. Würtz

We present an object recognition system built on a combination of feature- and correspondence-based pattern recognizers. The feature-based part, called preselection network, is a single-layer feedforward network weighted with the amount of information contributed by each feature to the decision at hand. For processing arbitrary objects, we employ small, regular graphs whose nodes are attributed with Gabor amplitudes, termed parquet graphs. The preselection network can quickly rule out most irrelevant matches and leaves only the ambiguous cases, so-called model candidates, to be verified by a rudimentary version of elastic graph matching, a standard correspondence-based technique for face and object recognition. According to the model, graphs are constructed that describe the object in the input image well. We report the results of experiments on standard databases for object recognition. The method achieved high recognition rates on identity and pose. Unlike many other models, it can also cope with varying background, multiple objects, and partial occlusion.


2021 ◽  
Vol 2021 ◽  
pp. 1-13
Author(s):  
Zahra Sadat Shariatmadar ◽  
Karim Faez

Autonomous object recognition in images is one of the most critical topics in security and commercial applications. Due to recent advances in visual neuroscience, the researchers tend to extend biologically plausible schemes to improve the accuracy of object recognition. Preprocessing is one part of the visual recognition system that has received much less attention. In this paper, we propose a new, simple, and biologically inspired pre processing technique by using the data-driven mechanism of visual attention. In this part, the responses of Retinal Ganglion Cells (RGCs) are simulated. After obtaining these responses, an efficient threshold is selected. Then, the points of the raw image with the most information are extracted according to it. Then, the new images with these points are created, and finally, by combining these images with entropy coefficients, the most salient object is located. After extracting appropriate features, the classifier categorizes the initial image into one of the predefined object categories. Our system was evaluated on the Caltech-101 dataset. Experimental results demonstrate the efficacy and effectiveness of this novel method of preprocessing.


2007 ◽  
Author(s):  
K. Suzanne Scherf ◽  
Marlene Behrmann ◽  
Kate Humphreys ◽  
Beatriz Luna

Sign in / Sign up

Export Citation Format

Share Document