Comparing Object Recognition in Humans and Deep Convolutional Neural Networks—An Eye Tracking Study

Deep convolutional neural networks (DCNNs) and the ventral visual pathway share vast architectural and functional similarities in visual challenges such as object recognition. Recent insights have demonstrated that both hierarchical cascades can be compared in terms of both exerted behavior and underlying activation. However, these approaches ignore key differences in spatial priorities of information processing. In this proof-of-concept study, we demonstrate a comparison of human observers (N = 45) and three feedforward DCNNs through eye tracking and saliency maps. The results reveal fundamentally different resolutions in both visualization methods that need to be considered for an insightful comparison. Moreover, we provide evidence that a DCNN with biologically plausible receptive field sizes called vNet reveals higher agreement with human viewing behavior as contrasted with a standard ResNet architecture. We find that image-specific factors such as category, animacy, arousal, and valence have a direct link to the agreement of spatial object recognition priorities in humans and DCNNs, while other measures such as difficulty and general image properties do not. With this approach, we try to open up new perspectives at the intersection of biological and computer vision research.

Download Full-text

Human and bird detection and classification based on Doppler radar spectrograms and vision images using convolutional neural networks

International Journal of Advanced Robotic Systems ◽

10.1177/17298814211010569 ◽

2021 ◽

Vol 18 (3) ◽

pp. 172988142110105

Author(s):

Jnana Sai Abhishek Varma Gokaraju ◽

Weon Keun Song ◽

Min-Ho Ka ◽

Somyot Kaitwanidvilai

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Doppler Radar ◽

Autonomous Vehicle ◽

Sampling Point ◽

Body Segment ◽

Kinematic Modeling ◽

Deep Convolutional Neural Networks ◽

Kinematic Models

The study investigated object detection and classification based on both Doppler radar spectrograms and vision images using two deep convolutional neural networks. The kinematic models for a walking human and a bird flapping its wings were incorporated into MATLAB simulations to create data sets. The dynamic simulator identified the final position of each ellipsoidal body segment taking its rotational motion into consideration in addition to its bulk motion at each sampling point to describe its specific motion naturally. The total motion induced a micro-Doppler effect and created a micro-Doppler signature that varied in response to changes in the input parameters, such as varying body segment size, velocity, and radar location. Micro-Doppler signature identification of the radar signals returned from the target objects that were animated by the simulator required kinematic modeling based on a short-time Fourier transform analysis of the signals. Both You Only Look Once V3 and Inception V3 were used for the detection and classification of the objects with different red, green, blue colors on black or white backgrounds. The results suggested that clear micro-Doppler signature image-based object recognition could be achieved in low-visibility conditions. This feasibility study demonstrated the application possibility of Doppler radar to autonomous vehicle driving as a backup sensor for cameras in darkness. In this study, the first successful attempt of animated kinematic models and their synchronized radar spectrograms to object recognition was made.

Download Full-text

RGB-D object recognition with multimodal deep convolutional neural networks

2017 IEEE International Conference on Multimedia and Expo (ICME) ◽

10.1109/icme.2017.8019538 ◽

2017 ◽

Cited By ~ 12

Author(s):

Mohammad Muntasir Rahman ◽

Yanhao Tan ◽

Jian Xue ◽

Ke Lu

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks

Download Full-text

STDP-based spiking deep convolutional neural networks for object recognition

Neural Networks ◽

10.1016/j.neunet.2017.12.005 ◽

2018 ◽

Vol 99 ◽

pp. 56-67 ◽

Cited By ~ 151

Author(s):

Saeed Reza Kheradpisheh ◽

Mohammad Ganjtabesh ◽

Simon J. Thorpe ◽

Timothée Masquelier

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks

Download Full-text

Two-Stage Ensemble of Deep Convolutional Neural Networks for Object Recognition

2018 International Conference on Intelligent Rail Transportation (ICIRT) ◽

10.1109/icirt.2018.8641581 ◽

2018 ◽

Author(s):

Rom Uddamvathanak ◽

Feng Yang ◽

Xulei Yang ◽

Ankit Kumar Das ◽

Yan Shen ◽

...

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks ◽

Two Stage

Download Full-text

Handwritten Bangla Character Recognition Using the State-of-the-Art Deep Convolutional Neural Networks

Computational Intelligence and Neuroscience ◽

10.1155/2018/6747098 ◽

2018 ◽

Vol 2018 ◽

pp. 1-13 ◽

Cited By ~ 18

Author(s):

Md Zahangir Alom ◽

Paheding Sidike ◽

Mahmudul Hasan ◽

Tarek M. Taha ◽

Vijayan K. Asari

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Character Recognition ◽

State Of The Art ◽

The State ◽

Superior Performance ◽

Deep Convolutional Neural Networks ◽

Practical Applications ◽

High Degree

In spite of advances in object recognition technology, handwritten Bangla character recognition (HBCR) remains largely unsolved due to the presence of many ambiguous handwritten characters and excessively cursive Bangla handwritings. Even many advanced existing methods do not lead to satisfactory performance in practice that related to HBCR. In this paper, a set of the state-of-the-art deep convolutional neural networks (DCNNs) is discussed and their performance on the application of HBCR is systematically evaluated. The main advantage of DCNN approaches is that they can extract discriminative features from raw data and represent them with a high degree of invariance to object distortions. The experimental results show the superior performance of DCNN models compared with the other popular object recognition approaches, which implies DCNN can be a good candidate for building an automatic HBCR system for practical applications.

Download Full-text

Object recognition using deep convolutional neural networks with complete transfer and partial frozen layers

Optics and Photonics for Counterterrorism, Crime Fighting, and Defence XII ◽

10.1117/12.2241177 ◽

2016 ◽

Cited By ~ 3

Author(s):

Maarten Kruithof ◽

Henri Bouma ◽

Noelle Fischer ◽

Klamer Schutte

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks

Download Full-text

Semantic Relatedness Emerges in Deep Convolutional Neural Networks Designed for Object Recognition

Frontiers in Computational Neuroscience ◽

10.3389/fncom.2021.625804 ◽

2021 ◽

Vol 15 ◽

Author(s):

Taicheng Huang ◽

Zonglei Zhen ◽

Jia Liu

Keyword(s):

Neural Networks ◽

Object Recognition ◽

Convolutional Neural Networks ◽

Hierarchical Structure ◽

Semantic Relatedness ◽

Developmental Trajectory ◽

Top Down ◽

Deep Convolutional Neural Networks ◽

Object Categories ◽

The Hierarchical Structure

Human not only can effortlessly recognize objects, but also characterize object categories into semantic concepts with a nested hierarchical structure. One dominant view is that top-down conceptual guidance is necessary to form such hierarchy. Here we challenged this idea by examining whether deep convolutional neural networks (DCNNs) could learn relations among objects purely based on bottom-up perceptual experience of objects through training for object categorization. Specifically, we explored representational similarity among objects in a typical DCNN (e.g., AlexNet), and found that representations of object categories were organized in a hierarchical fashion, suggesting that the relatedness among objects emerged automatically when learning to recognize them. Critically, the emerged relatedness of objects in the DCNN was highly similar to the WordNet in human, implying that top-down conceptual guidance may not be a prerequisite for human learning the relatedness among objects. In addition, the developmental trajectory of the relatedness among objects during training revealed that the hierarchical structure was constructed in a coarse-to-fine fashion, and evolved into maturity before the establishment of object recognition ability. Finally, the fineness of the relatedness was greatly shaped by the demand of tasks that the DCNN performed, as the higher superordinate level of object classification was, the coarser the hierarchical structure of the relatedness emerged. Taken together, our study provides the first empirical evidence that semantic relatedness of objects emerged as a by-product of object recognition in DCNNs, implying that human may acquire semantic knowledge on objects without explicit top-down conceptual guidance.

Download Full-text