Visual Object Class Recognition

2016 ◽  
pp. 825-840
Author(s):  
Michael Stark ◽  
Bernt Schiele ◽  
Aleš Leonardis
2014 ◽  
Vol 9 (9) ◽  
pp. 1590
Author(s):  
Noridayu Manshor ◽  
Amir Rizaan Abdul Rahiman ◽  
Raja Azlina Raja Mahmood

Author(s):  
Salar Awan ◽  
Mustafa Muhamad ◽  
Kresimir Kusevic ◽  
Paul Mrstik ◽  
Michael Greenspan

2019 ◽  
Vol 9 (9) ◽  
pp. 1829 ◽  
Author(s):  
Jie Jiang ◽  
Hui Xu ◽  
Shichang Zhang ◽  
Yujie Fang

This study proposes a multiheaded object detection algorithm referred to as MANet. The main purpose of the study is to integrate feature layers of different scales based on the attention mechanism and to enhance contextual connections. To achieve this, we first replaced the feed-forward base network of the single-shot detector with the ResNet–101 (inspired by the Deconvolutional Single-Shot Detector) and then applied linear interpolation and the attention mechanism. The information of the feature layers at different scales was fused to improve the accuracy of target detection. The primary contributions of this study are the propositions of (a) a fusion attention mechanism, and (b) a multiheaded attention fusion method. Our final MANet detector model effectively unifies the feature information among the feature layers at different scales, thus enabling it to detect objects with different sizes and with higher precision. We used the 512 × 512 input MANet (the backbone is ResNet–101) to obtain a mean accuracy of 82.7% based on the PASCAL visual object class 2007 test. These results demonstrated that our proposed method yielded better accuracy than those provided by the conventional Single-shot detector (SSD) and other advanced detectors.


Author(s):  
Josef Sivic ◽  
Bryan C. Russell ◽  
Andrew Zisserman ◽  
William T. Freeman ◽  
Alexei A. Efros

Sign in / Sign up

Export Citation Format

Share Document