Saliency Map Prediction using a Method of Object Detection

Author(s):  
Tsuyoshi Kushima ◽  
Masaki Hisano
Author(s):  
Adhi Prahara ◽  
Murinto Murinto ◽  
Dewi Pramudi Ismi

Human visual attention is explained scientifically in cognitive psychology and neuroscience and modeled computationally in computer science and engineering. Visual attention models have been applied in computer vision systems such as object detection, object recognition, image segmentation, image and video compression, action recognition, and visual tracking. This work studies bottom-up visual attention, namely human fixation prediction and salient object detection models. The preliminary study covers the biological perspective of visual attention, including the visual pathway and theories of visual attention, through to computational models of bottom-up visual attention that generate saliency maps. The study compares models at each stage and observes whether each stage is inspired by the biological architecture, concepts, or behavior of human visual attention. From the study, the use of low-level features, center-surround mechanisms, sparse representation, and higher-level guidance with intrinsic cues dominates bottom-up visual attention approaches. The study also highlights the correlation between bottom-up visual attention and curiosity.
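The center-surround mechanism mentioned above can be illustrated with a minimal numpy sketch (an illustrative simplification, not any specific model from the survey): a pixel is salient when its fine-scale local mean differs from its coarse-scale surround mean.

```python
import numpy as np

def box_mean(img, r):
    """Mean filter of radius r via an integral image (edges replicated)."""
    padded = np.pad(img, r, mode="edge")
    ii = np.cumsum(np.cumsum(padded, axis=0), axis=1)
    ii = np.pad(ii, ((1, 0), (1, 0)))          # zero row/col so windows index cleanly
    h, w = img.shape
    k = 2 * r + 1
    s = ii[k:k + h, k:k + w] - ii[:h, k:k + w] - ii[k:k + h, :w] + ii[:h, :w]
    return s / (k * k)

def center_surround(img, rc=1, rs=5):
    """Center-surround response: |fine-scale mean - coarse-scale surround mean|."""
    return np.abs(box_mean(img, rc) - box_mean(img, rs))
```

A uniform image produces zero response everywhere, while an isolated bright spot produces a strong local response, which is the qualitative behavior the center-surround operator is meant to capture.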


2020 ◽  
Vol 34 (07) ◽  
pp. 10599-10606 ◽  
Author(s):  
Zuyao Chen ◽  
Qianqian Xu ◽  
Runmin Cong ◽  
Qingming Huang

Deep convolutional neural networks have achieved competitive performance in salient object detection, where learning effective and comprehensive features plays a critical role. Most previous works adopted multi-level feature integration yet ignored the gap between different features. Besides, high-level features are diluted as they are passed along the top-down pathway. To remedy these issues, we propose a novel network named GCPANet that effectively integrates low-level appearance features, high-level semantic features, and global context features through progressive, context-aware Feature Interweaved Aggregation (FIA) modules and generates the saliency map in a supervised way. Moreover, a Head Attention (HA) module is used to reduce information redundancy and enhance top-layer features by leveraging spatial and channel-wise attention, and a Self Refinement (SR) module is utilized to further refine and heighten the input features. Furthermore, we design a Global Context Flow (GCF) module to generate global context information at different stages, which aims to learn the relationship among different salient regions and alleviate the dilution of high-level features. Experimental results on six benchmark datasets demonstrate that the proposed approach outperforms state-of-the-art methods both quantitatively and qualitatively.
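As a rough illustration of what spatial plus channel-wise attention does to a feature map (a hedged numpy sketch; the actual HA module is a learned network layer, and the weighting scheme below is an assumption):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def head_attention(feat):
    """Illustrative channel-then-spatial attention on a (C, H, W) feature map.
    Channel weights come from global average pooling; spatial weights from the
    channel-weighted mean activation, squashed to (0, 1) with a sigmoid."""
    C, H, W = feat.shape
    # channel attention: per-channel global average, softmax-normalised
    ch = softmax(feat.mean(axis=(1, 2)))             # (C,)
    feat_c = feat * ch[:, None, None] * C            # re-scale to keep magnitude
    # spatial attention: sigmoid of the mean over channels
    sp = 1.0 / (1.0 + np.exp(-feat_c.mean(axis=0)))  # (H, W)
    return feat_c * sp[None, :, :]
```

The point of the two-stage weighting is that uninformative channels and spatial positions are suppressed before features flow further down the decoder.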


2017 ◽  
Vol 2017 ◽  
pp. 1-9 ◽  
Author(s):  
Kan Huang ◽  
Yong Zhang ◽  
Bo Lv ◽  
Yongbiao Shi

Automatic estimation of salient objects without any prior knowledge tends to greatly enhance many computer vision tasks. This paper proposes a novel bottom-up framework for salient object detection that first models the background and then separates salient objects from it. We model the background distribution with a feature-clustering algorithm, which fully exploits the statistical and structural information of the background. A coarse saliency map is then generated according to the background distribution. To be more discriminative, the coarse saliency map is enhanced by a two-step refinement composed of edge-preserving element-level filtering and upsampling based on geodesic distance. We provide an extensive evaluation and show that our proposed method performs favorably against other outstanding methods on the two most commonly used datasets. Most importantly, the proposed approach is demonstrated to highlight the salient object more uniformly and to be robust to background noise.
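The background-modeling idea can be sketched as follows (an illustrative simplification, not the paper's exact algorithm): cluster image-border pixels as a background model, then score every pixel by its distance to the nearest background cluster centre.

```python
import numpy as np

def kmeans(X, k, iters=20, seed=0):
    """Plain k-means on row vectors X, (n, d)."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), k, replace=False)]
    for _ in range(iters):
        d = ((X[:, None, :] - centers[None]) ** 2).sum(-1)
        lab = d.argmin(1)
        for j in range(k):
            if (lab == j).any():
                centers[j] = X[lab == j].mean(0)
    return centers

def coarse_saliency(img, k=3, border=2):
    """Coarse saliency from a clustered background model: border pixels are
    assumed to be background; saliency = distance to nearest background centre."""
    h, w = img.shape[:2]
    feats = img.reshape(h, w, -1).astype(float)
    mask = np.zeros((h, w), bool)
    mask[:border] = mask[-border:] = mask[:, :border] = mask[:, -border:] = True
    centers = kmeans(feats[mask], k)
    d = ((feats[:, :, None, :] - centers[None, None]) ** 2).sum(-1).min(-1)
    return d / (d.max() + 1e-12)
```

An object whose appearance differs from the border statistics receives a high score, which is then a reasonable starting point for the refinement steps the abstract describes.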


Author(s):  
M. Li ◽  
X. Zhao ◽  
J. Li ◽  
D. Zhu

Abstract. In this paper, we propose a method of object detection based on thermal images acquired from unmanned aerial vehicles (UAVs). Compared with visible images, thermal images have lower requirements on illumination conditions, but they suffer from problems such as blurred edges and low contrast. To address these problems, we propose using the saliency map of a thermal image for image enhancement, acting as the attention mechanism of the object detector. In this paper, the YOLOv3 network is trained as a detection benchmark, and BASNet is used to generate saliency maps from the thermal images. We fuse the thermal images with their corresponding saliency maps through pixel-level weighted fusion. Experimental results on real data show that the proposed method can perform object detection in UAV-borne thermal images: the average precision (AP) for pedestrians and vehicles increases by 4.5% and 2.6%, respectively, compared with the YOLOv3 baseline trained on thermal images alone. The proposed model provides reliable technical support for the application of thermal imagery on UAV platforms.
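The pixel-level weighted fusion step might look like the following sketch (the blending formula is an assumption for illustration, not necessarily the paper's exact weights): salient pixels keep their thermal intensity while non-salient pixels are attenuated.

```python
import numpy as np

def fuse(thermal, saliency, alpha=0.4):
    """Pixel-level weighted fusion of a thermal image with its saliency map.
    alpha controls how strongly saliency modulates the image: where
    saliency == 1 the pixel is unchanged; where saliency == 0 it is
    attenuated to (1 - alpha) of its value."""
    t = thermal.astype(float)
    s = np.clip(saliency.astype(float), 0.0, 1.0)
    return (1.0 - alpha) * t + alpha * t * s
```

The fused image emphasises the salient regions that BASNet-style models would highlight, which is the attention effect the abstract describes feeding into the detector.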


2021 ◽  
Vol 13 (23) ◽  
pp. 4941
Author(s):  
Rukhshanda Hussain ◽  
Yash Karbhari ◽  
Muhammad Fazal Ijaz ◽  
Marcin Woźniak ◽  
Pawan Kumar Singh ◽  
...  

Recently, deep learning-based methods, especially those using fully convolutional neural networks, have shown extraordinary performance in salient object detection. Despite this success, clean boundary detection of salient objects remains challenging. Most contemporary methods rely on dedicated edge-detection modules to avoid noisy boundaries. In this work, we propose extracting finer semantic features from multiple encoding layers and attentively re-utilizing them when generating the final segmentation result. The proposed Revise-Net model is divided into three parts: (a) a prediction module, (b) a residual enhancement module (REM), and (c) reverse attention modules. Firstly, we generate a coarse saliency map through the prediction module, which is then refined in the enhancement module. Finally, multiple reverse attention modules at varying scales are cascaded between the two networks to guide the prediction module, employing the intermediate segmentation maps generated at each downsampling level of the REM. Our method efficiently classifies boundary pixels using a combination of binary cross-entropy, similarity-index, and intersection-over-union losses at the pixel, patch, and map levels, thereby effectively segmenting the salient objects in an image. In comparison with several state-of-the-art frameworks, our proposed Revise-Net model outperforms them by a significant margin on three publicly available datasets, DUTS-TE, ECSSD, and HKU-IS, on both regional and boundary estimation measures.
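The combined loss can be sketched in numpy (a simplification: the similarity term below is computed globally over the whole map rather than patch-wise as SSIM would be, and the exact weighting of the three terms is an assumption):

```python
import numpy as np

def bce(p, g, eps=1e-7):
    """Pixel-level binary cross-entropy between prediction p and ground truth g."""
    p = np.clip(p, eps, 1 - eps)
    return float(-(g * np.log(p) + (1 - g) * np.log(1 - p)).mean())

def iou_loss(p, g, eps=1e-7):
    """Map-level loss: 1 - soft intersection-over-union."""
    inter = (p * g).sum()
    union = (p + g - p * g).sum()
    return float(1.0 - (inter + eps) / (union + eps))

def ssim_like(p, g, eps=1e-7):
    """Similarity-index loss, computed globally (SSIM proper is patch-wise)."""
    mp, mg = p.mean(), g.mean()
    vp, vg = p.var(), g.var()
    cov = ((p - mp) * (g - mg)).mean()
    sim = ((2 * mp * mg + eps) * (2 * cov + eps)) / ((mp**2 + mg**2 + eps) * (vp + vg + eps))
    return float(1.0 - sim)

def hybrid_loss(p, g):
    """Unweighted sum of the three terms, one per supervision level."""
    return bce(p, g) + ssim_like(p, g) + iou_loss(p, g)
```

A near-perfect prediction drives all three terms toward zero, while an inverted prediction is penalised by all of them at once, which is the complementary-supervision effect hybrid losses of this kind are designed for.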


Author(s):  
Tam V. Nguyen ◽  
Luoqi Liu

Salient object detection has become an increasingly popular topic in cognitive and computational sciences, including computer vision and artificial intelligence research. In this paper, we propose integrating semantic priors into the salient object detection process. Our algorithm consists of three basic steps. Firstly, an explicit saliency map is obtained from semantic segmentation refined by explicit saliency priors learned from the data. Next, an implicit saliency map is computed by a trained model that maps the implicit saliency priors embedded in regional features to saliency values. Finally, the explicit and implicit maps are adaptively fused to form a pixel-accurate saliency map that uniformly covers the objects of interest. We evaluate the proposed framework on two challenging datasets, namely ECSSD and HKU-IS. Extensive experimental results demonstrate that our method outperforms other state-of-the-art methods.
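The adaptive fusion step could be illustrated as follows (a hand-crafted confidence weighting purely for illustration; the paper learns its fusion from data): each map is weighted by how far its values sit from the uninformative 0.5 level.

```python
import numpy as np

def adaptive_fuse(explicit, implicit, eps=1e-12):
    """Fuse two saliency maps in [0, 1], weighting each by its 'confidence',
    measured here as the mean distance of its values from 0.5."""
    we = np.abs(explicit - 0.5).mean() + eps
    wi = np.abs(implicit - 0.5).mean() + eps
    fused = (we * explicit + wi * implicit) / (we + wi)
    return np.clip(fused, 0.0, 1.0)
```

If one map is decisive and the other is near-uniform at 0.5, the fused result follows the decisive map, which is the behaviour an adaptive fusion should exhibit.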


2020 ◽  
Vol 2020 ◽  
pp. 1-12 ◽  
Author(s):  
Yuantao Chen ◽  
Jiajun Tao ◽  
Qian Zhang ◽  
Kai Yang ◽  
Xi Chen ◽  
...  

To address the heavy background noise, low accuracy, and high computational complexity of current salient object detection methods, this paper proposes a visual saliency detection algorithm based on Hierarchical Principal Component Analysis (HPCA). Firstly, the original RGB image is converted to grayscale and divided into eight layers by bit-surface stratification; each layer contains salient-object information matching that layer's image features. Secondly, taking the color structure of the original image as a reference, the grayscale image is reassigned by grayscale-color conversion, so that the layered image not only reflects the original structural features but also effectively preserves the original color features. Thirdly, Principal Component Analysis (PCA) is performed on each layered image to obtain its structural-difference and color-difference characteristics in the principal component direction. Fourthly, the two features are integrated to obtain a highly robust saliency map; to further refine the results, known priors on image organization are incorporated, which place the subject of a photograph near the center of the image. Finally, an entropy calculation selects the optimal image from the layered saliency maps; the optimal map has the least background information and the most prominent salient objects. The detection results of the proposed model are closer to the ground truth and compare favorably on performance metrics including precision (PRE), recall (REC), and F-measure (FME). The HPCA model markedly reduces the interference of redundant information, effectively separates the salient object from the background, and achieves higher detection accuracy than competing methods.
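The bit-plane stratification and entropy-based selection steps can be sketched with numpy (illustrative; the full HPCA pipeline also involves grayscale-color conversion and per-layer PCA, omitted here):

```python
import numpy as np

def bit_planes(gray):
    """Split an 8-bit grayscale image into its eight bit planes
    (bit-surface stratification): plane k holds bit k of every pixel."""
    g = gray.astype(np.uint8)
    return [((g >> k) & 1).astype(np.uint8) for k in range(8)]

def entropy(img):
    """Shannon entropy (bits) of an 8-bit image's intensity histogram;
    used here as the selection criterion among layered maps."""
    hist = np.bincount(img.ravel(), minlength=256).astype(float)
    p = hist / hist.sum()
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())
```

A constant image has zero entropy, while a map splitting evenly between two intensities has entropy 1 bit; ranking candidate maps by such a criterion is how the final layer could be selected.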

