A Semantic-Preserving Deep Hashing Model for Multi-Label Remote Sensing Image Retrieval

Conventional remote sensing image retrieval (RSIR) systems perform single-label retrieval with a single label to represent the most dominant semantic content for an image. Improved spatial resolution dramatically boosts the remote sensing image scene complexity, as a remote sensing image always contains multiple categories of surface features. In this case, a single label cannot comprehensively describe the semantic content of a complex remote sensing image scene and therefore results in poor retrieval performance in practical applications. As a result, researchers have begun to pay attention to multi-label image retrieval. However, in the era of massive remote sensing data, how to increase retrieval efficiency and reduce feature storage while preserving semantic information remains unsolved. Considering the powerful capability of hashing learning in overcoming the curse of dimensionality caused by high-dimensional image representation in Approximate Nearest Neighbor (ANN) search problems, we propose a new semantic-preserving deep hashing model for multi-label remote sensing image retrieval. Our model consists of three main components: (1) a convolutional neural network to extract image features; (2) a hash layer to generate binary codes; (3) a new loss function to better maintain the multi-label semantic information of hash learning contained in context remote sensing image scene. As far as we know, this is the first attempt to apply deep hashing into the multi-label remote sensing image retrieval. Experimental results indicate the effectiveness and promising of the introduction of hashing methods in the multi-label remote sensing image retrieval.

Download Full-text

Cohesion Intensive Deep Hashing for Remote Sensing Image Retrieval

Remote Sensing ◽

10.3390/rs12010101 ◽

2019 ◽

Vol 12 (1) ◽

pp. 101 ◽

Cited By ~ 3

Author(s):

Lirong Han ◽

Peng Li ◽

Xiao Bai ◽

Christos Grecos ◽

Xiaoyu Zhang ◽

...

Keyword(s):

Remote Sensing ◽

Image Retrieval ◽

Large Scale ◽

Remote Sensing Data ◽

Optimization Method ◽

Remote Sensing Image ◽

Model Parameters ◽

Remote Sensing Images ◽

Deep Hashing ◽

Deep Model

Recently, the demand for remote sensing image retrieval is growing and attracting the interest of many researchers because of the increasing number of remote sensing images. Hashing, as a method of retrieving images, has been widely applied to remote sensing image retrieval. In order to improve hashing performance, we develop a cohesion intensive deep hashing model for remote sensing image retrieval. The underlying architecture of our deep model is motivated by the state-of-the-art residual net. Residual nets aim at avoiding gradient vanishing and gradient explosion when the net reaches a certain depth. However, different from the residual net which outputs multiple class-labels, we present a residual hash net that is terminated by a Heaviside-like function for binarizing remote sensing images. In this scenario, the representational power of the residual net architecture is exploited to establish an end-to-end deep hashing model. The residual hash net is trained subject to a weighted loss strategy that intensifies the cohesiveness of image hash codes within one class. This effectively addresses the data imbalance problem normally arising in remote sensing image retrieval tasks. Furthermore, we adopted a gradualness optimization method for obtaining optimal model parameters in order to favor accurate binary codes with little quantization error. We conduct comparative experiments on large-scale remote sensing data sets such as UCMerced and AID. The experimental results validate the hypothesis that our method improves the performance of current remote sensing image retrieval.

Download Full-text

Unsupervised Deep Feature Learning for Remote Sensing Image Retrieval

Remote Sensing ◽

10.3390/rs10081243 ◽

2018 ◽

Vol 10 (8) ◽

pp. 1243 ◽

Cited By ~ 29

Author(s):

Xu Tang ◽

Xiangrong Zhang ◽

Fang Liu ◽

Licheng Jiao

Keyword(s):

Remote Sensing ◽

Image Retrieval ◽

Feature Learning ◽

Remote Sensing Image ◽

Code Word ◽

Image Features ◽

Learning Method ◽

L1 Norm ◽

Image Descriptor ◽

Image Archives

Due to the specific characteristics and complicated contents of remote sensing (RS) images, remote sensing image retrieval (RSIR) is always an open and tough research topic in the RS community. There are two basic blocks in RSIR, including feature learning and similarity matching. In this paper, we focus on developing an effective feature learning method for RSIR. With the help of the deep learning technique, the proposed feature learning method is designed under the bag-of-words (BOW) paradigm. Thus, we name the obtained feature deep BOW (DBOW). The learning process consists of two parts, including image descriptor learning and feature construction. First, to explore the complex contents within the RS image, we extract the image descriptor in the image patch level rather than the whole image. In addition, instead of using the handcrafted feature to describe the patches, we propose the deep convolutional auto-encoder (DCAE) model to deeply learn the discriminative descriptor for the RS image. Second, the k-means algorithm is selected to generate the codebook using the obtained deep descriptors. Then, the final histogrammic DBOW features are acquired by counting the frequency of the single code word. When we get the DBOW features from the RS images, the similarities between RS images are measured using L1-norm distance. Then, the retrieval results can be acquired according to the similarity order. The encouraging experimental results counted on four public RS image archives demonstrate that our DBOW feature is effective for the RSIR task. Compared with the existing RS image features, our DBOW can achieve improved behavior on RSIR.

Download Full-text

Deep Hashing Network Based on Split Channels for Hybrid-Source Remote Sensing Image Retrieval

Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition ◽

10.1145/3430199.3430225 ◽

2020 ◽

Author(s):

Salayidin Sirajidin ◽

Hong Huo ◽

Tao Fang

Keyword(s):

Remote Sensing ◽

Image Retrieval ◽

Remote Sensing Image ◽

Deep Hashing

Download Full-text

Performance Evaluation of Single-Label and Multi-Label Remote Sensing Image Retrieval Using a Dense Labeling Dataset

Remote Sensing ◽

10.3390/rs10060964 ◽

2018 ◽

Vol 10 (6) ◽

pp. 964 ◽

Cited By ~ 34

Author(s):

Zhenfeng Shao ◽

Ke Yang ◽

Weixun Zhou

Keyword(s):

Remote Sensing ◽

Performance Evaluation ◽

Deep Learning ◽

Image Retrieval ◽

Semantic Segmentation ◽

Semantic Content ◽

Remote Sensing Image ◽

Remote Sensing Images ◽

Benchmark Datasets ◽

Feature Based

Benchmark datasets are essential for developing and evaluating remote sensing image retrieval (RSIR) approaches. However, most of the existing datasets are single-labeled, with each image in these datasets being annotated by a single label representing the most significant semantic content of the image. This is sufficient for simple problems, such as distinguishing between a building and a beach, but multiple labels and sometimes even dense (pixel) labels are required for more complex problems, such as RSIR and semantic segmentation.We therefore extended the existing multi-labeled dataset collected for multi-label RSIR and presented a dense labeling remote sensing dataset termed "DLRSD". DLRSD contained a total of 17 classes, and the pixels of each image were assigned with 17 pre-defined labels. We used DLRSD to evaluate the performance of RSIR methods ranging from traditional handcrafted feature-based methods to deep learning-based ones. More specifically, we evaluated the performances of RSIR methods from both single-label and multi-label perspectives. These results demonstrated the advantages of multiple labels over single labels for interpreting complex remote sensing images. DLRSD provided the literature a benchmark for RSIR and other pixel-based problems such as semantic segmentation.

Download Full-text

Multi-sensor Fusion and Explicit Semantic Preserving-based Deep Hashing for Cross-Modal Remote Sensing Image Retrieval

IEEE Transactions on Geoscience and Remote Sensing ◽

10.1109/tgrs.2021.3136641 ◽

2021 ◽

pp. 1-1

Author(s):

Yuxi Sun ◽

Shanshan Feng ◽

Yunming Ye ◽

Xutao Li ◽

Jian Kang ◽

...

Keyword(s):

Remote Sensing ◽

Image Retrieval ◽

Sensor Fusion ◽

Remote Sensing Image ◽

Deep Hashing

Download Full-text

Slice-feature based deep hashing algorithm for remote sensing image retrieval

Infrared Physics & Technology ◽

10.1016/j.infrared.2020.103299 ◽

2020 ◽

Vol 107 ◽

pp. 103299

Author(s):

Enhai Liu ◽

Xintong Zhang ◽

Xia Xu ◽

Shiyan Fan

Keyword(s):

Remote Sensing ◽

Image Retrieval ◽

Remote Sensing Image ◽

Deep Hashing ◽

Feature Based ◽

Hashing Algorithm

Download Full-text

Learning Source-Invariant Deep Hashing Convolutional Neural Networks for Cross-Source Remote Sensing Image Retrieval

IEEE Transactions on Geoscience and Remote Sensing ◽

10.1109/tgrs.2018.2839705 ◽

2018 ◽

Vol 56 (11) ◽

pp. 6521-6536 ◽

Cited By ~ 38

Author(s):

Yansheng Li ◽

Yongjun Zhang ◽

Xin Huang ◽

Jiayi Ma

Keyword(s):

Remote Sensing ◽

Neural Networks ◽

Image Retrieval ◽

Convolutional Neural Networks ◽

Remote Sensing Image ◽

Deep Hashing

Download Full-text

Deep Hashing Using Proxy Loss on Remote Sensing Image Retrieval

Remote Sensing ◽

10.3390/rs13152924 ◽

2021 ◽

Vol 13 (15) ◽

pp. 2924

Author(s):

Xue Shan ◽

Pingping Liu ◽

Yifan Wang ◽

Qiuzhan Zhou ◽

Zhen Wang

Keyword(s):

Remote Sensing ◽

Image Retrieval ◽

Metric Learning ◽

Remote Sensing Image ◽

Convergence Time ◽

Aerial Image ◽

Shooting Methods ◽

Retrieval Method ◽

Memory Space ◽

Deep Hashing

With the improvement of various space-satellite shooting methods, the sources, scenes, and quantities of remote sensing data are also increasing. An effective and fast remote sensing image retrieval method is necessary, and many researchers have conducted a lot of work in this direction. Nevertheless, a fast retrieval method called hashing retrieval is proposed to improve retrieval speed, while maintaining retrieval accuracy and greatly reducing memory space consumption. At the same time, proxy-based metric learning losses can reduce convergence time. Naturally, we present a proxy-based hash retrieval method, called DHPL (Deep Hashing using Proxy Loss), which combines hash code learning with proxy-based metric learning in a convolutional neural network. Specifically, we designed a novel proxy metric learning network, and we used one hash loss function to reduce the quantified losses. For the University of California Merced (UCMD) dataset, DHPL resulted in a mean average precision (mAP) of up to 98.53% on 16 hash bits, 98.83% on 32 hash bits, 99.01% on 48 hash bits, and 99.21% on 64 hash bits. For the aerial image dataset (AID), DHPL achieved an mAP of up to 93.53% on 16 hash bits, 97.36% on 32 hash bits, 98.28% on 48 hash bits, and 98.54% on 64 bits. Our experimental results on UCMD and AID datasets illustrate that DHPL could generate great results compared with other state-of-the-art hash approaches.

Download Full-text