EML-GAN: Generative Adversarial Network-Based End-to-End Multi-Task Learning Architecture for Super-Resolution Reconstruction and Scene Classification of Low-Resolution Remote Sensing Imagery

Mainstream image super-resolution (SR) methods are generally trained on paired samples. Because high-resolution (HR) remote sensing images are difficult to collect with limited imaging devices, most existing remote sensing super-resolution methods down-sample the collected original images to generate auxiliary low-resolution (LR) images and form a paired pseudo HR-LR dataset for training. However, the distribution of the generated LR images is generally inconsistent with that of real images because of the limitations of remote sensing imaging devices. In this paper, we propose a perceptually unpaired super-resolution method by constructing a multi-stage aggregation network (MSAN). The optimization of the network depends on consistency losses. In particular, the first stage preserves the content of the super-resolved results by constraining the content consistency between the down-scaled SR results and the low-quality LR inputs. The second stage minimizes a perceptual feature loss between the current result and the LR input to constrain perceptual-content consistency. The final stage employs a generative adversarial network (GAN) to add photo-realistic textures by constraining perceptual-distribution consistency. Numerous experiments on synthetic remote sensing datasets and real remote sensing images show that our method obtains more plausible results than other SR methods, both quantitatively and qualitatively. The PSNR of our network is 0.06 dB higher than that of the state-of-the-art method HAN on the UC Merced test set with complex degradation.
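The first-stage constraint and the PSNR figure quoted above can be illustrated with a minimal sketch. It assumes grayscale images stored as lists of rows, average pooling as the down-scaling operator, and a mean absolute error as the distance; the abstract does not specify any of these, and the function names (`downscale`, `content_consistency_loss`, `psnr`) are hypothetical.

```python
import math


def downscale(img, factor):
    """Average-pool a 2-D image by an integer factor.

    Assumption: height and width are divisible by `factor`.
    """
    h, w = len(img), len(img[0])
    out = []
    for i in range(0, h, factor):
        row = []
        for j in range(0, w, factor):
            block = [img[y][x]
                     for y in range(i, i + factor)
                     for x in range(j, j + factor)]
            row.append(sum(block) / len(block))
        out.append(row)
    return out


def content_consistency_loss(sr, lr, factor):
    """Mean absolute error between the down-scaled SR result and the LR input."""
    down = downscale(sr, factor)
    diffs = [abs(a - b)
             for row_d, row_l in zip(down, lr)
             for a, b in zip(row_d, row_l)]
    return sum(diffs) / len(diffs)


def psnr(a, b, peak=255.0):
    """Peak signal-to-noise ratio in dB between two images of equal size."""
    sq = [(x - y) ** 2
          for row_a, row_b in zip(a, b)
          for x, y in zip(row_a, row_b)]
    mse = sum(sq) / len(sq)
    if mse == 0:
        return math.inf
    return 10 * math.log10(peak ** 2 / mse)


# A 2x nearest-neighbour upsampling of `lr` is perfectly consistent with it.
lr = [[10, 20], [30, 40]]
sr = [[10, 10, 20, 20],
      [10, 10, 20, 20],
      [30, 30, 40, 40],
      [30, 30, 40, 40]]
loss = content_consistency_loss(sr, lr, 2)
```

Because the loss only compares the down-scaled SR output with the LR input, it needs no HR ground truth, which is what makes it usable in an unpaired setting.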

Target Detection Method for Low-Resolution Remote Sensing Image Based on ESRGAN and ReDet

Photonics ◽

10.3390/photonics8100431 ◽

2021 ◽

Vol 8 (10) ◽

pp. 431

Author(s):

Yuwu Wang ◽

Guobing Sun ◽

Shengwei Guo

Keyword(s):

Remote Sensing ◽

Target Detection ◽

Detection Method ◽

Recognition Rate ◽

Super Resolution ◽

Remote Sensing Image ◽

Remote Sensing Images ◽

Low Resolution ◽

Generative Adversarial Network ◽

Adversarial Network

With the widespread use of remote sensing images, low-resolution target detection in remote sensing images has become a hot research topic in the field of computer vision. In this paper, we propose a Target Detection on Super-Resolution Reconstruction (TDoSR) method to solve the problem of low target recognition rates in low-resolution remote sensing images under foggy conditions. The TDoSR method uses the Enhanced Super-Resolution Generative Adversarial Network (ESRGAN) to perform defogging and super-resolution reconstruction of foggy low-resolution remote sensing images. In the target detection part, the Rotation Equivariant Detector (ReDet) algorithm, which currently offers a higher recognition rate, is used to identify and classify various types of targets. A large number of experiments on the remote sensing image dataset DOTA-v1.5 suggest that the proposed method achieves good results in the target detection of low-resolution foggy remote sensing images. The principal result of this paper is that the recognition rate of the TDoSR method increases by roughly 20% compared with detection on the low-resolution foggy remote sensing images.

TWIST-GAN: Towards Wavelet Transform and Transferred GAN for Spatio-Temporal Single Image Super Resolution

ACM Transactions on Intelligent Systems and Technology ◽

10.1145/3456726 ◽

2021 ◽

Vol 12 (6) ◽

pp. 1-20

Author(s):

Fayaz Ali Dharejo ◽

Farah Deeba ◽

Yuanchun Zhou ◽

Bhagwan Das ◽

Munsif Ali Jatoi ◽

...

Keyword(s):

Remote Sensing ◽

Super Resolution ◽

Generative Adversarial Networks ◽

Single Image ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Adversarial Networks ◽

Image Super Resolution ◽

Spatio Temporal ◽

Single Image Super Resolution

Single image super-resolution (SISR) produces high-resolution images with fine spatial resolution from a remotely sensed image with low spatial resolution. Recently, deep learning and generative adversarial networks (GANs) have made breakthroughs in the challenging task of SISR. However, the generated image still suffers from undesirable artifacts such as the absence of texture-feature representation and high-frequency information. We propose a frequency domain-based spatio-temporal remote sensing single image super-resolution technique (TWIST-GAN) that reconstructs the HR image by combining GANs with various frequency bands. We introduce a new method incorporating Wavelet Transform (WT) characteristics and a transferred generative adversarial network. The LR image is split into various frequency bands using the WT, whereas the transferred generative adversarial network predicts high-frequency components via a proposed architecture. Finally, the inverse wavelet transform produces a reconstructed super-resolution image. The model is first trained on the external DIV2K dataset and validated with the UC Merced Landsat remote sensing dataset and Set14, with each image of size 256 × 256. Following that, transferred GANs are used to process spatio-temporal remote sensing images in order to minimize computation cost differences and improve texture information. The findings are compared quantitatively and qualitatively with the current state-of-the-art approaches. In addition, we saved about 43% of the GPU memory during training and accelerated the execution of our simplified version by eliminating batch normalization layers.
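The band-splitting and inverse-transform steps described above can be sketched with a single-level 2-D Haar transform. The abstract does not name the wavelet family, so Haar is an assumption chosen for brevity, and the function names (`haar_dwt2`, `haar_idwt2`) and band labels are hypothetical.

```python
def haar_dwt2(img):
    """Single-level 2-D orthonormal Haar transform.

    Assumption: `img` is a list of rows with even height and width.
    Returns four half-size bands: the low-frequency approximation `ll`
    and three high-frequency detail bands.
    """
    ll, d_col, d_row, d_diag = [], [], [], []
    for i in range(0, len(img), 2):
        r_ll, r_dc, r_dr, r_dd = [], [], [], []
        for j in range(0, len(img[0]), 2):
            a, b = img[i][j], img[i][j + 1]
            c, d = img[i + 1][j], img[i + 1][j + 1]
            r_ll.append((a + b + c + d) / 2)   # local average (low frequency)
            r_dc.append((a - b + c - d) / 2)   # change across columns
            r_dr.append((a + b - c - d) / 2)   # change across rows
            r_dd.append((a - b - c + d) / 2)   # diagonal change
        ll.append(r_ll)
        d_col.append(r_dc)
        d_row.append(r_dr)
        d_diag.append(r_dd)
    return ll, d_col, d_row, d_diag


def haar_idwt2(ll, d_col, d_row, d_diag):
    """Inverse of `haar_dwt2`: rebuild the full-size image from its bands."""
    out = []
    for i in range(len(ll)):
        top, bottom = [], []
        for j in range(len(ll[0])):
            s, x, y, z = ll[i][j], d_col[i][j], d_row[i][j], d_diag[i][j]
            top.extend([(s + x + y + z) / 2, (s - x + y - z) / 2])
            bottom.extend([(s + x - y - z) / 2, (s - x - y + z) / 2])
        out.append(top)
        out.append(bottom)
    return out


image = [[1, 2, 3, 4],
         [5, 6, 7, 8],
         [9, 10, 11, 12],
         [13, 14, 15, 16]]
bands = haar_dwt2(image)
restored = haar_idwt2(*bands)
```

In a pipeline like the one described, a network would predict the three detail bands for the HR image while the transform itself stays fixed; the round trip above only demonstrates that the decomposition loses no information.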

Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery

Remote Sensing ◽

10.3390/rs71114680 ◽

2015 ◽

Vol 7 (11) ◽

pp. 14680-14707 ◽

Cited By ~ 513

Author(s):

Fan Hu ◽

Gui-Song Xia ◽

Jingwen Hu ◽

Liangpei Zhang

Keyword(s):

Remote Sensing ◽

Neural Networks ◽

High Resolution ◽

Convolutional Neural Networks ◽

Scene Classification ◽

Deep Convolutional Neural Networks ◽

Remote Sensing Imagery

A comparative study of sampling analysis in scene classification of high-resolution remote sensing imagery

2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) ◽

10.1109/igarss.2015.7326290 ◽

2015 ◽

Cited By ~ 5

Author(s):

Jingwen Hu ◽

Gui-Song Xia ◽

Fan Hu ◽

Hong Sun ◽

Liangpei Zhang

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Comparative Study ◽

Scene Classification ◽

Remote Sensing Imagery

A benchmark for scene classification of high spatial resolution remote sensing imagery

2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) ◽

10.1109/igarss.2015.7326956 ◽

2015 ◽

Cited By ~ 8

Author(s):

Jingwen Hu ◽

Tianbi Jiang ◽

Xinyi Tong ◽

Gui-Song Xia ◽

Liangpei Zhang

Keyword(s):

Remote Sensing ◽

Spatial Resolution ◽

High Spatial Resolution ◽

Scene Classification ◽

Remote Sensing Imagery

Remote Sensing Image Super-Resolution Reconstruction based on Generative Adversarial Network

10.23940/ijpe.19.07.p4.17831791 ◽

2019 ◽

Author(s):

Aili Wang

Keyword(s):

Remote Sensing ◽

Super Resolution ◽

Remote Sensing Image ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Image Super Resolution

Small-Object Detection in Remote Sensing Images with End-to-End Edge-Enhanced GAN and Object Detector Network

10.20944/preprints202003.0313.v1 ◽

2020 ◽

Author(s):

Jakaria Rabbi ◽

Nilanjan Ray ◽

Matthias Schubert ◽

Subir Chowdhury ◽

Dennis Chao

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Super Resolution ◽

Detection Performance ◽

Superior Performance ◽

Single Shot ◽

Small Object ◽

Remote Sensing Images ◽

Low Resolution ◽

End To End

The detection performance for small objects in remote sensing images is not satisfactory compared to that for large objects, especially in low-resolution and noisy images. A generative adversarial network (GAN)-based model called enhanced super-resolution GAN (ESRGAN) shows remarkable image enhancement performance, but its reconstructed images miss high-frequency edge information. Therefore, object detection performance degrades for small objects on recovered noisy and low-resolution remote sensing images. Inspired by the success of edge-enhanced GAN (EEGAN) and ESRGAN, we apply a new edge-enhanced super-resolution GAN (EESRGAN) to improve the image quality of remote sensing images and use different detector networks in an end-to-end manner, where the detector loss is backpropagated into the EESRGAN to improve the detection performance. We propose an architecture with three components: ESRGAN, Edge Enhancement Network (EEN), and Detection network. We use residual-in-residual dense blocks (RRDB) for both the GAN and EEN, and for the detector network, we use the faster region-based convolutional network (FRCNN) (two-stage detector) and single-shot multi-box detector (SSD) (one-stage detector). Extensive experiments on the car overhead with context data set and an oil and gas storage tank data set (created by us) show superior performance of our method compared to the standalone state-of-the-art object detectors.
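To make "high-frequency edge information" concrete, the sketch below extracts an edge map with a 3 × 3 Laplacian filter. The abstract does not state how the EEN isolates edges, so the Laplacian kernel, the border handling (edge replication), and the function name `laplacian_edges` are all assumptions made for illustration.

```python
def laplacian_edges(img):
    """Apply the 4-neighbour Laplacian kernel [[0,1,0],[1,-4,1],[0,1,0]].

    Borders are handled by replicating the nearest pixel, so flat regions
    give a zero response everywhere, including at the image boundary.
    """
    h, w = len(img), len(img[0])

    def px(y, x):
        # Clamp coordinates to the image to replicate border pixels.
        return img[min(max(y, 0), h - 1)][min(max(x, 0), w - 1)]

    return [[px(y - 1, x) + px(y + 1, x) + px(y, x - 1) + px(y, x + 1)
             - 4 * px(y, x)
             for x in range(w)]
            for y in range(h)]


# A vertical step edge: the response is non-zero only next to the step.
step = [[0, 0, 9, 9] for _ in range(4)]
edges = laplacian_edges(step)
```

An edge map of this kind is close to zero wherever the image is smooth, so a reconstruction that blurs object boundaries shows up as a weakened response around small targets.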

Upsampling Real-Time, Low-Resolution CCTV Videos Using Generative Adversarial Networks

Electronics ◽

10.3390/electronics9081312 ◽

2020 ◽

Vol 9 (8) ◽

pp. 1312

Author(s):

Debapriya Hazra ◽

Yung-Cheol Byun

Keyword(s):

High Resolution ◽

Real Time ◽

Super Resolution ◽

Poor Quality ◽

Generative Adversarial Networks ◽

Temporal Consistency ◽

Low Resolution ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Benchmark Datasets

Video super-resolution has become an emerging topic in the field of machine learning. The generative adversarial network is a framework that is widely used to develop solutions for low-resolution videos. Video surveillance using closed-circuit television (CCTV) is significant in every field, all over the world. A common problem with CCTV videos is sudden video loss or poor quality. In this paper, we propose a generative adversarial network that implements spatio-temporal generators and discriminators to enhance real-time low-resolution CCTV videos to high resolution. The proposed model considers both foreground and background motion of a CCTV video and effectively models the spatial and temporal consistency of low-resolution video frames to generate high-resolution videos. Quantitative and qualitative experiments on benchmark datasets, including Kinetics-700, UCF101, HMDB51 and IITH_Helmet2, showed that our model outperforms the existing GAN models for video super-resolution.
