scholarly journals Task-Driven Super Resolution: Object Detection in Low-Resolution Images

2021 ◽  
pp. 387-395
Author(s):  
Muhammad Haris ◽  
Greg Shakhnarovich ◽  
Norimichi Ukita
2021 ◽  
Vol 2083 (4) ◽  
pp. 042026
Author(s):  
Lizhuo Gao

Abstract Super resolution is applied in many digital image fields. In many cases, only a set of low-resolution images can be obtained, but the image needs a higher resolution, and then SR needs to be applied. SR technology has undergone years of development. Among them, SRGAN is the key work to introduce GAN into the SR field, which can truly restore a large number of details on the basis of low-pixel pictures. ESRGAN is a further improvement on SRGAN. By removing the BN layer in SRGAN, the effect of artifacts in SRGAN is eliminated. However, there is still a problem that the restoration of information on small and medium scales is not accurate enough. The proposed ERDBNet improve the model on the basis of ESRGAN, and use the ERDB block to replace the original RRDB block. The new structure uses a three-layer dense block to replace the original dense block, and a residual structure of the starting point is added to each dense block. The pre-trained network can reach a PSNR of 30.425 after 200k iterations, and the minimum floating PSNR is only 30.213. Compared with the original structure, it is more stable and performs better in the detail recovery of many low-pixel images.


Author(s):  
Dong Seon Cheng ◽  
Marco Cristani ◽  
Vittorio Murino

Image super-resolution is one of the most appealing applications of image processing, capable of retrieving a high resolution image by fusing several registered low resolution images depicting an object of interest. However, employing super-resolution in video data is challenging: a video sequence generally contains a lot of scattered information regarding several objects of interest in cluttered scenes. Especially with hand-held cameras, the overall quality may be poor due to low resolution or unsteadiness. The objective of this chapter is to demonstrate why standard image super-resolution fails in video data, which are the problems that arise, and how we can overcome these problems. In our first contribution, we propose a novel Bayesian framework for super-resolution of persistent objects of interest in video sequences. We call this process Distillation. In the traditional formulation of the image super-resolution problem, the observed target is (1) always the same, (2) acquired using a camera making small movements, and (3) found in a number of low resolution images sufficient to recover high-frequency information. These assumptions are usually unsatisfied in real world video acquisitions and often beyond the control of the video operator. With Distillation, we aim to extend and to generalize the image super-resolution task, embedding it in a structured framework that accurately distills all the informative bits of an object of interest. In practice, the Distillation process: i) individuates, in a semi supervised way, a set of objects of interest, clustering the related video frames and registering them with respect to global rigid transformations; ii) for each one, produces a high resolution image, by weighting each pixel according to the information retrieved about the object of interest. As a second contribution, we extend the Distillation process to deal with objects of interest whose transformations in the appearance are not (only) rigid. Such process, built on top of the Distillation, is hierarchical, in the sense that a process of clustering is applied recursively, beginning with the analysis of whole frames, and selectively focusing on smaller sub-regions whose isolated motion can be reasonably assumed as rigid. The ultimate product of the overall process is a strip of images that describe at high resolution the dynamics of the video, switching between alternative local descriptions in response to visual changes. Our approach is first tested on synthetic data, obtaining encouraging comparative results with respect to known super-resolution techniques, and a good robustness against noise. Second, real data coming from different videos are considered, trying to solve the major details of the objects in motion.


2019 ◽  
Vol 78 ◽  
pp. 236-245 ◽  
Author(s):  
Dewan Fahim Noor ◽  
Yue Li ◽  
Zhu Li ◽  
Shuvra Bhattacharyya ◽  
George York

Author(s):  
Jakaria Rabbi ◽  
Nilanjan Ray ◽  
Matthias Schubert ◽  
Subir Chowdhury ◽  
Dennis Chao

The detection performance of small objects in remote sensing images is not satisfactory compared to large objects, especially in low-resolution and noisy images. A generative adversarial network (GAN)-based model called enhanced super-resolution GAN (ESRGAN) shows remarkable image enhancement performance, but reconstructed images miss high-frequency edge information. Therefore, object detection performance degrades for the small objects on recovered noisy and low-resolution remote sensing images. Inspired by the success of edge enhanced GAN (EEGAN) and ESRGAN, we apply a new edge-enhanced super-resolution GAN (EESRGAN) to improve the image quality of remote sensing images and used different detector networks in an end-to-end manner where detector loss is backpropagated into the EESRGAN to improve the detection performance. We propose an architecture with three components: ESRGAN, Edge Enhancement Network (EEN), and Detection network. We use residual-in-residual dense blocks (RRDB) for both the GAN and EEN, and for the detector network, we use the faster region-based convolutional network (FRCNN) (two-stage detector) and single-shot multi-box detector (SSD) (one stage detector). Extensive experiments on car overhead with context and oil and gas storage tank (created by us) data sets show superior performance of our method compared to the standalone state-of-the-art object detectors.


Sign in / Sign up

Export Citation Format

Share Document