Target Detection and Classification Performance Enhancement using Super-Resolution Infrared Videos

2021 ◽  
Vol 12 (2) ◽  
pp. 33-45
Author(s):  
Chiman Kwan ◽  
David Gribben ◽  
Bence Budavari

Long-range infrared videos such as the Defense Systems Information Analysis Center (DSIAC) videos usually do not have high resolution. In recent years, there have been significant advancements in video super-resolution algorithms. Here, we summarize our study on the use of super-resolution videos for target detection and classification. We observed that super-resolution videos can significantly improve detection and classification performance. For example, for 3000 m range videos, we were able to improve the average precision of target detection from 11% (without super-resolution) to 44% (with 4x super-resolution) and the overall accuracy of target classification from 10% (without super-resolution) to 44% (with 2x super-resolution).
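The reported gains are in terms of average precision (AP), the standard detection metric. As a minimal sketch of how AP is computed from ranked detections (VOC-style all-point integration; the detections and confidences below are hypothetical, not from the paper's data):

```python
def average_precision(scored_hits, num_gt):
    """VOC-style AP: detections sorted by confidence, each marked
    True (matched a ground-truth target) or False (false positive)."""
    scored_hits.sort(key=lambda d: -d[0])          # rank by confidence
    tp = fp = 0
    precisions, recalls = [], []
    for _, is_tp in scored_hits:
        tp += is_tp
        fp += (not is_tp)
        precisions.append(tp / (tp + fp))
        recalls.append(tp / num_gt)
    # integrate precision over recall (all-point interpolation)
    ap, prev_r = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += p * (r - prev_r)
        prev_r = r
    return ap

# Hypothetical detections: (confidence, matched-to-ground-truth?)
dets = [(0.9, True), (0.8, False), (0.7, True), (0.4, False)]
print(round(average_precision(dets, num_gt=2), 3))  # → 0.833
```

Under this metric, an AP jump from 11% to 44% means the super-resolved detector both finds far more of the ground-truth targets and ranks its true detections much higher.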

2021 ◽  
Vol 12 (1) ◽  
pp. 23-38
Author(s):  
Chiman Kwan ◽  
David Gribben

In our earlier target detection and classification papers, we used 8-bit infrared videos from the Defense Systems Information Analysis Center (DSIAC) video dataset. In this paper, we focus on how we can improve the target detection and classification results using 16-bit videos. One problem with the 16-bit videos is that some image frames have very low contrast. Two methods were explored to improve upon previous detection and classification results. The first method was effectively the same as the baseline 8-bit approach, but using the 16-bit raw data rather than the 8-bit data taken from the AVI files. The second method was a second-order histogram matching algorithm that preserves the 16-bit nature of the videos while providing normalization and contrast enhancement. Results showed that the second-order histogram matching algorithm improved both target detection using You Only Look Once (YOLO) and classification using a Residual Network (ResNet): the average precision (AP) metric of YOLO improved by 8%, and the overall accuracy (OA) of ResNet improved by 12%, both significant gains.
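The abstract does not spell out the second-order matching algorithm; one common second-order scheme matches a frame's first two moments (mean and standard deviation) to a reference distribution while staying within the 16-bit range. A sketch under that assumption, with illustrative reference statistics:

```python
import statistics

def match_second_order(frame, ref_mean, ref_std, lo=0, hi=65535):
    """Shift and scale pixel values so the frame's mean and standard
    deviation match a reference, clipped to the 16-bit range."""
    m = statistics.fmean(frame)
    s = statistics.pstdev(frame)
    if s == 0:                       # flat frame: just shift the mean
        return [min(hi, max(lo, ref_mean))] * len(frame)
    return [min(hi, max(lo, (x - m) / s * ref_std + ref_mean))
            for x in frame]

# A low-contrast 16-bit frame stretched toward a reference distribution
frame = [100.0, 200.0, 300.0, 400.0]
out = match_second_order(frame, ref_mean=2000.0, ref_std=500.0)
```

Because the transform is a global affine map per frame, it normalizes contrast across frames without quantizing down to 8 bits, which is the property the paper credits for the detection and classification gains.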


2020 ◽  
Vol 12 (24) ◽  
pp. 4024
Author(s):  
Chiman Kwan ◽  
Bence Budavari

The detection of small moving objects in long-range infrared videos is challenging due to background clutter, air turbulence, and small target size. In this paper, we summarize the investigation of efficient ways to enhance the performance of small target detection in long-range and low-quality infrared videos containing moving objects. In particular, we focus on unsupervised, modular, flexible, and efficient methods for target detection performance enhancement using motion information extracted from optical flow methods. Three well-known optical flow methods were studied. It was found that optical flow methods need to be combined with contrast enhancement, connected component analysis, and target association in order to be effective for target detection. Extensive experiments using long-range mid-wave infrared (MWIR) videos from the Defense Systems Information Analysis Center (DSIAC) dataset clearly demonstrated the efficacy of our proposed approach.
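The pipeline described above (motion cue, then contrast enhancement, connected component analysis, and target association) can be sketched end to end. In this illustration, simple frame differencing stands in for a real optical flow method, and the threshold is arbitrary; the actual paper evaluates three optical flow algorithms:

```python
def motion_mask(prev, curr, thresh=30):
    """Crude motion cue: absolute frame difference (a stand-in for
    optical-flow magnitude), thresholded to a binary mask."""
    return [[int(abs(a - b) > thresh) for a, b in zip(rp, rc)]
            for rp, rc in zip(prev, curr)]

def connected_components(mask):
    """4-connected component labelling; returns component centroids,
    which downstream target association would track across frames."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    centroids = []
    for y in range(h):
        for x in range(w):
            if mask[y][x] and not seen[y][x]:
                stack, pixels = [(y, x)], []
                seen[y][x] = True
                while stack:
                    cy, cx = stack.pop()
                    pixels.append((cy, cx))
                    for ny, nx in ((cy-1,cx),(cy+1,cx),(cy,cx-1),(cy,cx+1)):
                        if 0 <= ny < h and 0 <= nx < w \
                                and mask[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            stack.append((ny, nx))
                ys, xs = zip(*pixels)
                centroids.append((sum(ys) / len(ys), sum(xs) / len(xs)))
    return centroids

# A single small moving target appearing between two frames
prev = [[0] * 5 for _ in range(5)]
curr = [row[:] for row in prev]
for y, x in ((1, 1), (1, 2), (2, 1), (2, 2)):
    curr[y][x] = 200
targets = connected_components(motion_mask(prev, curr))
```

The point the paper makes is that the motion cue alone is not enough: without the contrast enhancement, component grouping, and frame-to-frame association stages, raw flow output is too noisy to localize small targets.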


Sensors ◽  
2020 ◽  
Vol 20 (16) ◽  
pp. 4601
Author(s):  
Juan Wen ◽  
Yangjing Shi ◽  
Xiaoshi Zhou ◽  
Yiming Xue

Currently, various agricultural image classification tasks are carried out on high-resolution images. However, in some cases, we cannot obtain enough high-resolution images for classification, which significantly affects classification performance. In this paper, we design a crop disease classification network based on Enhanced Super-Resolution Generative Adversarial Networks (ESRGAN) for the case where only an insufficient number of low-resolution target images are available. First, ESRGAN is used to recover super-resolution crop images from low-resolution images. Transfer learning is applied in model training to compensate for the lack of training samples. Then, we test the performance of the generated super-resolution images in the crop disease classification task. Extensive experiments show that using the fine-tuned ESRGAN model can recover realistic crop information and improve the accuracy of crop disease classification, compared with four other image super-resolution methods.
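The inference pipeline is a simple two-stage composition: super-resolve first, then classify. The sketch below uses stand-in stubs (a 2x nearest-neighbour upscaler and a trivial brightness rule) purely to show the wiring; the actual work uses a fine-tuned ESRGAN generator and a trained disease classifier:

```python
def classify_with_sr(lr_image, sr_model, classifier):
    """Two-stage inference: super-resolve the low-resolution crop
    image first, then classify the recovered image."""
    return classifier(sr_model(lr_image))

# Stand-in SR stage: 2x nearest-neighbour upscaling (not ESRGAN)
def sr_model(img):
    up = []
    for row in img:
        wide = [v for v in row for _ in range(2)]   # duplicate columns
        up.extend([wide, list(wide)])               # duplicate rows
    return up

# Stand-in classifier: a hypothetical mean-brightness rule
def classifier(img):
    mean = sum(sum(r) for r in img) / (len(img) * len(img[0]))
    return "diseased" if mean > 128 else "healthy"

label = classify_with_sr([[200, 210], [190, 220]], sr_model, classifier)
```

The paper's finding is that swapping a naive upscaler for the fine-tuned ESRGAN in the first stage, with transfer learning to offset scarce samples, measurably raises the second stage's accuracy.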


Sensors ◽  
2021 ◽  
Vol 21 (23) ◽  
pp. 8113
Author(s):  
Kun Fang ◽  
Jianquan Ouyang ◽  
Buwei Hu

Traffic port stations are composed of buildings, infrastructure, and transportation vehicles. Detecting traffic port stations in high-resolution remote sensing images requires collecting feature information from nearby small targets, comprehensively analyzing and classifying it, and finally localizing the station. At present, deep learning methods based on convolutional neural networks have made great progress in single-target detection in high-resolution remote sensing images, but adapting well to the recognition of multi-target complexes remains a difficult point in the remote sensing field. This paper constructs a novel traffic port station detection model for high-resolution remote sensing images (Swin-HSTPS). It detects traffic port stations such as airports and ports, improves recognition accuracy for multi-target complexes, and addresses high-precision positioning through comprehensive analysis of the combined features of multiple small targets. The model incorporates the MixUp hybrid enhancement algorithm to enrich image feature information in the preprocessing stage. A PReLU activation function is added to the forward network of the Swin Transformer to construct a ResNet-like residual branch, and the nonlinear transformation of the convolutional feature maps strengthens the information interaction between pixel blocks. The experiments evaluate training quality by comparing two indicators, average precision and average recall, during the training phase; in the prediction stage, the accuracy of the predicted targets is measured by confidence.
Experimental results show that the optimal average precision of Swin-HSTPS reaches 85.3%, about 8% higher than that of the Swin Transformer detection model. Its target prediction accuracy is also higher, and it can accurately locate traffic port stations such as airports and ports in high-resolution remote sensing images. The model inherits the advantages of the Swin Transformer detection model and is superior to mainstream models such as R-CNN and YOLOv5 in predicting traffic port stations in high-resolution remote sensing images.
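The architectural change the abstract describes is a PReLU activation placed inside a ResNet-like skip connection. A scalar sketch of that combination (the transform, feature values, and negative-slope parameter here are illustrative, not the model's actual layers):

```python
def prelu(x, a=0.25):
    """Parametric ReLU: identity for positive inputs, a learned
    negative slope `a` for negative inputs."""
    return x if x > 0 else a * x

def residual_block(features, transform, a=0.25):
    """ResNet-like skip connection: output = input + PReLU(F(input)).
    `transform` stands in for the block's convolutional mapping F."""
    return [x + prelu(transform(x), a) for x in features]

out = residual_block([-2.0, 0.5, 3.0], transform=lambda x: 2 * x)
```

The skip connection keeps gradients flowing through the identity path, while PReLU (unlike plain ReLU) lets negative responses pass with a small learned slope instead of zeroing them out.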


2021 ◽  
Vol 13 (10) ◽  
pp. 1944
Author(s):  
Xiaoming Liu ◽  
Menghua Wang

The Visible Infrared Imaging Radiometer Suite (VIIRS) onboard the Suomi National Polar-orbiting Partnership (SNPP) satellite has been a reliable source of ocean color data products, including five moderate (M) bands and one imagery (I) band of normalized water-leaving radiance spectra nLw(λ). The spatial resolutions of the M-band and I-band nLw(λ) are 750 m and 375 m, respectively. With the technique of convolutional neural network (CNN), the M-band nLw(λ) imagery can be super-resolved from 750 m to 375 m spatial resolution by leveraging the high spatial resolution features of I1-band nLw(λ) data. However, it is also important to enhance the spatial resolution of VIIRS-derived chlorophyll-a (Chl-a) concentration and the water diffuse attenuation coefficient at the wavelength of 490 nm (Kd(490)), as well as other biological and biogeochemical products. In this study, we describe our effort to derive high-resolution Kd(490) and Chl-a data based on super-resolved nLw(λ) images at the VIIRS five M-bands. To improve the network performance over extremely turbid coastal oceans and inland waters, the networks are retrained with a training dataset including ocean color data from the Bohai Sea, Baltic Sea, and La Plata River Estuary, covering water types from clear open oceans to moderately turbid and highly turbid waters. The evaluation results show that the super-resolved Kd(490) image is much sharper than the original one, and has more detailed fine spatial structures. A similar enhancement of finer structures is also found in the super-resolved Chl-a images. Chl-a filaments are much sharper and thinner in the super-resolved image, and some of the very fine spatial features that are not shown in the original images appear in the super-resolved Chl-a imagery. The networks are also applied to four other coastal and inland water regions.
The results show that super-resolution occurs mainly on pixels of Chl-a and Kd(490) features, especially on the feature edges and locations with a large spatial gradient. The biases between the original M-band images and super-resolved high-resolution images are small for both Chl-a and Kd(490) in moderately to extremely turbid coastal oceans and inland waters, indicating that the super-resolution process does not change the mean values of the original images.
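The final claim, that super-resolution does not change the mean values of the original imagery, can be checked by aggregating the 375 m product back to the 750 m grid with 2x2 block means and comparing. A sketch with a hypothetical super-resolved Kd(490) tile (values are synthetic, not VIIRS data):

```python
def block_mean_2x(img):
    """Aggregate a 2x-resolution image back to the original grid by
    averaging each 2x2 block (375 m -> 750 m in the VIIRS case)."""
    h, w = len(img), len(img[0])
    return [[(img[y][x] + img[y][x+1] + img[y+1][x] + img[y+1][x+1]) / 4
             for x in range(0, w, 2)]
            for y in range(0, h, 2)]

def mean2d(img):
    return sum(sum(r) for r in img) / (len(img) * len(img[0]))

# Hypothetical super-resolved Kd(490) tile (m^-1) and its bias check
sr = [[0.10, 0.12, 0.30, 0.28],
      [0.11, 0.13, 0.31, 0.29],
      [0.40, 0.42, 0.20, 0.22],
      [0.41, 0.43, 0.21, 0.23]]
bias = mean2d(block_mean_2x(sr)) - mean2d(sr)
```

A near-zero bias under this comparison is what the study reports for both Chl-a and Kd(490): the network redistributes detail within blocks (sharpening edges and gradients) without shifting the large-scale means.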

