Weighted Ensemble Object Detection with Optimized Coefficients for Remote Sensing Images

2020 ◽  
Vol 9 (6) ◽  
pp. 370
Author(s):  
Atakan Körez ◽  
Necaattin Barışçı ◽  
Aydın Çetin ◽  
Uçman Ergün

The detection of objects in very high-resolution (VHR) remote sensing images has become increasingly popular as remote sensing technologies have improved. High-resolution images from aircraft or satellites contain highly detailed and mixed backgrounds that reduce the success of object detection. In this study, a model that performs weighted ensemble object detection using optimized coefficients is proposed. This model uses the outputs of three different object detection models trained on the same dataset. The model’s structure takes two or more object detection methods as its input and provides an output with an optimized coefficient-weighted ensemble. The Northwestern Polytechnical University Very High Resolution 10 (NWPU-VHR-10) and Remote Sensing Object Detection (RSOD) datasets were used to measure the object detection success of the proposed model. Our experiments reveal that the proposed model improved the mean average precision (mAP) by 0.78%–16.5% compared to stand-alone models and achieves a better mAP than other state-of-the-art methods (3.55% higher on the NWPU-VHR-10 dataset and 1.49% higher on the RSOD dataset).
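As a rough illustration of what a coefficient-weighted ensemble can look like, the sketch below fuses boxes that several detectors have already matched to the same object, then searches a coarse grid for the coefficients. The abstract does not say how the coefficients are optimized or how boxes are merged, so the grid search, the score-weighted box averaging, and the names `fuse_boxes`, `grid_search_weights`, and `evaluate` are all assumptions made for illustration.

```python
from itertools import product


def fuse_boxes(boxes, scores, weights):
    """Fuse matched boxes (x1, y1, x2, y2), one per detector, into one box.

    Assumption: each coordinate is averaged with coefficient w_i * score_i,
    and the fused score is the weight-averaged score. Scores and weights are
    assumed positive so the denominators are non-zero.
    """
    coeffs = [w * s for w, s in zip(weights, scores)]
    total = sum(coeffs)
    fused_box = tuple(
        sum(c * box[k] for c, box in zip(coeffs, boxes)) / total
        for k in range(4)
    )
    fused_score = total / sum(weights)
    return fused_box, fused_score


def grid_search_weights(evaluate, n_models=3, steps=10):
    """Try every weight vector on a coarse grid that sums to 1; keep the best.

    `evaluate` is a hypothetical callback returning a validation metric
    (for example mAP) for a given weight tuple; higher is better.
    """
    best_w, best_val = None, float("-inf")
    for combo in product(range(steps + 1), repeat=n_models):
        if sum(combo) != steps:
            continue  # keep only weight vectors that sum to 1
        w = tuple(c / steps for c in combo)
        val = evaluate(w)
        if val > best_val:
            best_w, best_val = w, val
    return best_w, best_val
```

In practice `evaluate` would run the fused detections through an mAP computation on a held-out validation split; a finer grid or a continuous optimizer could replace the coarse search.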

2020 ◽  
Vol 12 (18) ◽  
pp. 2985 ◽  
Author(s):  
Yeneng Lin ◽  
Dongyun Xu ◽  
Nan Wang ◽  
Zhou Shi ◽  
Qiuxiao Chen

Automatic road extraction from very-high-resolution remote sensing images has become a popular topic in a wide range of fields. Convolutional neural networks are often used for this purpose. However, many network models do not achieve satisfactory extraction results because roads are elongated and vary in size across images. To improve the accuracy of road extraction, this paper proposes a deep learning model based on the structure of Deeplab v3. It incorporates a squeeze-and-excitation (SE) module to apply weights to different feature channels, and performs multi-scale upsampling to preserve and fuse shallow and deep information. To address the imbalance of road samples in images, different loss functions and backbone network modules are tested in the model’s training process. Compared with cross entropy, dice loss improves the performance of the model during training and prediction. The SE module is superior to ResNext and ResNet in improving the integrity of the extracted roads. Experimental results obtained using the Massachusetts Roads Dataset show that the proposed model (Nested SE-Deeplab) improves F1-Score by 2.4% and Intersection over Union by 2.0% compared with FC-DenseNet. The proposed model also achieves better segmentation accuracy in road extraction than other mainstream deep-learning models, including Deeplab v3, SegNet, and UNet.
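To make the contrast between the two losses concrete, here is a minimal sketch of a soft dice loss and a mean binary cross-entropy over a flat list of per-pixel road probabilities. The exact formulations used in the paper are not given in the abstract; the smoothing term `eps` and the function names are assumptions.

```python
import math


def dice_loss(probs, targets, eps=1e-6):
    """Soft dice loss: 1 - 2|P∩T| / (|P| + |T|), smoothed by eps."""
    inter = sum(p * t for p, t in zip(probs, targets))
    denom = sum(probs) + sum(targets)
    return 1.0 - (2.0 * inter + eps) / (denom + eps)


def bce_loss(probs, targets, eps=1e-7):
    """Mean binary cross-entropy, with probabilities clipped away from 0 and 1."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)
        total -= t * math.log(p) + (1 - t) * math.log(1.0 - p)
    return total / len(probs)


# A 100-pixel patch with a single road pixel, predicted as all background.
targets = [1] + [0] * 99
all_background = [0.0] * 100
print(dice_loss(all_background, targets), bce_loss(all_background, targets))
```

On this toy patch the all-background prediction is penalized almost maximally by the dice loss, while the averaged cross-entropy stays small because 99 of the 100 pixels are classified correctly. This is one common explanation for why dice loss helps when road pixels are rare.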


2018 ◽  
Vol 8 (10) ◽  
pp. 1883 ◽  
Author(s):  
Hongyin Han ◽  
Chengshan Han ◽  
Xucheng Xue ◽  
Changhong Hu ◽  
Liang Huang ◽  
...  

Shadows in very high-resolution multispectral remote sensing images hinder many applications, such as change detection, target recognition, and image classification. Although a wide variety of research has explored shadow detection, shadow pixels are still sometimes omitted or confused with vegetation pixels. In this study, to address the problems of shadow omission and vegetation misclassification, a mixed property-based shadow index is developed for detecting shadows in very high-resolution multispectral remote sensing images. The index is based on the difference in the hue and intensity components between shadows and nonshadows, and on the difference in red-band and near-infrared-band reflectivity between shadows and vegetation cover in nonshadows. The final shadow mask is then obtained with an optimal threshold derived automatically from the index image histogram. To validate the effectiveness of our approach for shadow detection, three test images are selected from the multispectral WorldView-3 images of Rio de Janeiro, Brazil, and are tested with our method. Compared with the other standard shadow detection methods investigated, the resulting images produced by our method deliver a higher average overall accuracy (95.02%) and better visual quality. These results show the efficacy and stability of the proposed approach in detecting shadows and in correctly separating shadow pixels from vegetation pixels in very high-resolution multispectral remote sensing images.
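The abstract does not name the thresholding method or give the index formula. As one possible reading of "an optimal threshold automatically obtained from the index image histogram", the sketch below applies Otsu's method to a flat list of index values. The choice of Otsu, the assumption that index values are scaled to [0, 1], and the assumption that larger index values mean shadow are all illustrative.

```python
def otsu_threshold(values, bins=256):
    """Return the threshold in [0, 1] that maximizes between-class variance.

    Assumes `values` is a flat list of index values already scaled to [0, 1].
    """
    hist = [0] * bins
    for v in values:
        hist[min(int(v * bins), bins - 1)] += 1

    total = len(values)
    sum_all = sum(i * h for i, h in enumerate(hist))
    w0, sum0 = 0, 0.0
    best_t, best_var = 0, -1.0
    for t in range(bins):
        w0 += hist[t]            # pixels in bins 0..t (class 0)
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0          # pixels in bins t+1.. (class 1)
        if w1 == 0:
            break
        m0, m1 = sum0 / w0, (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_t, best_var = t, var
    # Values below this edge fall in bins 0..best_t.
    return (best_t + 1) / bins


def shadow_mask(index_values):
    """Label a pixel as shadow when its index is at or above the threshold."""
    thr = otsu_threshold(index_values)
    return [v >= thr for v in index_values]
```

For a real image the index would be computed per pixel from the hue, intensity, red, and near-infrared components before thresholding; that step is omitted here because the abstract does not specify it.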


Author(s):  
Z. Wu ◽  
X. Chen ◽  
Y. Gao ◽  
Y. Li

Object detection in high-resolution remote sensing images is a fundamental and challenging problem in remote sensing image analysis for civil and military applications, because complex surrounding environments can cause recognition algorithms to mistake irrelevant ground objects for target objects. Deep convolutional neural networks (DCNNs) are a research hotspot in object detection owing to their powerful feature extraction ability, and they have achieved state-of-the-art results in computer vision. The common DCNN-based object detection pipeline consists of region proposal, CNN feature extraction, region classification, and post-processing. The YOLO model frames object detection as a regression problem: a single CNN predicts bounding boxes and class probabilities end to end, which makes prediction faster. In this paper, a YOLO-based model is used for object detection in high-resolution remote sensing images. Experiments on the NWPU VHR-10 dataset and our airport/airplane dataset collected from Google Earth show that, compared with the common pipeline, the proposed model speeds up the detection process and achieves good accuracy.
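The post-processing stage mentioned above is commonly implemented as non-maximum suppression (NMS) over the predicted boxes. The abstract does not describe the post-processing actually used, so the following is a generic greedy NMS sketch; the box format `(x1, y1, x2, y2)` and the 0.5 IoU threshold are assumptions.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, iou_thresh=0.5):
    """Greedy NMS: keep the highest-scoring box, drop boxes overlapping it.

    Returns the indices of the kept boxes, in descending score order.
    """
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[j]) <= iou_thresh for j in keep):
            keep.append(i)
    return keep
```

For multi-class detectors, NMS is usually run separately per class so that overlapping objects of different classes are not suppressed.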

