Learning a Dynamic High-Resolution Network for Multi-Scale Pedestrian Detection

Although image inpainting based on the generated adversarial network (GAN) has made great breakthroughs in accuracy and speed in recent years, they can only process low-resolution images because of memory limitations and difficulty in training. For high-resolution images, the inpainted regions become blurred and the unpleasant boundaries become visible. Based on the current advanced image generation network, we proposed a novel high-resolution image inpainting method based on multi-scale neural network. This method is a two-stage network including content reconstruction and texture detail restoration. After holding the visually believable fuzzy texture, we further restore the finer details to produce a smoother, clearer, and more coherent inpainting result. Then we propose a special application scene of image inpainting, that is, to delete the redundant pedestrians in the image and ensure the reality of background restoration. It involves pedestrian detection, identifying redundant pedestrians and filling in them with the seemingly correct content. To improve the accuracy of image inpainting in the application scene, we proposed a new mask dataset, which collected the characters in COCO dataset as a mask. Finally, we evaluated our method on COCO and VOC dataset. the experimental results show that our method can produce clearer and more coherent inpainting results, especially for high-resolution images, and the proposed mask dataset can produce better inpainting results in the special application scene.

Download Full-text

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Sensors ◽

10.3390/s21051820 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1820

Author(s):

Xiaotao Shao ◽

Qing Wang ◽

Wei Yang ◽

Yun Chen ◽

Yi Xie ◽

...

Keyword(s):

Semantic Information ◽

Detection System ◽

Pedestrian Detection ◽

Detection Accuracy ◽

The Public ◽

Scale Feature ◽

Detection Algorithms ◽

Multi Scale ◽

Art Works ◽

Feature Pyramid

The existing pedestrian detection algorithms cannot effectively extract features of heavily occluded targets which results in lower detection accuracy. To solve the heavy occlusion in crowds, we propose a multi-scale feature pyramid network based on ResNet (MFPN) to enhance the features of occluded targets and improve the detection accuracy. MFPN includes two modules, namely double feature pyramid network (FPN) integrated with ResNet (DFR) and repulsion loss of minimum (RLM). We propose the double FPN which improves the architecture to further enhance the semantic information and contours of occluded pedestrians, and provide a new way for feature extraction of occluded targets. The features extracted by our network can be more separated and clearer, especially those heavily occluded pedestrians. Repulsion loss is introduced to improve the loss function which can keep predicted boxes away from the ground truths of the unrelated targets. Experiments carried out on the public CrowdHuman dataset, we obtain 90.96% AP which yields the best performance, 5.16% AP gains compared to the FPN-ResNet50 baseline. Compared with the state-of-the-art works, the performance of the pedestrian detection system has been boosted with our method.

Download Full-text

Poleward Transport of African Dust to the Iberian Peninsula Organized by Multi-scale Terrain-Induced Circulations: Observations and High-Resolution WRF-Chem Simulation Analyses

10.1002/essoar.10505212.1 ◽

2020 ◽

Author(s):

Saroj Dhital ◽

Michael L. Kaplan ◽

Jose Antonio Garcia Orza ◽

Stephanie Fiedler

Keyword(s):

High Resolution ◽

Iberian Peninsula ◽

Multi Scale ◽

African Dust

Download Full-text

High-Resolution SAR Image Classification Using Multi-Scale Deep Feature Fusion and Covariance Pooling Manifold Network

Remote Sensing ◽

10.3390/rs13020328 ◽

2021 ◽

Vol 13 (2) ◽

pp. 328

Author(s):

Wenkai Liang ◽

Yan Wu ◽

Ming Li ◽

Yice Cao ◽

Xin Hu

Keyword(s):

High Resolution ◽

Image Classification ◽

Feature Fusion ◽

Representation Learning ◽

Sar Image ◽

Gabor Filtering ◽

Feature Maps ◽

Sar Images ◽

Multi Scale ◽

Deep Feature

The classification of high-resolution (HR) synthetic aperture radar (SAR) images is of great importance for SAR scene interpretation and application. However, the presence of intricate spatial structural patterns and complex statistical nature makes SAR image classification a challenging task, especially in the case of limited labeled SAR data. This paper proposes a novel HR SAR image classification method, using a multi-scale deep feature fusion network and covariance pooling manifold network (MFFN-CPMN). MFFN-CPMN combines the advantages of local spatial features and global statistical properties and considers the multi-feature information fusion of SAR images in representation learning. First, we propose a Gabor-filtering-based multi-scale feature fusion network (MFFN) to capture the spatial pattern and get the discriminative features of SAR images. The MFFN belongs to a deep convolutional neural network (CNN). To make full use of a large amount of unlabeled data, the weights of each layer of MFFN are optimized by unsupervised denoising dual-sparse encoder. Moreover, the feature fusion strategy in MFFN can effectively exploit the complementary information between different levels and different scales. Second, we utilize a covariance pooling manifold network to extract further the global second-order statistics of SAR images over the fusional feature maps. Finally, the obtained covariance descriptor is more distinct for various land covers. Experimental results on four HR SAR images demonstrate the effectiveness of the proposed method and achieve promising results over other related algorithms.

Download Full-text

Design of Airborne Multi-Scale Wide-Field-of-View and High-Resolution Imaging System

Acta Optica Sinica ◽

10.3788/aos202141.0208002 ◽

2021 ◽

Vol 41 (2) ◽

pp. 0208002

Author(s):

李江勇 Li Jiangyong ◽

冯位欣 Feng Weixin ◽

刘飞 Liu Fei ◽

魏雅喆 Wei Yazhe ◽

邵晓鹏 Shao Xiaopeng

Keyword(s):

High Resolution ◽

Imaging System ◽

Field Of View ◽

Wide Field ◽

High Resolution Imaging ◽

Multi Scale ◽

Wide Field Of View ◽

Resolution Imaging

Download Full-text

A Hardware Accelerator for Real Time Sliding Window Based Pedestrian Detection on High Resolution Images

VLSI-SoC: Design for Reliability, Security, and Low Power - IFIP Advances in Information and Communication Technology ◽

10.1007/978-3-319-46097-0_3 ◽

2016 ◽

pp. 46-66 ◽

Cited By ~ 1

Author(s):

Asim Khan ◽

Muhammad Umar Karim Khan ◽

Muhammad Bilal ◽

Chong-Min Kyung

Keyword(s):

High Resolution ◽

Real Time ◽

Pedestrian Detection ◽

Sliding Window ◽

Hardware Accelerator ◽

High Resolution Images

Download Full-text

Multi-scale Semantic Segmentation Enriched Features for Pedestrian Detection

2018 24th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr.2018.8545414 ◽

2018 ◽

Author(s):

Xiaolu Xie ◽

Zengfu Wang

Keyword(s):

Pedestrian Detection ◽

Semantic Segmentation ◽

Multi Scale

Download Full-text

High-Resolution Image Classification Using the Dynamic Differential Evolutionary Algorithm Optimized Multi-scale Kernel Support Vector Machine Method

Advances in Brain Inspired Cognitive Systems - Lecture Notes in Computer Science ◽

10.1007/978-3-030-00563-4_32 ◽

2018 ◽

pp. 334-341

Author(s):

Xueqian Rong ◽

Aizhu Zhang ◽

Genyun Sun ◽

Hui Huang ◽

Ping Ma

Keyword(s):

Support Vector Machine ◽

High Resolution ◽

Support Vector ◽

Machine Method ◽

Support Vector Machine Method ◽

Resolution Image ◽

Multi Scale ◽

High Resolution Image ◽

Kernel Support Vector Machine ◽

Differential Evolutionary Algorithm

Download Full-text

Semantic Segmentation Network for Surface Defect Detection of Automobile Wheel Hub Fusing High-Resolution Feature and Multi-Scale Feature

Applied Sciences ◽

10.3390/app112210508 ◽

2021 ◽

Vol 11 (22) ◽

pp. 10508

Author(s):

Chaowei Tang ◽

Xinxin Feng ◽

Haotian Wen ◽

Xu Zhou ◽

Yanqing Shao ◽

...

Keyword(s):

High Resolution ◽

Defect Detection ◽

Automobile Industry ◽

Surface Defect ◽

Semantic Segmentation ◽

The Body ◽

Multi Scale ◽

Surface Defect Detection ◽

Edge Features ◽

Automobile Wheel

Surface defect detection of an automobile wheel hub is important to the automobile industry because these defects directly affect the safety and appearance of automobiles. At present, surface defect detection networks based on convolutional neural network use many pooling layers when extracting features, reducing the spatial resolution of features and preventing the accurate detection of the boundary of defects. On the basis of DeepLab v3+, we propose a semantic segmentation network for the surface defect detection of an automobile wheel hub. To solve the gridding effect of atrous convolution, the high-resolution network (HRNet) is used as the backbone network to extract high-resolution features, and the multi-scale features extracted by the Atrous Spatial Pyramid Pooling (ASPP) of DeepLab v3+ are superimposed. On the basis of the optical flow, we decouple the body and edge features of the defects to accurately detect the boundary of defects. Furthermore, in the upsampling process, a decoder can accurately obtain detection results by fusing the body, edge, and multi-scale features. We use supervised training to optimize these features. Experimental results on four defect datasets (i.e., wheels, magnetic tiles, fabrics, and welds) show that the proposed network has better F1 score, average precision, and intersection over union than SegNet, Unet, and DeepLab v3+, proving that the proposed network is effective for different defect detection scenarios.

Download Full-text

MULTI-SCALE SEGMENTATION OF HIGH RESOLUTION REMOTE SENSING IMAGES BY INTEGRATING MULTIPLE FEATURES

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-1-w1-247-2017 ◽

2017 ◽

Vol XLII-1/W1 ◽

pp. 247-255 ◽

Cited By ~ 1

Author(s):

Y. Di ◽

G. Jiang ◽

L. Yan ◽

H. Liu ◽

S. Zheng

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Minimum Spanning Tree ◽

Network Evolution ◽

Edge Weight ◽

Remote Sensing Images ◽

Canny Operator ◽

Multiple Features ◽

Multi Scale ◽

Initial Segmentation

Most of multi-scale segmentation algorithms are not aiming at high resolution remote sensing images and have difficulty to communicate and use layers’ information. In view of them, we proposes a method of multi-scale segmentation of high resolution remote sensing images by integrating multiple features. First, Canny operator is used to extract edge information, and then band weighted distance function is built to obtain the edge weight. According to the criterion, the initial segmentation objects of color images can be gained by Kruskal minimum spanning tree algorithm. Finally segmentation images are got by the adaptive rule of Mumford–Shah region merging combination with spectral and texture information. The proposed method is evaluated precisely using analog images and ZY-3 satellite images through quantitative and qualitative analysis. The experimental results show that the multi-scale segmentation of high resolution remote sensing images by integrating multiple features outperformed the software eCognition fractal network evolution algorithm (highest-resolution network evolution that FNEA) on the accuracy and slightly inferior to FNEA on the efficiency.

Download Full-text