Deep Fusion of DOM and DSM Features for Benggang Discovery

2021 ◽  
Vol 10 (8) ◽  
pp. 556
Author(s):  
Shengyu Shen ◽  
Jiasheng Chen ◽  
Shaoyi Zhang ◽  
Dongbing Cheng ◽  
Zhigang Wang ◽  
...  

Benggang is a typical erosional landform in southern and southeastern China. Since benggang poses significant risks to local ecological environments and economic infrastructure, it is vital to accurately detect benggang-eroded areas. Relying on remote sensing imagery alone cannot produce satisfactory detection results. In this study, we propose integrating high-resolution Digital Orthophoto Map (DOM) and Digital Surface Model (DSM) data for efficient and automatic benggang discovery. The complementary information in the DOM and DSM data is fused by a two-stream convolutional neural network (CNN), which integrates aggregated terrain features and activation image features, both extracted by supervised deep learning. We aggregate local low-level geomorphic features via a supervised diffusion-convolutional embedding branch to obtain expressive representations of benggang terrain variations, while activation image features are obtained from an image-oriented convolutional neural network branch. The two sources of information (DOM and DSM) are fused via a gated neural network, which learns the most discriminative features for benggang detection. Evaluation on a challenging benggang dataset demonstrates that our method outperforms several baselines, even with limited training examples. The results show that fusing DOM and DSM data via supervised convolutional and deep fusion networks is beneficial for benggang detection.
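To make the gated fusion step concrete, here is a minimal PyTorch sketch of gating two feature streams; the module name `GatedFusion`, the feature dimension, and the layer layout are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class GatedFusion(nn.Module):
    """Minimal sketch of gating two feature streams (DOM image features
    and DSM terrain features); dimensions are illustrative assumptions."""
    def __init__(self, dim: int = 256):
        super().__init__()
        # The gate learns, per feature channel, how much to trust each stream.
        self.gate = nn.Sequential(nn.Linear(2 * dim, dim), nn.Sigmoid())

    def forward(self, f_dom: torch.Tensor, f_dsm: torch.Tensor) -> torch.Tensor:
        g = self.gate(torch.cat([f_dom, f_dsm], dim=-1))  # (B, dim), values in [0, 1]
        return g * f_dom + (1.0 - g) * f_dsm              # gated blend of the streams

# Usage: fuse two 256-d feature vectors for a batch of 4 samples.
fusion = GatedFusion(dim=256)
fused = fusion(torch.randn(4, 256), torch.randn(4, 256))
print(fused.shape)  # torch.Size([4, 256])
```

The appeal of such a gate is that, unlike plain concatenation, it lets the network suppress whichever modality is less informative for a given sample.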

Symmetry ◽  
2021 ◽  
Vol 13 (4) ◽  
pp. 703
Author(s):  
Jun Zhang ◽  
Jiaze Liu ◽  
Zhizhong Wang

Owing to the increased use of urban rail transit, the flow of passengers on metro platforms tends to increase sharply during peak periods. Monitoring passenger flow in such areas is important for security-related reasons. In this paper, to solve the problem of metro platform passenger flow detection, we propose a CNN (convolutional neural network)-based network called the MP (metro platform)-CNN to accurately count people on metro platforms. The proposed method is composed of three major components: a group of convolutional neural networks on the front end extracts image features, a multiscale feature extraction module enhances multiscale features, and transposed convolution performs upsampling to generate a high-quality density map. Because existing crowd-counting datasets do not adequately cover the challenging situations considered in this study, we collected images from surveillance videos of a metro platform to form a dataset containing 627 images with 9243 annotated heads. Extensive experiments showed that our method performed well on the self-built dataset, achieving the lowest estimation error, and that it is competitive with other methods on four standard crowd-counting datasets.
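A minimal sketch of the transposed-convolution upsampling head that produces a density map might look as follows; channel counts and layer depth are assumptions, and the paper's exact MP-CNN architecture is not reproduced here.

```python
import torch
import torch.nn as nn

class DensityHead(nn.Module):
    """Sketch of an upsampling head that turns CNN feature maps into a
    single-channel density map; channel counts are assumptions."""
    def __init__(self, in_ch: int = 512):
        super().__init__()
        self.up = nn.Sequential(
            # Each transposed convolution doubles the spatial resolution.
            nn.ConvTranspose2d(in_ch, 256, kernel_size=4, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.ConvTranspose2d(256, 128, kernel_size=4, stride=2, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(128, 1, kernel_size=1),  # 1-channel density map
        )

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        return self.up(feats)

head = DensityHead()
density = head(torch.randn(1, 512, 32, 32))  # stand-in front-end features
# After training, integrating (summing) the density map yields the crowd count.
print(density.shape)  # torch.Size([1, 1, 128, 128])
```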


2021 ◽  
Author(s):  
Malte Oeljeklaus

This thesis investigates methods for traffic scene perception with monocular cameras to build a basic environment model for automated vehicles. The developed approach is designed with special attention to the computational limitations present in practical systems. For this purpose, three scene representations are investigated: the prevalent road topology as the global scene context, the drivable road area, and the detection and spatial reconstruction of other road users. An approach is developed that perceives all three environment representations simultaneously with a multi-task convolutional neural network. The obtained results demonstrate the efficiency of the multi-task approach; in particular, sharing image features across the individual scene representations was found to improve computational performance.
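The shared-backbone idea can be sketched as follows in PyTorch; the ResNet-18 backbone and the three toy heads are stand-ins chosen for illustration, not the thesis's actual network.

```python
import torch
import torch.nn as nn
import torchvision.models as models

class MultiTaskNet(nn.Module):
    """Sketch of a multi-task network: one shared feature extractor feeds
    several task heads; head definitions are illustrative assumptions."""
    def __init__(self, n_topologies: int = 4):
        super().__init__()
        backbone = models.resnet18(weights=None)
        self.features = nn.Sequential(*list(backbone.children())[:-2])  # shared
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.topology_head = nn.Linear(512, n_topologies)    # global scene context
        self.road_head = nn.Conv2d(512, 1, kernel_size=1)    # coarse drivable-area mask
        self.object_head = nn.Conv2d(512, 5, kernel_size=1)  # coarse detection logits

    def forward(self, x):
        f = self.features(x)  # computed once, shared by all task heads
        topo = self.topology_head(self.pool(f).flatten(1))
        return topo, self.road_head(f), self.object_head(f)

net = MultiTaskNet()
topo, road, obj = net(torch.randn(1, 3, 224, 224))
print(topo.shape, road.shape, obj.shape)
```

Because the backbone runs once per image regardless of the number of heads, the marginal cost of each additional task is only its (small) head.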


Sensors ◽  
2020 ◽  
Vol 20 (7) ◽  
pp. 1999 ◽  
Author(s):  
Donghang Yu ◽  
Qing Xu ◽  
Haitao Guo ◽  
Chuan Zhao ◽  
Yuzhun Lin ◽  
...  

Classifying remote sensing images is vital for interpreting image content. Present remote sensing image scene classification methods using convolutional neural networks have drawbacks, including excessive parameters and heavy calculation costs. More efficient and lightweight CNNs have fewer parameters and calculations, but their classification performance is generally weaker. We propose a more efficient and lightweight convolutional neural network method to improve classification accuracy with a small training dataset. Inspired by fine-grained visual recognition, this study introduces a bilinear convolutional neural network model for scene classification. First, the lightweight convolutional neural network MobileNetV2 is used to extract deep, abstract image features. Each feature is then transformed into two features by two different convolutional layers. The transformed features are combined by a Hadamard product to obtain an enhanced bilinear feature. Finally, the bilinear feature, after pooling and normalization, is used for classification. Experiments are performed on three widely used datasets: UC Merced, AID, and NWPU-RESISC45. Compared with other state-of-the-art methods, the proposed method has fewer parameters and calculations while achieving higher accuracy. By including feature fusion with bilinear pooling, performance and accuracy for remote sensing scene classification improve markedly, and the approach could be applied to other remote sensing image classification tasks.
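A rough PyTorch sketch of the Hadamard-product bilinear step described above follows; the projection widths, the signed square-root step, and the class count are assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HadamardBilinear(nn.Module):
    """Sketch of a factorized bilinear head: two 1x1 convolutions project the
    same backbone feature map, the projections are multiplied element-wise
    (Hadamard product), then pooled and normalized before classification."""
    def __init__(self, in_ch: int = 1280, proj_ch: int = 512, n_classes: int = 45):
        super().__init__()
        self.proj_a = nn.Conv2d(in_ch, proj_ch, kernel_size=1)
        self.proj_b = nn.Conv2d(in_ch, proj_ch, kernel_size=1)
        self.fc = nn.Linear(proj_ch, n_classes)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        bilinear = self.proj_a(feats) * self.proj_b(feats)  # Hadamard product
        pooled = bilinear.mean(dim=(2, 3))                  # global average pooling
        # Signed square root plus L2 normalization, common for bilinear features.
        pooled = torch.sign(pooled) * torch.sqrt(pooled.abs() + 1e-8)
        return self.fc(F.normalize(pooled, dim=1))

head = HadamardBilinear()
logits = head(torch.randn(2, 1280, 7, 7))  # e.g. MobileNetV2 output features
print(logits.shape)  # torch.Size([2, 45])
```

The Hadamard form avoids the full outer product of classical bilinear pooling, which is what keeps the parameter and computation budget low.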


Author(s):  
Pramod Sekharan Nair ◽  
Tsrity Asefa Berihu ◽  
Varun Kumar

Gangrene is one of the deadliest diseases on the globe; it is caused by a lack of blood supply to body parts or by infection. Gangrene often affects extremities such as the fingers, toes, and limbs, but there are many cases involving muscles and organs. In this paper, gangrene is classified from given high-resolution images. A convolutional neural network (CNN) is used for feature extraction on the disease images. The first layer of the convolutional neural network captures elementary image features such as dots, edges, and blobs. The intermediate (hidden) layers extract more detailed image features such as shapes, brightness, contrast, and color. Finally, the CNN-extracted features are given to a Support Vector Machine (SVM) to classify the gangrene disease. The experimental results show that the approach adopted in this study performs well and yields acceptable accuracy.
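The CNN-feature-plus-SVM pipeline can be sketched in a few lines; the ResNet-18 extractor and the random stand-in data are assumptions for illustration only.

```python
import torch
import torchvision.models as models
from sklearn.svm import SVC

# Sketch: use a CNN as a fixed feature extractor, then train an SVM on the
# extracted features. The backbone choice and stand-in data are assumptions.
backbone = models.resnet18(weights=None)
backbone.fc = torch.nn.Identity()  # drop the classifier, keep 512-d features
backbone.eval()

images = torch.randn(20, 3, 224, 224)  # stand-in for gangrene images
labels = [0] * 10 + [1] * 10           # stand-in binary labels

with torch.no_grad():
    features = backbone(images).numpy()  # (20, 512) CNN features

clf = SVC(kernel="rbf").fit(features, labels)  # SVM on the CNN features
print(clf.predict(features[:3]))
```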


2018 ◽  
Vol 8 (10) ◽  
pp. 1715 ◽  
Author(s):  
Sivaramakrishnan Rajaraman ◽  
Sema Candemir ◽  
Incheol Kim ◽  
George Thoma ◽  
Sameer Antani

Pneumonia affects 7% of the global population, resulting in 2 million pediatric deaths every year. Chest X-ray (CXR) analysis is routinely performed to diagnose the disease. Computer-aided diagnostic (CADx) tools aim to supplement decision-making. These tools process handcrafted and/or convolutional neural network (CNN)-extracted image features for visual recognition. However, CNNs are perceived as black boxes, since their performance lacks explanation. This is a serious bottleneck in applications involving medical screening/diagnosis, since poorly interpreted model behavior could adversely affect clinical decisions. In this study, we evaluate, visualize, and explain the performance of customized CNNs that detect pneumonia and further differentiate between bacterial and viral types in pediatric CXRs. We present a novel visualization strategy to localize the region of interest (ROI) considered relevant for model predictions across all inputs belonging to an expected class. We statistically validate the models' performance on the underlying tasks. We observe that the customized VGG16 model achieves 96.2% and 93.6% accuracy in detecting the disease and distinguishing between bacterial and viral pneumonia, respectively. The model outperforms the state-of-the-art in all performance metrics and demonstrates reduced bias and improved generalization.
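The paper's visualization strategy is novel and not reproduced here; as a rough stand-in, a standard Grad-CAM-style sketch shows how an ROI heat map can be derived from convolutional feature maps and class-score gradients.

```python
import torch
import torchvision.models as models

# Rough illustration of ROI localization with Grad-CAM, a standard technique
# used here as an assumption; it is NOT the paper's specific strategy.
model = models.vgg16(weights=None).eval()

stash = {}
def save_features(module, inp, out):
    out.retain_grad()  # keep gradients of the last conv feature maps
    stash["maps"] = out
model.features.register_forward_hook(save_features)

x = torch.randn(1, 3, 224, 224)  # stand-in for a chest X-ray
score = model(x)[0].max()        # score of the top predicted class
score.backward()

maps = stash["maps"]                                # (1, 512, 7, 7)
weights = maps.grad.mean(dim=(2, 3), keepdim=True)  # per-channel importance
cam = torch.relu((weights * maps).sum(dim=1))       # (1, 7, 7) heat map
cam = cam / (cam.max() + 1e-8)                      # normalize to [0, 1]
print(cam.shape)
```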


Author(s):  
Muhammad Efan Abdulfattah ◽  
Ledya Novamizanti ◽  
Syamsul Rizal

Disasters in Indonesia are dominated by hydrometeorological disasters, which cause large-scale damage. Through mapping, comprehensive handling can be carried out to support analysis and subsequent action. An Unmanned Aerial Vehicle (UAV) can be used as an aerial mapping tool. However, when the camera or the image processing devices do not meet specifications, the results are not very informative. This research proposes super resolution for aerial imagery based on a Convolutional Neural Network (CNN) with the DCSCN model. The model consists of a Feature Extraction Network for extracting image features and a Reconstruction Network for reconstructing the image. DCSCN's performance is compared with the Super-Resolution CNN (SRCNN). Experiments were carried out on the Set5 dataset with scale factors of 2, 3, and 4, for which SRCNN produced PSNR/SSIM values of 36.66 dB / 0.9542, 32.75 dB / 0.9090, and 30.49 dB / 0.8628, respectively. DCSCN improved these to 37.614 dB / 0.9588, 33.86 dB / 0.9225, and 31.48 dB / 0.8851. Keywords: aerial imagery, deep learning, super resolution
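The PSNR/SSIM comparison can be reproduced with scikit-image; the random images below are stand-ins for a ground-truth/reconstruction pair.

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

# Sketch of the PSNR/SSIM evaluation used to compare SRCNN and DCSCN;
# the arrays are stand-ins, not Set5 data.
rng = np.random.default_rng(0)
ground_truth = rng.random((128, 128))
reconstruction = np.clip(ground_truth + 0.01 * rng.standard_normal((128, 128)), 0, 1)

psnr = peak_signal_noise_ratio(ground_truth, reconstruction, data_range=1.0)
ssim = structural_similarity(ground_truth, reconstruction, data_range=1.0)
print(f"PSNR: {psnr:.2f} dB  SSIM: {ssim:.4f}")
```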


2021 ◽  
Vol 13 (13) ◽  
pp. 2457
Author(s):  
Xuan Wu ◽  
Zhijie Zhang ◽  
Wanchang Zhang ◽  
Yaning Yi ◽  
Chuanrong Zhang ◽  
...  

A convolutional neural network (CNN) is capable of automatically extracting image features and has been widely used in remote sensing image classification. Feature extraction is an important and difficult problem in current research. In this paper, data augmentation was used to avoid overfitting and to enrich sample features, improving the performance of a newly proposed convolutional neural network on the UC-Merced and RSI-CB datasets for remote sensing scene classification. A multiple grouped convolutional neural network (MGCNN) for self-learning that improves CNN efficiency was proposed, along with a method of grouping multiple convolutional layers that can be applied elsewhere as a plug-in module. A hyper-parameter C is introduced in MGCNN to probe the influence of different grouping strategies on feature extraction. Experiments on the two selected datasets, RSI-CB and UC-Merced, were carried out to verify the effectiveness of the proposed network; the accuracy obtained by MGCNN was 2% higher than that of ResNet-50. An attention mechanism was then incorporated into the grouping process to construct a multiple grouped attention convolutional neural network (MGCNN-A) with enhanced generalization capability. The additional experiments indicate that incorporating the attention mechanism into MGCNN slightly improved scene classification accuracy but considerably enhanced the robustness of the proposed network in remote sensing image classification.
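The plug-in grouped-convolution idea, with the grouping hyper-parameter C, can be sketched as follows; channel counts are assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn

class GroupedConvBlock(nn.Module):
    """Sketch of a grouped-convolution block usable as a plug-in module;
    the hyper-parameter C splits the channels into C independent groups."""
    def __init__(self, in_ch: int = 256, out_ch: int = 256, C: int = 4):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1, groups=C),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):
        return self.block(x)

# With C=4, each group convolves only 256/4 = 64 input channels, cutting
# parameters roughly by a factor of C versus a dense 3x3 convolution.
block = GroupedConvBlock(C=4)
print(block(torch.randn(1, 256, 14, 14)).shape)  # torch.Size([1, 256, 14, 14])
```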


2021 ◽  
Vol 303 ◽  
pp. 01058
Author(s):  
Meng-Di Deng ◽  
Rui-Sheng Jia ◽  
Hong-Mei Sun ◽  
Xing-Li Zhang

The resolution of seismic section images directly affects the subsequent interpretation of seismic data. To improve the spatial resolution of low-resolution seismic section images, a super-resolution reconstruction method based on multi-scale convolution is proposed. The method designs a multi-scale convolutional neural network to learn pairs of high- and low-resolution image features, realizing a learned mapping from low-resolution seismic section images to high-resolution ones. The network consists of four convolutional layers and a sub-pixel convolution layer: the convolution operations learn rich seismic section image features, and the sub-pixel convolution layer reconstructs the high-resolution seismic section image. The experimental results show that the proposed method is superior to the comparison methods in peak signal-to-noise ratio (PSNR) and structural similarity (SSIM). In total training time plus reconstruction time, our method requires about 22% less than the FSRCNN method and about 18% less than the ESPCN method.
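A minimal sketch matching the described layout (four convolutional layers followed by a sub-pixel convolution layer) is shown below; kernel sizes and channel widths are assumptions.

```python
import torch
import torch.nn as nn

class SeismicSR(nn.Module):
    """Sketch of the described layout: four convolutional layers learn
    features, then a sub-pixel (PixelShuffle) layer reconstructs the
    high-resolution section; layer sizes are assumptions."""
    def __init__(self, scale: int = 2):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(1, 64, kernel_size=5, padding=2), nn.ReLU(inplace=True),
            nn.Conv2d(64, 64, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 32, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, scale * scale, kernel_size=3, padding=1),
        )
        self.shuffle = nn.PixelShuffle(scale)  # rearranges channels into pixels

    def forward(self, lr):
        return self.shuffle(self.body(lr))

model = SeismicSR(scale=2)
hr = model(torch.randn(1, 1, 64, 64))  # low-resolution section patch
print(hr.shape)                        # torch.Size([1, 1, 128, 128])
```

Because the network operates at low resolution until the final shuffle, this design is typically cheaper than upsampling first and then convolving, which is consistent with the reported speed advantage.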


Author(s):  
D. Clermont ◽  
M. Dorozynski ◽  
D. Wittich ◽  
F. Rottensteiner

Abstract. This paper proposes several methods for training a Convolutional Neural Network (CNN) for learning the similarity between images of silk fabrics based on multiple semantic properties of the fabrics. In the context of the EU H2020 project SILKNOW (http://silknow.eu/), two variants of training were developed, one based on a Siamese CNN and one based on a triplet architecture. We propose different definitions of similarity and different loss functions for both training strategies, some of them also allowing the use of incomplete information about the training data. We assess the quality of the trained model by using the learned image features in a k-NN classification. We achieve overall accuracies of 93–95% and average F1-scores of 87–92%.
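The triplet objective can be sketched with PyTorch's built-in loss; the toy embedding network and margin value are illustrative assumptions, not the SILKNOW models.

```python
import torch
import torch.nn as nn

# Sketch of the triplet training objective: pull an anchor image's embedding
# toward a similar fabric (positive) and away from a dissimilar one (negative).
embed = nn.Sequential(nn.Flatten(), nn.Linear(3 * 64 * 64, 128))
criterion = nn.TripletMarginLoss(margin=1.0)

anchor = embed(torch.randn(8, 3, 64, 64))
positive = embed(torch.randn(8, 3, 64, 64))  # same semantic properties
negative = embed(torch.randn(8, 3, 64, 64))  # different properties

loss = criterion(anchor, positive, negative)
loss.backward()
print(loss.item())
```

After training, the learned embeddings can be indexed directly for the k-NN classification used in the evaluation.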

