Semantic Segmentation of Satellite Images: A Deep Learning Approach Integrated with Geospatial Hash Codes

Naisen Yang; Hong Tang

doi:10.3390/rs13142723

Remote Sensing ◽

10.3390/rs13142723 ◽

2021 ◽

Vol 13 (14) ◽

pp. 2723

Author(s):

Naisen Yang ◽

Hong Tang

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Deep Neural Network ◽

Satellite Images ◽

Spatial Information ◽

Semantic Segmentation ◽

Binary Codes ◽

Underlying Assumption ◽

Hash Codes

Satellite images are typically partitioned into smaller regular patches that are fed individually into deep neural networks (DNNs) for semantic segmentation. The underlying assumption is that these patches are independent of one another in terms of geographic spatial information. However, it is well known that many land-cover or land-use categories share common regional characteristics within a certain spatial scale. For example, the style of buildings may change from one city or country to another. In this paper, we explore some deep learning approaches integrated with geospatial hash codes to improve the semantic segmentation results of satellite images. Specifically, the geographic coordinates of satellite images are encoded into a string of binary codes using the geohash method. Then, the binary codes of the geographic coordinates are fed into the deep neural network using three different methods in order to enhance the semantic segmentation ability of the deep neural network for satellite images. Experiments on three datasets demonstrate the effectiveness of embedding geographic coordinates into the neural networks. Our method yields a significant improvement over previous methods that do not use geospatial information.
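The geohash encoding mentioned in the abstract can be illustrated with a minimal sketch. It follows the standard geohash bisection scheme (bits alternate between longitude and latitude, starting with longitude). The function name `geohash_bits` and the choice of bit length are assumptions made for illustration; the abstract does not state the code length or how the three feeding methods consume the bits.

```python
def geohash_bits(lat, lon, n_bits):
    """Return the first n_bits of the geohash of (lat, lon) as a list of 0/1.

    Hypothetical helper: standard geohash bisection, not the authors' code.
    """
    lat_range = [-90.0, 90.0]
    lon_range = [-180.0, 180.0]
    bits = []
    for i in range(n_bits):
        # Even bit positions refine longitude, odd positions refine latitude.
        rng, value = (lon_range, lon) if i % 2 == 0 else (lat_range, lat)
        mid = (rng[0] + rng[1]) / 2.0
        if value >= mid:
            bits.append(1)
            rng[0] = mid  # keep the upper half of the interval
        else:
            bits.append(0)
            rng[1] = mid  # keep the lower half of the interval
    return bits


# First 10 bits for a point whose geohash starts with "u4"
print(geohash_bits(57.64911, 10.40744, 10))  # [1, 1, 0, 1, 0, 0, 0, 1, 0, 0]
```

Nearby locations share a long common prefix of bits, which is presumably what lets a network associate the code with regional characteristics.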

Teeth and Landmarks Detection and Classification Based on Deep Neural Networks

Computational Techniques for Dental Image Analysis - Advances in Medical Technologies and Clinical Practice ◽

10.4018/978-1-5225-6243-6.ch006 ◽

2019 ◽

pp. 129-150

Author(s):

Lyudmila N. Tuzova ◽

Dmitry V. Tuzoff ◽

Sergey I. Nikolenko ◽

Alexey S. Krasnov

Keyword(s):

Neural Network ◽

Neural Networks ◽

Image Analysis ◽

Deep Learning ◽

Deep Neural Network ◽

Deep Neural Networks ◽

Rapid Development ◽

Image Interpretation ◽

Semantic Segmentation ◽

Recent Decade

Over the past decade, deep neural networks have enjoyed rapid development in various domains, including medicine. Convolutional neural networks (CNNs), deep neural network structures commonly used for image interpretation, brought a breakthrough in computer vision and became state-of-the-art techniques for various image recognition tasks, such as image classification, object detection, and semantic segmentation. In this chapter, the authors provide an overview of deep learning algorithms and review the available literature on dental image analysis with CNN-based methods. The study focuses on the problems of landmark and tooth detection and classification, as these tasks are an essential part of dental image interpretation both in clinical dentistry and in human identification systems based on dental biometric information.

Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images

Mobile Networks and Applications ◽

10.1007/s11036-020-01703-3 ◽

2021 ◽

Vol 26 (1) ◽

pp. 200-215

Author(s):

Muhammad Alam ◽

Jian-Feng Wang ◽

Cong Guangpei ◽

LV Yunrong ◽

Yuanfang Chen

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Image Processing ◽

Deep Learning ◽

Semantic Segmentation ◽

Natural Scene ◽

Remote Sensing Images ◽

Advantages And Disadvantages ◽

Target Segmentation

In recent years, the success of deep learning in natural scene image processing has boosted its application in the analysis of remote sensing images. In this paper, we apply Convolutional Neural Networks (CNNs) to the semantic segmentation of remote sensing images. We improve the Encoder-Decoder CNN structure SegNet with index pooling and U-net to make them suitable for multi-target semantic segmentation of remote sensing images. The results show that these two models have their own advantages and disadvantages in the segmentation of different objects. In addition, we propose an integrated algorithm that combines these two models. Experimental results show that the presented integrated algorithm can exploit the advantages of both models for multi-target segmentation and achieve better segmentation than either model alone.

Uncertainty-Based Human-in-the-Loop Deep Learning for Land Cover Segmentation

Remote Sensing ◽

10.3390/rs12223836 ◽

2020 ◽

Vol 12 (22) ◽

pp. 3836

Author(s):

Carlos García Rodríguez ◽

Jordi Vitrià ◽

Oscar Mora

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Land Cover ◽

Satellite Images ◽

State Of The Art ◽

Low Cost ◽

Human Intervention ◽

Human In The Loop ◽

Learning Techniques

In recent years, different deep learning techniques have been applied to segment aerial and satellite images. Nevertheless, state-of-the-art techniques for land cover segmentation do not provide results accurate enough to be used in real applications. This is a problem faced by institutions and companies that want to replace time-consuming and exhausting human work with AI technology. In this work, we propose a method that combines deep learning with a human-in-the-loop strategy to achieve expert-level results at a low cost. We use a neural network to segment the images. In parallel, another network is used to measure uncertainty for predicted pixels. Finally, we combine these neural networks with a human-in-the-loop approach to produce correct predictions as if developed by human photointerpreters. Applying this methodology shows that we can increase the accuracy of land cover segmentation tasks while decreasing human intervention.
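The idea of routing only uncertain pixels to a human can be sketched as follows. The abstract says a second network measures uncertainty; this sketch swaps in predictive entropy of the segmentation output as a stand-in, and the function names and the threshold value are assumptions for illustration only.

```python
import math


def predictive_entropy(probs):
    """Shannon entropy (in nats) of one pixel's class-probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def pixels_for_review(prob_map, threshold):
    """Return (row, col) positions whose entropy exceeds the threshold.

    prob_map is a 2D grid (list of rows) of per-pixel probability vectors.
    Entropy is an assumed proxy; the paper uses a separate uncertainty network.
    """
    flagged = []
    for r, row in enumerate(prob_map):
        for c, probs in enumerate(row):
            if predictive_entropy(probs) > threshold:
                flagged.append((r, c))
    return flagged


# Two-class toy example: confident pixels are accepted, ambiguous ones flagged.
prob_map = [
    [[0.98, 0.02], [0.50, 0.50]],
    [[0.90, 0.10], [0.55, 0.45]],
]
print(pixels_for_review(prob_map, threshold=0.5))  # [(0, 1), (1, 1)]
```

Raising the threshold sends fewer pixels to the photointerpreter, which is the trade-off between accuracy and human intervention that the abstract describes.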

Image Classification: A Survey

Journal of Informatics Electrical and Electronics Engineering (JIEEE) ◽

10.54060/jieee/001.02.002 ◽

2020 ◽

Vol 1 (2) ◽

pp. 1-9

Author(s):

Ankita Singh ◽

Pawan Singh

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Image Classification ◽

Convolutional Neural Networks ◽

Language Processing ◽

Classification Accuracy ◽

Deep Neural Network ◽

Learning Ability ◽

Final Decision

Image classification is a central topic in artificial vision systems and has drawn considerable interest in recent years. The field aims to classify an input image based on its visual content. Traditionally, most approaches relied on hand-crafted features to describe an image; learnable classifiers, such as random forests and decision trees, were then applied to the extracted features to reach a final decision. The problem arises when large numbers of images are involved: finding suitable features becomes too difficult. This is one of the reasons deep neural network models were introduced. Deep learning makes it feasible to represent the hierarchical nature of features using a varying number of layers and their corresponding weights. Existing image classification methods have gradually been applied to real-world problems, but they suffer from various issues in application, such as unsatisfactory results, low classification accuracy, and weak adaptive ability. Models based on deep learning have a robust learning ability: they combine feature extraction and classification into a single process that completes the image classification task, which can effectively improve classification accuracy. Convolutional neural networks are a powerful deep neural network technique. These networks preserve the spatial structure of a problem and were built for object recognition tasks such as classifying an image into its respective class. Neural networks are well known because they achieve state-of-the-art results on complex computer vision and natural language processing tasks. Convolutional neural networks have been used extensively.

Assessment of Landslide Susceptibility Combining Deep Learning with Semi-Supervised Learning in Jiaohe County, Jilin Province, China

Applied Sciences ◽

10.3390/app10165640 ◽

2020 ◽

Vol 10 (16) ◽

pp. 5640

Author(s):

Jingyu Yao ◽

Shengwu Qin ◽

Shuangshuang Qiao ◽

Wenchao Che ◽

Yang Chen ◽

...

Keyword(s):

Neural Network ◽

Deep Learning ◽

Supervised Learning ◽

Landslide Susceptibility ◽

Deep Neural Network ◽

Spatial Information ◽

Information Gain ◽

Jilin Province ◽

Landslide Susceptibility Mapping ◽

Support Vector

Accurate and timely landslide susceptibility mapping (LSM) is essential to effectively reduce the risk of landslides. In recent years, deep learning has been successfully applied to landslide susceptibility assessment owing to its strong fitting ability. However, in actual applications, the number of labeled samples is usually not sufficient for training. In this paper, a deep neural network model based on semi-supervised learning (SSL-DNN) for landslide susceptibility is proposed, which makes full use of a large amount of spatial information (unlabeled data) together with limited labeled data in the region to train the model. Taking Jiaohe County in Jilin Province, China as an example, the landslide inventory from 2000 to 2017 was collected and 12 metrological, geographical, and human explanatory factors were compiled. Meanwhile, supervised models such as deep neural network (DNN), support vector machine (SVM), and logistic regression (LR) were implemented for comparison. Then, the landslide susceptibility was plotted and a series of evaluation tools such as class accuracy, predictive rate curves (AUC), and information gain ratio (IGR) were calculated to compare the predictions of the models and the factors. Experimental results indicate that the proposed SSL-DNN model (AUC = 0.898) outperformed all the comparison models. Therefore, semi-supervised deep learning could be considered a potential approach for LSM.
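To make the reported AUC figure concrete, here is a minimal sketch of the standard rank-based interpretation of the area under the ROC curve: the probability that a randomly chosen positive (landslide) sample receives a higher susceptibility score than a randomly chosen negative one. The function name `roc_auc` and the toy scores are illustrative assumptions; the abstract does not describe how the authors computed their curves.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count as 0.5).

    scores: predicted susceptibility values; labels: 1 = landslide, 0 = none.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# One of the four positive/negative pairs is ranked in the wrong order.
print(roc_auc([0.9, 0.4, 0.6, 0.1], [1, 1, 0, 0]))  # 0.75
```

The pairwise loop is quadratic in the number of samples, which is fine for a sketch but not for large susceptibility maps.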

Interpretability of deep convolutional neural networks on rolling bearing fault diagnosis

Measurement Science and Technology ◽

10.1088/1361-6501/ac41a5 ◽

2021 ◽

Author(s):

Huixin Yang ◽

Xiang Li ◽

Wei Zhang

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Fault Diagnosis ◽

Deep Neural Network ◽

Rapid Development ◽

Saliency Map ◽

Data Driven ◽

Experimental Investigations ◽

Deep Convolutional Neural Networks

Despite the rapid development of deep learning-based intelligent fault diagnosis methods for rotating machinery, the data-driven approach generally remains a "black box" to researchers, and its internal mechanism has not been sufficiently understood. This weak interpretability significantly impedes further development and application of effective deep neural network-based methods. This paper contributes to understanding how deep learning processes mechanical signals in fault diagnosis problems. The diagnostic knowledge learned by the deep neural network is visualized using the neuron activation maximization and saliency map methods. The discriminative features of different machine health conditions are intuitively observed. The relationship between the data-driven methods and well-established conventional fault diagnosis knowledge is confirmed by experimental investigations on two datasets. The results of this study can help researchers understand complex neural networks and increase the reliability of data-driven fault diagnosis models in real engineering cases.

GAIT RECOGNITION BASED ON CONVOLUTIONAL NEURAL NETWORKS

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-2-w4-207-2017 ◽

2017 ◽

Vol XLII-2/W4 ◽

pp. 207-212 ◽

Cited By ~ 10

Author(s):

A. Sokolova ◽

A. Konushin

Keyword(s):

Neural Network ◽

Neural Networks ◽

Feature Extraction ◽

Deep Learning ◽

Deep Neural Network ◽

Gait Recognition ◽

Learning Approach ◽

Network Architectures ◽

Motion Information ◽

Advantages And Disadvantages

In this work we investigate the problem of recognizing people by their gait. For this task, we implement a deep learning approach that uses optical flow as the main source of motion information and combines neural feature extraction with an additional embedding of descriptors to improve the representation. To find the best heuristics, we compare several deep neural network architectures and learning and classification strategies. The experiments were conducted on two popular gait recognition datasets, so we investigate their advantages and disadvantages and the transferability of the considered methods.

DETECTION AND MONITORING OF BEACH LITTER USING UAV IMAGE AND DEEP NEURAL NETWORK

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-3-w8-55-2019 ◽

2019 ◽

Vol XLII-3/W8 ◽

pp. 55-58 ◽

Cited By ~ 1

Author(s):

S. H. Bak ◽

D. H. Hwang ◽

H. M. Kim ◽

H. J. Yoon

Keyword(s):

Neural Network ◽

Deep Neural Network ◽

Spatial Information ◽

Semantic Segmentation ◽

Detection Algorithm ◽

Monitoring Process ◽

Neural Network Algorithm ◽

Beach Litter ◽

Trained Neural Network ◽

Uav Image

Beach litter destroys marine ecosystems and creates aesthetic discomfort that lowers the value of the beach. To solve the beach litter problem, it is necessary to study the generation and distribution pattern of waste and the cause of the inflow. However, the data for such studies are only sample data collected in some areas of the beach. Also, most of the data cover only the total amount of beach litter. Unmanned aerial vehicles (UAVs) and deep neural networks can be used effectively to detect and monitor beach litter. Using a UAV, it is possible to easily survey the entire beach. The deep neural network can also identify the type of coastal litter. Therefore, using a UAV and a deep neural network, it is possible to acquire spatial information by type of beach litter. This paper proposes a beach litter detection algorithm based on a UAV and a deep neural network, and a beach litter monitoring process that uses it. It also offers the optimal shooting altitude and film duplication to detect small beach litter such as plastic bottles and styrofoam pieces found on the beach. In this study, a DJI Mavic 2 Pro was used. The camera on the UAV has a 1-inch CMOS sensor with a resolution of 20 MP. The images obtained by the UAV are produced as orthoimages and input into a pre-trained neural network algorithm. The deep neural network used for beach litter detection is a convolutional neural network with the fully connected layer removed, for semantic segmentation.

A deep neural network approach to predicting clinical outcomes of neuroblastoma patients

10.1101/750364 ◽

2019 ◽

Author(s):

Léon-Charles Tranchevent ◽

Francisco Azuaje ◽

Jagath C. Rajapakse

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Clinical Outcomes ◽

Deep Neural Network ◽

Superior Performance ◽

Support Vector ◽

Disease Etiology ◽

Topological Features ◽

Wide Range

The availability of high-throughput omics datasets from large patient cohorts has allowed the development of methods that aim at predicting patient clinical outcomes, such as survival and disease recurrence. Such methods are also important to better understand the biological mechanisms underlying disease etiology and development, as well as treatment responses. Recently, different predictive models, relying on distinct algorithms (including Support Vector Machines and Random Forests) have been investigated. In this context, deep learning strategies are of special interest due to their demonstrated superior performance over a wide range of problems and datasets. One of the main challenges of such strategies is the “small n large p” problem. Indeed, omics datasets typically consist of small numbers of samples and large numbers of features relative to typical deep learning datasets. Neural networks usually tackle this problem through feature selection or by including additional constraints during the learning process. We propose to tackle this problem with a novel strategy that relies on a graph-based method for feature extraction, coupled with a deep neural network for clinical outcome prediction. The omics data are first represented as graphs whose nodes represent patients, and edges represent correlations between the patients’ omics profiles. Topological features, such as centralities, are then extracted from these graphs for every node. Lastly, these features are used as input to train and test various classifiers. We apply this strategy to four neuroblastoma datasets and observe that models based on neural networks are more accurate than state-of-the-art models (DNN: 85%-87%, SVM/RF: 75%-82%). We explore how different parameters and configurations are selected in order to overcome the effects of the small data problem as well as the curse of dimensionality. Our results indicate that the deep neural networks capture complex features in the data that help predict patient clinical outcomes.
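The graph-based feature extraction step can be sketched roughly as follows: patients become nodes, an edge joins two patients whose omics profiles correlate strongly, and a centrality is read off per node. Using Pearson correlation, a fixed threshold, and degree centrality are all assumptions chosen for illustration; the abstract only says that edges represent correlations and that centralities are among the extracted features.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx > 0 and sy > 0 else 0.0


def degree_centrality(profiles, threshold):
    """Degree centrality per patient in a thresholded correlation graph.

    Hypothetical helper: an edge is added when the correlation is >= threshold.
    Requires at least two patients.
    """
    n = len(profiles)
    degree = [0] * n
    for i in range(n):
        for j in range(i + 1, n):
            if pearson(profiles[i], profiles[j]) >= threshold:
                degree[i] += 1
                degree[j] += 1
    return [d / (n - 1) for d in degree]


# Patients 0 and 1 have proportional profiles; patient 2 is anti-correlated.
profiles = [[1, 2, 3, 4], [2, 4, 6, 8], [4, 3, 2, 1]]
print(degree_centrality(profiles, threshold=0.5))  # [0.5, 0.5, 0.0]
```

Each patient is then described by a handful of topological values rather than thousands of omics measurements, which is how this kind of approach sidesteps the "small n large p" problem before a classifier is trained.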

Tri-net for Semi-Supervised Deep Learning

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/278 ◽

2018 ◽

Cited By ~ 11

Author(s):

Dong-Dong Chen ◽

Wei Wang ◽

Wei Gao ◽

Zhi-Hua Zhou

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Error Rate ◽

Deep Neural Network ◽

Deep Neural Networks ◽

State Of The Art ◽

Fine Tuning ◽

Learning Methods ◽

Model Initialization

Deep neural networks have witnessed great successes in various real applications, but they require a large amount of labeled data for training. In this paper, we propose tri-net, a deep neural network which is able to use massive unlabeled data to help learning with limited labeled data. We consider model initialization, diversity augmentation and pseudo-label editing simultaneously. In our work, we utilize output smearing to initialize modules, use fine-tuning on labeled data to augment diversity and eliminate unstable pseudo-labels to alleviate the influence of suspicious pseudo-labeled data. Experiments show that our method achieves the best performance in comparison with state-of-the-art semi-supervised deep learning methods. In particular, it achieves an 8.30% error rate on CIFAR-10 using only 4000 labeled examples.
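The abstract does not spell out how pseudo-labels are assigned. As a hedged illustration, the sketch below shows the agreement rule from classic tri-training, assumed here as the starting point: an unlabeled sample is pseudo-labeled for one module when the other two modules predict the same class. The function name is hypothetical, and the sketch leaves out output smearing, confidence filtering, and the pseudo-label editing step the abstract mentions.

```python
def pseudo_labels_for(target, predictions):
    """Pseudo-labels for module `target` (0, 1 or 2) on unlabeled samples.

    predictions: three lists of predicted class labels, one list per module.
    Returns {sample_index: label} wherever the two other modules agree.
    Assumed rule (classic tri-training), not taken from the abstract.
    """
    a, b = [m for m in range(3) if m != target]
    return {
        i: label_a
        for i, (label_a, label_b) in enumerate(zip(predictions[a], predictions[b]))
        if label_a == label_b
    }


# Predictions of the three modules on four unlabeled samples.
predictions = [
    [0, 1, 2, 1],
    [0, 2, 2, 1],
    [1, 2, 2, 0],
]
print(pseudo_labels_for(0, predictions))  # {1: 2, 2: 2}
```

Each module is then retrained on the labeled data plus its own pseudo-labeled set, so disagreement between modules is what keeps the three from collapsing into copies of one another.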
