Cloud Detection for Satellite Imagery Using Attention-Based U-Net Convolutional Neural Network

Symmetry ◽  
2020 ◽  
Vol 12 (6) ◽  
pp. 1056
Author(s):  
Yanan Guo ◽  
Xiaoqun Cao ◽  
Bainian Liu ◽  
Mei Gao

Cloud detection is an important and difficult task in the pre-processing of satellite remote sensing data. The results of traditional cloud detection methods are often unsatisfactory in complex environments or in the presence of various noise disturbances. With the rapid development of artificial intelligence technology, deep learning methods have achieved great success in many fields such as image processing, speech recognition, and autonomous driving. This study proposes a deep learning model suitable for cloud detection, Cloud-AttU, which is based on a U-Net network and incorporates an attention mechanism. The Cloud-AttU model adopts a symmetric Encoder-Decoder structure that fuses high-level and low-level features through skip connections, so the output contains richer multi-scale information. This symmetrical network structure is concise and stable, significantly enhancing the effect of image segmentation. Based on the characteristics of cloud detection, the model is improved by introducing an attention mechanism that allows the model to learn more effective features and distinguish between cloud and non-cloud pixels more accurately. The experimental results show that the proposed method has a significant accuracy advantage over traditional cloud detection methods. It also achieves strong results in the presence of snow/ice disturbance and other bright non-cloud objects, with strong resistance to disturbance. The Cloud-AttU model achieves excellent results on cloud detection tasks, indicating that this symmetric network architecture has great potential for application in satellite image processing and deserves further research.
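The attention mechanism on a U-Net skip connection can be illustrated with a minimal pure-Python sketch of an additive attention gate. This is a generic illustration, not the Cloud-AttU implementation: the function name `attention_gate` and the scalar weights (standing in for learned 1×1 convolutions) are assumptions for clarity.

```python
import math

def attention_gate(skip, gating, w_x=1.0, w_g=1.0, w_psi=1.0):
    """Additive attention gate on a U-Net skip connection (sketch).

    skip:   per-pixel encoder features carried over the skip connection
    gating: per-pixel decoder features used as the gating signal
    Scalar weights stand in for the learned 1x1 convolutions of a real gate.
    """
    gated = []
    for x, g in zip(skip, gating):
        q = max(0.0, w_x * x + w_g * g)              # ReLU(W_x x + W_g g)
        alpha = 1.0 / (1.0 + math.exp(-w_psi * q))   # sigmoid attention coefficient
        gated.append(x * alpha)                      # reweight the skip features
    return gated
```

A strong gating signal drives the coefficient toward 1 and passes the encoder feature through; a weak or negative signal attenuates it, which is how the network learns to emphasize cloud-relevant pixels.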

2021 ◽  
Vol 13 (5) ◽  
pp. 992
Author(s):  
Dan López-Puigdollers ◽  
Gonzalo Mateo-García ◽  
Luis Gómez-Chova

The systematic monitoring of the Earth using optical satellites is limited by the presence of clouds. Accurately detecting these clouds is necessary to exploit satellite image archives in remote sensing applications. Despite many developments, cloud detection remains an unsolved problem with room for improvement, especially over bright surfaces and thin clouds. Recently, advances in cloud masking using deep learning have shown significant boosts in cloud detection accuracy. However, these works are validated in heterogeneous manners, and the comparison with operational threshold-based schemes is not consistent among many of them. In this work, we systematically compare deep learning models trained on Landsat-8 images on different Landsat-8 and Sentinel-2 publicly available datasets. Overall, we show that deep learning models exhibit a high detection accuracy when trained and tested on independent images from the same Landsat-8 dataset (intra-dataset validation), outperforming operational algorithms. However, the performance of deep learning models is similar to operational threshold-based ones when they are tested on different datasets of Landsat-8 images (inter-dataset validation) or datasets from a different sensor with similar radiometric characteristics such as Sentinel-2 (cross-sensor validation). The results suggest that (i) the development of cloud detection methods for new satellites can be based on deep learning models trained on data from similar sensors and (ii) there is a strong dependence of deep learning models on the dataset used for training and testing, which highlights the necessity of standardized datasets and procedures for benchmarking cloud detection models in the future.


Entropy ◽  
2021 ◽  
Vol 23 (7) ◽  
pp. 816
Author(s):  
Pingping Liu ◽  
Xiaokang Yang ◽  
Baixin Jin ◽  
Qiuzhan Zhou

Diabetic retinopathy (DR) is a common complication of diabetes mellitus (DM), and it is necessary to diagnose DR in its early stages. With the rapid development of convolutional neural networks in image processing, deep learning methods have achieved great success in medical image processing, and various lesion detection systems have been proposed to detect fundus lesions. At present, image classification of diabetic retinopathy tends to ignore the fine-grained properties of diseased images, and most retinopathy image data sets suffer from severely uneven class distributions, which largely limits the network's ability to predict lesion classes. We propose a new non-homologous bilinear pooling convolutional neural network model and combine it with an attention mechanism to further improve the network's ability to extract specific image features. The experimental results show that, compared with the most popular fundus image classification models, our proposed model greatly improves prediction accuracy while maintaining computational efficiency.
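The bilinear pooling idea, pairing features from two different streams via an outer product, can be sketched in a few lines. This is a generic sketch of standard bilinear pooling (including the usual signed square-root and L2 normalisation), not the paper's exact non-homologous model; the function name is an assumption.

```python
import math

def bilinear_pool(feat_a, feat_b):
    """Bilinear pooling of two feature streams (sketch).

    feat_a, feat_b: feature vectors from two different backbones
    ("non-homologous" means the two streams come from different networks).
    """
    pooled = [a * b for a in feat_a for b in feat_b]            # outer product, flattened
    signed = [math.copysign(math.sqrt(abs(v)), v) for v in pooled]  # signed sqrt
    norm = math.sqrt(sum(v * v for v in signed)) or 1.0
    return [v / norm for v in signed]                           # L2 normalisation
```

The outer product captures pairwise feature interactions, which is what gives bilinear models their sensitivity to fine-grained lesion detail.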


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Venkata Dasu Marri ◽  
Veera Narayana Reddy P. ◽  
Chandra Mohan Reddy S.

Purpose Image classification is a fundamental form of digital image processing in which pixels are labeled into one of the object classes present in the image. Multispectral image classification is a challenging task due to complexities associated with images captured by satellites. Accurate image classification is highly essential in remote sensing applications, yet existing machine learning and deep learning–based classification methods could not provide the desired accuracy. The purpose of this paper is to classify the objects in the satellite image with greater accuracy. Design/methodology/approach This paper proposes a deep learning-based automated method for classifying multispectral images. Data sets collected from public databases are first divided into a number of patches and their features are extracted. The features extracted from the patches are then concatenated before a classification method is used to classify the objects in the image. Findings The performance of the proposed modified velocity-based colliding bodies optimization method is compared with existing methods in terms of type-1 measures such as sensitivity, specificity, accuracy, negative predictive value, F1 score and Matthews correlation coefficient, and type-2 measures such as false discovery rate and false positive rate. The statistical results obtained from the proposed method show better performance than existing methods. Originality/value In this work, multispectral image classification accuracy is improved with an optimization algorithm called modified velocity-based colliding bodies optimization.
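The type-1 and type-2 measures listed above all derive from the binary confusion matrix. A minimal sketch of their definitions (function name assumed; the paper's multi-class setting would apply these per class):

```python
import math

def classification_metrics(tp, fp, tn, fn):
    """Type-1 and type-2 measures from a binary confusion matrix (sketch)."""
    sens = tp / (tp + fn)                       # sensitivity (recall)
    spec = tn / (tn + fp)                       # specificity
    acc  = (tp + tn) / (tp + fp + tn + fn)      # accuracy
    npv  = tn / (tn + fn)                       # negative predictive value
    prec = tp / (tp + fp)                       # precision (for F1)
    f1   = 2 * prec * sens / (prec + sens)      # F1 score
    mcc  = (tp * tn - fp * fn) / math.sqrt(     # Matthews correlation coefficient
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    fdr  = fp / (fp + tp)                       # false discovery rate (type-2)
    fpr  = fp / (fp + tn)                       # false positive rate (type-2)
    return {"sensitivity": sens, "specificity": spec, "accuracy": acc,
            "npv": npv, "f1": f1, "mcc": mcc, "fdr": fdr, "fpr": fpr}
```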


Sensors ◽  
2020 ◽  
Vol 20 (12) ◽  
pp. 3424
Author(s):  
Xujia Liang ◽  
Zhonghua Huang ◽  
Liping Lu ◽  
Zhigang Tao ◽  
Bing Yang ◽  
...  

With the rapid development of autonomous vehicles and mobile robotics, the need for robust light detection and ranging (Lidar) detection methods in real-world applications is increasing. However, this task still suffers in degraded visual environments (DVE), including smoke, dust, fog, and rain, as aerosols lead to false alarms and sensor dysfunction. Therefore, a novel Lidar target echo signal recognition method based on multi-distance measurement and a deep learning algorithm is presented in this paper; neither backscatter suppression nor denoising functions are required. The 2-D spectrogram images are constructed using the frequency-distance relation derived from the 1-D echo signals of an individual Lidar sensor cell as the target approaches. The characteristics of the target echo signal and the noise in the spectrogram images are analyzed, and the target recognition criterion is established accordingly. A customized deep learning algorithm is subsequently developed to perform the recognition. The simulation and experimental results demonstrate that the proposed method can significantly improve Lidar detection performance in DVE.
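Turning a 1-D echo trace into a 2-D spectrogram image can be sketched with a windowed DFT: one magnitude spectrum per range step. This is a generic short-time-spectrum construction (function name and windowing choices assumed), not the paper's exact frequency-distance transform.

```python
import math

def spectrogram(signal, win, hop):
    """Magnitude spectrogram of a 1-D echo trace (sketch).

    Slides a window of length `win` by `hop` samples and computes a
    one-sided DFT magnitude per frame, yielding a 2-D time/range-frequency image.
    """
    frames = []
    for start in range(0, len(signal) - win + 1, hop):
        seg = signal[start:start + win]
        row = []
        for k in range(win // 2 + 1):          # one-sided spectrum
            re = sum(seg[n] * math.cos(2 * math.pi * k * n / win) for n in range(win))
            im = -sum(seg[n] * math.sin(2 * math.pi * k * n / win) for n in range(win))
            row.append(math.hypot(re, im))
        frames.append(row)                     # one spectrum per step
    return frames
```

In the spectrogram image, a true target echo concentrates energy along a consistent frequency-distance track, while aerosol backscatter spreads diffusely, which is the separability the recognition criterion exploits.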


2020 ◽  
Vol 12 (17) ◽  
pp. 2669
Author(s):  
Junhao Qian ◽  
Min Xia ◽  
Yonghong Zhang ◽  
Jia Liu ◽  
Yiqing Xu

Change detection is a very important technique for remote sensing data analysis. Its mainstream solutions are either supervised or unsupervised. Among supervised methods, most existing deep learning-based change detection methods are related to semantic segmentation. However, these methods only use deep learning models to process the global information of an image and do not train specifically on changed and unchanged areas, so many details of local changes cannot be detected. In this work, a trilateral change detection network is proposed. The proposed network has three branches (a main module and two auxiliary modules, each composed of convolutional neural networks (CNNs)), which focus on the overall information of bitemporal Google Earth image pairs, the changed areas, and the unchanged areas, respectively. The proposed method is end-to-end trainable, and no component of the network needs to be trained separately.


2019 ◽  
Vol 9 (18) ◽  
pp. 3717
Author(s):  
Wenkuan Li ◽  
Dongyuan Li ◽  
Hongxia Yin ◽  
Lindong Zhang ◽  
Zhenfang Zhu ◽  
...  

Text representation learning is an important but challenging issue for various natural language processing tasks. Recently, deep learning-based representation models have achieved great success in sentiment classification. However, these models focus more on semantic information than on sentiment linguistic knowledge, which provides rich sentiment information and plays a key role in sentiment analysis. In this paper, we propose a lexicon-enhanced attention network (LAN) based on text representation to improve the performance of sentiment classification. Specifically, we first propose a lexicon-enhanced attention mechanism that combines a sentiment lexicon with attention to incorporate sentiment linguistic knowledge into deep learning methods. Second, we introduce a multi-head attention mechanism in the deep neural network to interactively capture contextual information from different representation subspaces at different positions. Furthermore, we stack LAN models to build a hierarchical sentiment classification model for large-scale text. Extensive experiments are conducted on four popular real-world sentiment classification datasets at both the sentence level and the document level. The experimental results demonstrate that our proposed models achieve comparable or better performance than state-of-the-art methods.
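The core idea of lexicon-enhanced attention, biasing attention scores with per-token sentiment lexicon weights before the softmax, can be sketched in a few lines. The function name, the additive combination, and the mixing weight `lam` are assumptions for illustration; the paper's exact formulation may differ.

```python
import math

def lexicon_attention(scores, lexicon, lam=1.0):
    """Lexicon-enhanced attention weights (sketch).

    scores:  raw attention scores for each token
    lexicon: sentiment lexicon weight for each token (0 for neutral words)
    lam:     mixing coefficient for the lexicon bias
    """
    biased = [s + lam * l for s, l in zip(scores, lexicon)]
    m = max(biased)
    exps = [math.exp(b - m) for b in biased]   # numerically stable softmax
    total = sum(exps)
    return [e / total for e in exps]
```

Tokens carrying lexicon sentiment receive extra attention mass, which is how linguistic knowledge is injected into an otherwise purely learned mechanism.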


Sensors ◽  
2021 ◽  
Vol 21 (23) ◽  
pp. 8072
Author(s):  
Yu-Bang Chang ◽  
Chieh Tsai ◽  
Chang-Hong Lin ◽  
Poki Chen

As the techniques of autonomous driving become increasingly valued and universal, real-time semantic segmentation has become very popular and challenging in the field of deep learning and computer vision in recent years. However, in order to apply the deep learning model to edge devices accompanying sensors on vehicles, we need to design a structure that has the best trade-off between accuracy and inference time. In previous works, several methods sacrificed accuracy to obtain a faster inference time, while others aimed to find the best accuracy under the condition of real time. Nevertheless, the accuracies of previous real-time semantic segmentation methods still have a large gap compared to general semantic segmentation methods. As a result, we propose a network architecture based on a dual encoder and a self-attention mechanism. Compared with preceding works, we achieved a 78.6% mIoU with a speed of 39.4 FPS with a 1024 × 2048 resolution on a Cityscapes test submission.
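The mIoU figure quoted above is the standard Cityscapes metric: per-class intersection-over-union averaged over classes. A minimal sketch over flattened label maps (function name assumed):

```python
def mean_iou(pred, gt, num_classes):
    """Mean intersection-over-union over flattened label maps (sketch)."""
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, g in zip(pred, gt) if p == c and g == c)
        union = sum(1 for p, g in zip(pred, gt) if p == c or g == c)
        if union:                               # skip classes absent from both maps
            ious.append(inter / union)
    return sum(ious) / len(ious)
```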


Author(s):  
R. Grompone von Gioi ◽  
C. Hessel ◽  
T. Dagobert ◽  
J. M. Morel ◽  
C. de Franchis

Abstract. Assessing ground visibility is a crucial step in automatic satellite image analysis. Nevertheless, several recent Earth observation satellite constellations lack specially designed spectral bands and use a frame camera, precluding spectrum-based and parallax-based cloud detection methods. An alternative approach is to detect the parts of each image where the ground is visible. This can be done by locally comparing pairs of registered images in a temporal series: matching regions are necessarily cloud-free. Indeed, the ground has persistent patterns that can be observed repeatedly in the time series, while the appearance of clouds changes at each date. To reliably detect the “visible” ground, we propose an a contrario local image matching method coupled with an efficient greedy algorithm.
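The a contrario principle controls the expected number of false detections under a chance model. A minimal sketch, assuming a simplified statistic (a binomial tail over block matches; the paper's exact NFA statistic may differ, and both function names are assumptions):

```python
import math

def nfa(n_tests, n, k, p):
    """Number of false alarms (sketch): expected count, over n_tests regions,
    of observing k or more matches among n blocks purely by chance,
    where p is the per-block chance-match probability."""
    tail = sum(math.comb(n, j) * p**j * (1 - p)**(n - j) for j in range(k, n + 1))
    return n_tests * tail

def is_cloud_free(n_tests, n, k, p, eps=1.0):
    """Declare a region meaningful (ground visible) when NFA < eps."""
    return nfa(n_tests, n, k, p) < eps
```

Thresholding on NFA rather than on a raw similarity score makes the detector's false-alarm rate interpretable: with eps = 1, at most one spurious cloud-free region is expected per image on average.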


2021 ◽  
Author(s):  
Wei Li ◽  
Georg Rümpker ◽  
Horst Stöcker ◽  
Megha Chakraborty ◽  
Darius Fener ◽  
...  

<p>This study presents a deep learning-based algorithm for seismic event detection and simultaneous phase picking in seismic waveforms. U-net-based solutions, which consist of a contracting path (encoder) to capture feature information and a symmetric expanding path (decoder) that enables precise localization, have proven to be effective in phase picking. The network architectures of these U-net models mainly comprise 1D CNNs, bi- and uni-directional LSTMs, transformers, and self-attentive layers. Although these networks have proven to be a good solution, they may not fully harness the information extracted at multiple scales.</p><p>In this study, we propose a simple yet powerful deep learning architecture for phase picking, named MCA-Unet, that combines multi-class prediction with an attention mechanism. Specifically, we treat phase picking as an image segmentation problem and incorporate the attention mechanism into the U-net structure to efficiently handle the features extracted at different levels, with the goal of improving seismic phase picking performance. Our neural network is based on an encoder-decoder architecture composed of 1D convolutions, pooling layers, deconvolutions, and multi-attention layers. This architecture is applied to and tested on a field seismic dataset (the Wenchuan Earthquake Aftershocks Classification Dataset) to assess its performance.</p>
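Treating phase picking as segmentation means the network outputs a per-sample phase probability trace, which must then be decoded into discrete picks. A common decoding step is to take local maxima above a threshold; this sketch illustrates that generic post-processing (function name and threshold assumed), not MCA-Unet's specific decoder.

```python
def pick_phases(prob, threshold=0.5):
    """Decode a per-sample phase probability trace into discrete picks (sketch):
    keep local maxima that exceed the threshold."""
    picks = []
    for i in range(1, len(prob) - 1):
        if prob[i] >= threshold and prob[i] >= prob[i - 1] and prob[i] > prob[i + 1]:
            picks.append(i)                    # sample index of the pick
    return picks
```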


Processes ◽  
2021 ◽  
Vol 9 (6) ◽  
pp. 929
Author(s):  
Xiaoliang Zhang ◽  
Kehe Wu ◽  
Zuge Chen ◽  
Chenyi Zhang

Research on malware detection enabled by deep learning has become a hot issue in the field of network security. Existing deep learning-based malware detection methods suffer from issues such as weak deep feature extraction, relatively complex models, and insufficient model generalization. Traditional deep learning architectures, such as convolutional neural network (CNN) variants, do not consider the spatial hierarchies between features and lose information on the precise position of a feature within its feature region, which is crucial for a malware file with specific sections. In this paper, we draw on the idea of image classification in computer vision and propose a novel malware detection method based on a capsule network architecture with hyper-parameter-optimized convolutional layers (MalCaps), which overcomes these CNN limitations by removing the need for a pooling layer and introducing capsule layers. Firstly, the malware is transformed into a grayscale image. Then, a dynamic routing-based capsule network is used to detect and classify the image. Without advanced feature extraction and with only a small number of labeled samples, the presented method is tested on the unbalanced Microsoft Malware Classification Challenge (MMCC) dataset, and the experimental results show a testing accuracy of 99.34%, improving on a number of traditional deep learning models posited in recent malware classification literature.
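The first step, transforming a malware binary into a grayscale image, is straightforward: each byte becomes one pixel intensity, and the stream is reshaped into rows of a fixed width. A minimal sketch (function name and zero-padding choice are assumptions; image width conventions vary by file size in this literature):

```python
def bytes_to_grayscale(data, width):
    """Reshape a malware byte stream into a 2-D grayscale image (sketch):
    one byte per pixel, rows of `width` pixels, last row zero-padded."""
    rows = [list(data[i:i + width]) for i in range(0, len(data), width)]
    if rows and len(rows[-1]) < width:
        rows[-1] += [0] * (width - len(rows[-1]))   # pad the final partial row
    return rows
```

Because distinct file sections (code, data, resources) produce visually distinct textures, the resulting image preserves the positional structure that the capsule network is designed to exploit.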

