Evaluation of Sea fog Detection Accuracy Based on Geostationary Satellite Image Using Machine Learning

Author(s):  
NaKyeong Kim ◽  
Suho Bak ◽  
Minji Jeong ◽  
Hongjoo Yoon

Sea fog forms when warm, moist air moves over a colder sea surface and cools near the ocean-atmosphere boundary layer. It affects many activities, including maritime and coastal transportation, military operations, and fishing, and its detection is particularly important because reduced visibility can lead to ship accidents. Because sea fog events cover wide areas and do not occur at fixed times, they are generally detected through satellite remote sensing; and because fog develops and dissipates over short periods, geostationary satellites, which offer higher temporal resolution than polar-orbiting satellites, are used for detection. A widely used method exploits the difference between the 11 μm and 3.7 μm channels, but it has difficulty distinguishing low clouds from fog, and traditional algorithms struggle to find accurate thresholds separating the two. Machine learning algorithms can be a useful tool for making this distinction. In this study, based on geostationary satellite imagery, a comparative analysis of sea fog detection accuracy was conducted across several machine learning methods, including Random Forest, Multi-Layer Perceptron, and Convolutional Neural Networks.
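The contrast between the classical dual-channel threshold test and a learned classifier can be illustrated with a toy sketch. All brightness temperatures, labels, and the 2 K threshold below are invented for illustration, and scikit-learn's RandomForestClassifier stands in for the study's models:

```python
# Hypothetical sketch: fog vs. low-cloud pixels from two thermal channels.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n = 400
bt11 = rng.normal(280.0, 3.0, n)           # synthetic 11 um brightness temps (K)
labels = rng.integers(0, 2, n)             # 1 = fog, 0 = low cloud
# In this toy setup fog pixels show a larger 11 um - 3.7 um difference.
btd = np.where(labels == 1,
               rng.normal(3.0, 1.0, n),    # fog-like difference
               rng.normal(0.5, 1.0, n))    # cloud-like difference
bt37 = bt11 - btd

# Classical approach: a single hand-tuned threshold on the channel difference.
thresh_pred = (bt11 - bt37) > 2.0
thresh_acc = (thresh_pred == labels).mean()

# Machine-learning approach: learn the decision boundary from both channels
# jointly instead of relying on one fixed threshold.
X = np.column_stack([bt11, bt37, bt11 - bt37])
clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, labels)
rf_acc = clf.score(X, labels)
```

Because the two classes overlap, the fixed threshold misclassifies a fraction of pixels that a learned boundary can recover, which mirrors the motivation stated in the abstract.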

Sensors ◽  
2020 ◽  
Vol 20 (5) ◽  
pp. 1425 ◽  
Author(s):  
Adriaan L. van Natijne ◽  
Roderik C. Lindenbergh ◽  
Thom A. Bogaard

Nowcasting and early warning systems for landslide hazards have been implemented mostly at the slope or catchment scale. These systems are often difficult to implement at regional scale or in remote areas. Machine Learning and satellite remote sensing products offer new opportunities for both local and regional monitoring of deep-seated landslide deformation and associated processes. Here, we list the key variables of the landslide process and the associated satellite remote sensing products, as well as the available machine learning algorithms and their current use in the field. Furthermore, we discuss both the challenges for the integration in an early warning system, and the risks and opportunities arising from the limited physical constraints in machine learning. This review shows that data products and algorithms are available, and that the technology is ready to be tested for regional applications.


2021 ◽  
Author(s):  
Abderraouf Chemmakh

Abstract. Uniaxial Compressive Strength (UCS) and Tensile Strength (TS) are among the essential rock parameters required for rock mechanical studies in petroleum engineering. However, determining these parameters requires laboratory experiments that can be both time-consuming and costly. In order to estimate them efficiently and quickly, various mathematical tools have been used by different researchers. Whereas regression tools give good results only within the limited range of the data used, machine learning methods have proved very accurate in generating models that cover a wide range of data. In this study, two machine learning models were used to predict UCS and TS: Support Vector Regression optimized by a Genetic Algorithm (GA-SVR) and Artificial Neural Networks (ANNs). The results were evaluated for both uniaxial compressive strength and tensile strength in terms of the coefficient of determination R², root mean squared error (RMSE) and mean absolute error (MAE). For UCS, the ANN and GA-SVR achieved R² values of 0.99 and 0.99, RMSE values of 3.41 and 2.9, and MAE values of 2.43 and 1.9, respectively. For TS, following the same analogy, the ANN and GA-SVR obtained R² values of 0.99 and 0.99, RMSE values of 0.41 and 0.45, and MAE values of 0.30 and 0.39, respectively. The next step was to assess these models on a different dataset, consisting of data obtained from the Bakken Field in the Williston Basin, North Dakota, United States. The models outperformed the empirical correlations against which they were benchmarked in terms of R², RMSE and MAE, with the ANN and SVR giving R² of 0.93 and 0.92, RMSE of 9.54 and 11.22, and MAE of 7.28 and 9.24, respectively.
This work concludes that machine learning algorithms can generate universal models that reduce the time and effort needed to estimate complex parameters such as UCS and tensile strength.
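As a rough illustration of the evaluation above, the sketch below fits an SVR and a small neural network on synthetic data and reports R², RMSE, and MAE. The features, target, and hyperparameters are invented, and the genetic-algorithm tuning step is omitted; plain scikit-learn models stand in for the paper's GA-SVR and ANN:

```python
# Toy regression sketch: predict a UCS-like target from synthetic features
# and score with the same three metrics used in the abstract.
import numpy as np
from sklearn.svm import SVR
from sklearn.neural_network import MLPRegressor
from sklearn.compose import TransformedTargetRegressor
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import r2_score, mean_squared_error, mean_absolute_error

rng = np.random.default_rng(1)
X = rng.uniform(0, 1, (200, 3))                                # toy log-derived features
y = 50 + 80 * X[:, 0] - 30 * X[:, 1] + rng.normal(0, 2, 200)   # toy UCS (MPa)

def report(model):
    pred = model.fit(X, y).predict(X)
    return (r2_score(y, pred),
            mean_squared_error(y, pred) ** 0.5,   # RMSE
            mean_absolute_error(y, pred))

svr_r2, svr_rmse, svr_mae = report(SVR(C=100.0))
# Standardising the target helps the small MLP converge on this scale.
ann = TransformedTargetRegressor(
    regressor=MLPRegressor(hidden_layer_sizes=(32,), max_iter=3000, random_state=1),
    transformer=StandardScaler())
ann_r2, ann_rmse, ann_mae = report(ann)
```

In the paper a GA searches the SVR hyperparameters; here C is simply fixed, since the point is only the metric computation.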


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Rajat Garg ◽  
Anil Kumar ◽  
Nikunj Bansal ◽  
Manish Prateek ◽  
Shashi Kumar

Abstract. Urban area mapping is an important application of remote sensing which aims at both estimating land cover and detecting change under the urban area. A major challenge in analyzing Synthetic Aperture Radar (SAR) based remote sensing data is the strong similarity between highly vegetated urban areas or oriented urban targets and actual vegetation, which leads to misclassification of the urban area as forest cover. The present work is a precursor study for the dual-frequency L- and S-band NASA-ISRO Synthetic Aperture Radar (NISAR) mission and aims at minimizing the misclassification of such highly vegetated and oriented urban targets into the vegetation class with the help of deep learning. In this study, three machine learning algorithms, Random Forest (RF), K-Nearest Neighbour (KNN), and Support Vector Machine (SVM), have been implemented along with a deep learning model, DeepLabv3+, for semantic segmentation of Polarimetric SAR (PolSAR) data. It is a general perception that a large dataset is required for the successful implementation of any deep learning model, but in SAR-based remote sensing a major issue is the unavailability of a large benchmark labeled dataset for training deep learning algorithms from scratch. In the current work, it is shown that a pre-trained DeepLabv3+ model, used with transfer learning, outperforms the machine learning algorithms for the land use and land cover (LULC) classification task even with a small dataset. The highest pixel accuracy of 87.78% and overall pixel accuracy of 85.65% were achieved with DeepLabv3+; Random Forest performs best among the machine learning algorithms with an overall pixel accuracy of 77.91%, while SVM and KNN trail with overall accuracies of 77.01% and 76.47%, respectively.
The highest precision of 0.9228 is recorded for the urban class in the semantic segmentation task with DeepLabv3+, while the machine learning algorithms SVM and RF gave comparable results with precisions of 0.8977 and 0.8958, respectively.
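The overall pixel accuracy and per-class precision figures reported above can be computed directly from predicted and reference label maps; a minimal sketch with synthetic labels (the three classes and the 15% error rate are invented stand-ins):

```python
# Compute overall pixel accuracy and per-class precision for a toy
# three-class label map (e.g. urban / vegetation / other).
import numpy as np
from sklearn.metrics import accuracy_score, precision_score

rng = np.random.default_rng(2)
y_true = rng.integers(0, 3, 10_000)        # reference LULC label per pixel
y_pred = y_true.copy()
flip = rng.random(10_000) < 0.15           # corrupt 15% of pixels to mimic errors
y_pred[flip] = rng.integers(0, 3, flip.sum())

overall_acc = accuracy_score(y_true, y_pred)
# Per-class precision, here for class 0 standing in for "urban".
urban_precision = precision_score(y_true, y_pred, labels=[0], average=None)[0]
```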


2021 ◽  
Vol 10 (2) ◽  
pp. 58 ◽
Author(s):  
Muhammad Fawad Akbar Khan ◽  
Khan Muhammad ◽  
Shahid Bashir ◽  
Shahab Ud Din ◽  
Muhammad Hanif

Low-resolution Geological Survey of Pakistan (GSP) maps surrounding the region of interest show oolitic and fossiliferous limestone occurrences in the Samanasuk, Lockhart, and Margalla hill formations in the Hazara division, Pakistan. Machine-learning algorithms (MLAs) have rarely been applied to multispectral remote sensing data to differentiate between limestone formations formed in different depositional environments, such as oolitic or fossiliferous. Unlike previous studies that mostly report lithological classification of rock types with different chemical compositions, this paper aimed to investigate the MLAs' potential for mapping subclasses within the same lithology, i.e., limestone. Additionally, the selection of appropriate data labels, training algorithms, hyperparameters, and remote sensing data sources was also investigated. First, the oolitic (Samanasuk) and fossiliferous (Lockhart and Margalla) limestone-bearing formations, along with the adjoining Hazara formation, were mapped using random forest (RF), support vector machine (SVM), classification and regression tree (CART), and naïve Bayes (NB) MLAs. The RF algorithm reported the best accuracy of 83.28% and a Kappa coefficient of 0.78. To further improve the targeted allochemical limestone formation map, annotation labels were generated by fusing maps obtained from principal component analysis (PCA), decorrelation stretching (DS), and X-means clustering applied to ASTER-L1T, Landsat-8, and Sentinel-2 datasets. These labels were used to train and validate the SVM, CART, NB, and RF MLAs to obtain a binary classification map of limestone occurrences in the Hazara division, Pakistan, using the Google Earth Engine (GEE) platform. The classification of Landsat-8 data by CART reported 99.63% accuracy, with a Kappa coefficient of 0.99, and was in good agreement with field validation.
This binary limestone map was further classified into the oolitic (Samanasuk) and fossiliferous (Lockhart and Margalla) formations by all four MLAs; in this case, RF surpassed the other algorithms with an improved accuracy of 96.36%. This improvement can be attributed to better annotation: the binary limestone classification map formed a mask for improved classification of oolitic and fossiliferous limestone in the area.
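The two-stage idea of clustering-derived annotation labels feeding a supervised classifier can be sketched as follows. The band values are synthetic, plain k-means with k fixed to 2 approximates the X-means step, and a Random Forest plays the supervised stage:

```python
# Stage 1: unsupervised clustering generates candidate annotation labels.
# Stage 2: those labels train a supervised classifier for the binary mask.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(3)
# Two synthetic spectral clusters standing in for limestone / non-limestone.
limestone = rng.normal([0.35, 0.30, 0.25], 0.02, (300, 3))
other = rng.normal([0.15, 0.20, 0.30], 0.02, (300, 3))
X = np.vstack([limestone, other])

labels = KMeans(n_clusters=2, n_init=10, random_state=3).fit_predict(X)

# The cluster labels act as annotations for the Random Forest, which could
# then be applied to the full scene to produce the binary limestone map.
rf = RandomForestClassifier(n_estimators=50, random_state=3).fit(X, labels)
mask_acc = rf.score(X, labels)
```

In the paper this workflow runs on GEE over real ASTER/Landsat/Sentinel bands; the sketch only shows the label-fusion-then-classify pattern.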


2021 ◽  
Author(s):  
Xiaotong Zhu ◽  
Jinhui Jeanne Huang

<p>Remote sensing monitoring has the characteristics of wide monitoring range, celerity, low cost for long-term dynamic monitoring of water environment. With the flourish of artificial intelligence, machine learning has enabled remote sensing inversion of seawater quality to achieve higher prediction accuracy. However, due to the physicochemical property of the water quality parameters, the performance of algorithms differs a lot. In order to improve the predictive accuracy of seawater quality parameters, we proposed a technical framework to identify the optimal machine learning algorithms using Sentinel-2 satellite and in-situ seawater sample data. In the study, we select three algorithms, i.e. support vector regression (SVR), XGBoost and deep learning (DL), and four seawater quality parameters, i.e. dissolved oxygen (DO), total dissolved solids (TDS), turbidity(TUR) and chlorophyll-a (Chla). The results show that SVR is a more precise algorithm to inverse DO (R<sup>2</sup> = 0.81). XGBoost has the best accuracy for Chla and Tur inversion (R<sup>2</sup> = 0.75 and 0.78 respectively) while DL performs better in TDS (R<sup>2</sup> =0.789). Overall, this research provides a theoretical support for high precision remote sensing inversion of offshore seawater quality parameters based on machine learning.</p>


2017 ◽  
Vol 71 (1) ◽  
pp. 169-188 ◽  
Author(s):  
E. Shafiee ◽  
M. R. Mosavi ◽  
M. Moazedi

The importance of the Global Positioning System (GPS) and related electronic systems continues to increase in a range of environmental, engineering and navigation applications. However, civilian GPS signals are vulnerable to Radio Frequency (RF) interference. Spoofing is an intentional intervention that aims to force a GPS receiver to acquire and track invalid navigation data. Analysis of spoofed and authentic signal patterns reveals differences in the phase, energy and imaginary components of the signal. In this paper, early-late phase, delta, and signal level, as the three main features, are extracted from the correlation output of the tracking loop. Using these features, spoofing detection can be performed with conventional machine learning algorithms such as K-Nearest Neighbour (KNN) and the naive Bayes classifier. A Neural Network (NN) is a modern computational method for collecting the required knowledge and predicting output values in complicated systems. This paper presents a new approach to GPS spoofing detection based on a multi-layer NN whose inputs are indices of these features. Simulation results on a software GPS receiver showed that the NN achieves adequate detection accuracy with a short detection time.
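The feature-based detection step can be sketched with a toy classifier: three correlator-derived features (early-late phase, delta, signal level) feed a KNN model. The feature distributions below are invented for illustration and do not come from the paper:

```python
# Toy spoofing detector: classify authentic (0) vs. spoofed (1) signals
# from three synthetic tracking-loop features.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(5)
n = 300
authentic = np.column_stack([rng.normal(0.0, 0.1, n),   # early-late phase
                             rng.normal(0.0, 0.1, n),   # delta
                             rng.normal(1.0, 0.1, n)])  # signal level
spoofed = np.column_stack([rng.normal(0.3, 0.1, n),
                           rng.normal(0.3, 0.1, n),
                           rng.normal(1.5, 0.1, n)])
X = np.vstack([authentic, spoofed])
y = np.r_[np.zeros(n), np.ones(n)]

Xtr, Xte, ytr, yte = train_test_split(X, y, test_size=0.3, random_state=5)
knn = KNeighborsClassifier(n_neighbors=5).fit(Xtr, ytr)
acc = knn.score(Xte, yte)
```

The paper's multi-layer NN would replace the KNN in this pipeline; the feature extraction from the tracking-loop correlator output is the part not shown here.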


Author(s):  
Y. Xu ◽  
X. Hu ◽  
Y. Wei ◽  
Y. Yang ◽  
D. Wang

Abstract. The demand for timely information about the earth's surface, such as land cover and land use (LC/LU), is consistently increasing. Machine learning methods are well suited to extracting such information from remotely sensed images but require sufficient training samples. For satellite remote sensing imagery, however, sample datasets covering a large scope are still limited: most existing datasets are built from a few image frames over a local area. For large-scope (national-level) coverage, choosing a sufficient, unbiased sampling method is crucial for constructing a balanced training dataset, and dependable spatial sample locations that account for the spatial heterogeneity of land cover are needed when choosing sample images. This paper introduces ongoing work on establishing a national-scope sample dataset for high-spatial-resolution satellite image processing. Sample sites were chosen using a spatial sampling method, and the divided sample patches were grouped using a clustering method for further use. A neural network model for road detection trained on a subset of our dataset shows improved performance in both completeness and accuracy compared to two widely used public datasets.
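The patch-grouping step described above can be sketched as clustering one coarse feature vector per candidate patch, then drawing a balanced subset across clusters. The features and cluster count below are invented stand-ins, and k-means is used as a generic clustering method:

```python
# Group candidate sample patches by k-means on per-patch feature vectors,
# then draw a few patches from every cluster for a balanced training set.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(6)
# One coarse feature vector per candidate patch (e.g. mean band values).
patch_features = rng.uniform(0, 1, (500, 8))

groups = KMeans(n_clusters=10, n_init=10, random_state=6).fit_predict(patch_features)
# Balanced draw: up to five patch indices per cluster.
subset = [np.flatnonzero(groups == k)[:5] for k in range(10)]
```

Sampling across clusters rather than at random is what keeps rare land-cover regimes represented in the training set.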


The field of biosciences has advanced to a large extent and has generated large amounts of information from Electronic Health Records. This has given rise to an acute need for knowledge generation from this enormous amount of data, and data mining methods and machine learning play a major role in this aspect of the biosciences. Chronic Kidney Disease (CKD) is a condition in which the kidneys are damaged and cannot filter blood as they normally do. A family history of kidney disease or failure, high blood pressure, or type 2 diabetes may lead to CKD. The damage to the kidney is lasting, and the chances of it worsening over time are high. The most common complications resulting from kidney failure are heart disease, anemia, bone disease, and high potassium and calcium. The worst case leads to complete kidney failure and necessitates a kidney transplant. Early detection of CKD can improve quality of life to a great extent, which calls for good prediction algorithms that detect CKD at an early stage. The literature shows a wide range of machine learning algorithms employed for the prediction of CKD. This paper uses data preprocessing, data transformation, and various classifiers to predict CKD, and also proposes the best prediction framework for CKD. The results of the framework show promising prediction at an early stage of CKD.
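A prediction framework of the kind outlined here typically chains preprocessing, transformation, and a classifier. A minimal sketch with entirely synthetic stand-ins for the clinical features (imputation mimics missing lab values; the specific steps and model are illustrative, not the paper's):

```python
# Toy CKD-style pipeline: impute missing values, scale, classify, and
# evaluate with cross-validated AUC-ROC.
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(7)
X = rng.normal(0, 1, (400, 6))             # synthetic lab/vital features
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(0, 0.5, 400) > 0).astype(int)
X[rng.random(X.shape) < 0.05] = np.nan     # mimic missing lab values

pipe = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
    ("clf", RandomForestClassifier(n_estimators=100, random_state=7)),
])
auc = cross_val_score(pipe, X, y, cv=5, scoring="roc_auc").mean()
```

Wrapping the steps in a single Pipeline ensures the imputation and scaling are refit inside each cross-validation fold, avoiding leakage.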


2021 ◽  
Author(s):  
Zhu Shen ◽  
Wenfei Du ◽  
Cecelia Perkins ◽  
Lenn Fechter ◽  
Vanita Natu ◽  
...  

Predicting disease natural history remains a particularly challenging endeavor in chronic degenerative disorders and cancer, thus limiting early detection, risk stratification, and preventive interventions. Here, profiling the spectrum of chronic myeloproliferative neoplasms (MPNs) as a model, we identify the blood platelet transcriptome as a generalizable strategy for highly sensitive progression biomarkers that also enable prediction via machine learning algorithms. Using RNA sequencing (RNA-seq), we derive disease-relevant gene expression and alternative splicing in purified platelets from 120 peripheral blood samples constituting two independently collected and mutually validating patient cohorts of the three MPN subtypes: essential thrombocythemia, ET (n=24), polycythemia vera, PV (n=33), and primary or post-ET/PV secondary myelofibrosis, MF (n=42), as well as healthy donors (n=21). The MPN platelet transcriptome discriminates each clinical phenotype and reveals an incremental molecular reprogramming that is independent of patient driver mutation status or therapy. Leveraging this dataset, in particular the progressive expression gradient noted across MPN, we develop a machine learning model (Lasso-penalized regression) predictive of the advanced subtype MF at high accuracy (AUC-ROC of 0.95-0.96), with validation under two conditions: i) temporal, with training on the first cohort (n=71) and independent testing on the second (n=49), and ii) 10-fold cross-validation on the entire dataset. Lasso-derived signatures offer a robust core set of fewer than 10 MPN progression markers. Mechanistic insights from our data highlight impaired protein homeostasis as a prominent driver of MPN evolution, with a persistent integrated stress response. We also identify JAK inhibitor-specific signatures and other interferon-, proliferation-, and proteostasis-associated markers as putative targets for MPN-directed therapy.
Our platelet transcriptome snapshot of chronic MPNs establishes a methodological foundation for deciphering disease risk stratification and progression beyond genetic data alone, thus presenting a promising avenue toward potential utility in a wide range of age-related disorders.
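The validation scheme described above, an L1-penalized (Lasso-style) model scored by cross-validated AUC-ROC and yielding a sparse marker set, can be sketched on synthetic expression data. Sample counts, the number of genes, effect sizes, and the penalty strength below are all invented:

```python
# L1-penalized logistic regression on synthetic "expression" data:
# 10-fold cross-validated AUC-ROC plus a count of selected features.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(8)
n, p = 120, 200                        # samples x genes (synthetic)
X = rng.normal(0, 1, (n, p))
# Only a sparse core set of genes carries signal, echoing the marker set.
signal = X[:, :5].sum(axis=1)
y = (signal + rng.normal(0, 1, n) > 0).astype(int)

lasso_clf = LogisticRegression(penalty="l1", solver="liblinear", C=0.5)
auc = cross_val_score(lasso_clf, X, y, cv=10, scoring="roc_auc").mean()
n_selected = int((lasso_clf.fit(X, y).coef_ != 0).sum())
```

The L1 penalty drives most coefficients exactly to zero, which is what yields a compact progression-marker signature rather than a model over all genes.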

