A Novel Hybrid Method for Landslide Susceptibility Mapping-Based GeoDetector and Machine Learning Cluster: A Case of Xiaojin County, China

Landslide susceptibility mapping (LSM) can be an effective way to prevent landslide hazards and mitigate losses. The choice of conditional factors is crucial to the results of LSM, and the selection of models also plays an important role. In this study, a hybrid method combining GeoDetector and a machine learning cluster was developed to provide a new perspective on how to address these two issues. We defined redundant factors by using GeoDetector to quantitatively analyze the single and interactive impacts of the factors; the effect of this step was examined using mean absolute error (MAE). The machine learning cluster contains four models (artificial neural network (ANN), Bayesian network (BN), logistic regression (LR), and support vector machine (SVM)) and automatically selects the best one for generating the LSM. The receiver operating characteristic (ROC) curve, prediction accuracy, and the seed cell area index (SCAI) were used to evaluate these models. The results show that the SVM model had the best performance in the machine learning cluster, with an area under the ROC curve of 0.928 and an accuracy of 83.86%. Therefore, SVM was chosen as the assessment model to map the landslide susceptibility of the study area. The landslide susceptibility map fit the landslide inventory well, indicating that the hybrid method is effective in screening landslide influences and assessing landslide susceptibility.
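
The model-selection step of the machine learning cluster can be illustrated with a minimal sketch. It assumes, as a simplification not stated in the abstract, that the best model is simply the one with the highest area under the ROC curve; the function names, model scores, and labels below are hypothetical toy values, not data from the study.

```python
from typing import Dict, Sequence, Tuple


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random negative.

    A tie between a positive and a negative score counts as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def select_best_model(
    labels: Sequence[int], model_scores: Dict[str, Sequence[float]]
) -> Tuple[str, float]:
    """Return the name and AUC of the candidate with the highest AUC (assumed criterion)."""
    aucs = {name: roc_auc(labels, scores) for name, scores in model_scores.items()}
    best = max(aucs, key=aucs.get)
    return best, aucs[best]


# Hypothetical validation labels (1 = landslide cell, 0 = non-landslide cell)
labels = [1, 1, 1, 0, 0, 0]
# Hypothetical susceptibility scores from two candidate models
toy_scores = {
    "LR": [0.9, 0.4, 0.6, 0.5, 0.3, 0.2],
    "SVM": [0.9, 0.8, 0.7, 0.4, 0.3, 0.2],
}
best_name, best_auc = select_best_model(labels, toy_scores)
print(best_name, best_auc)
```

The pairwise comparison is quadratic in the number of samples, which is fine for illustration; a rank-based computation would be preferable on a full raster.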


Factors Affecting Landslide Susceptibility Mapping: Assessing the Influence of Different Machine Learning Approaches, Sampling Strategies and Data Splitting

Land ◽

10.3390/land10090989 ◽

2021 ◽

Vol 10 (9) ◽

pp. 989

Author(s):

Minu Treesa Abraham ◽

Neelima Satyam ◽

Revuri Lokesh ◽

Biswajeet Pradhan ◽

Abdullah Alamri

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Sampling Strategy ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Susceptibility Map ◽

Learning Approaches ◽

Sampling Strategies ◽

Data Splitting

Data-driven methods are widely used for the development of Landslide Susceptibility Mapping (LSM). The results of these methods are sensitive to different factors, such as the quality of input data, choice of algorithm, sampling strategies, and data splitting ratios. In this study, five different Machine Learning (ML) algorithms are used for LSM for the Wayanad district in Kerala, India, using two different sampling strategies and nine different train-to-test ratios in cross validation. The results show that Random Forest (RF), K Nearest Neighbors (KNN), and Support Vector Machine (SVM) algorithms provide better results than Naïve Bayes (NB) and Logistic Regression (LR) for the study area. NB and LR algorithms are less sensitive to the sampling strategy and data splitting, while the performance of the other three algorithms is considerably influenced by the sampling strategy. The results indicate that both the choice of algorithm and the sampling strategy are critical in obtaining the best-suited landslide susceptibility map for a region. The accuracies of the KNN, RF, and SVM algorithms increased by 10.51%, 10.02%, and 4.98%, respectively, with the use of polygon landslide inventory data, while for the NB and LR algorithms, the performance was slightly reduced with the use of polygon data. Thus, the sampling strategy and data splitting ratio are less consequential with NB and LR algorithms, while more data points provide better results for KNN, RF, and SVM algorithms.


Landslide susceptibility mapping based on convolutional neural network and conventional machine learning methods

10.21203/rs.3.rs-190195/v1 ◽

2021 ◽

Author(s):

Rui Liu ◽

Xin Yang ◽

Chong Xu ◽

Luyao Li ◽

Xiangqiang Zeng

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods ◽

Conventional Machine

Landslide susceptibility mapping (LSM) is a useful tool to estimate the probability of landslide occurrence, providing a scientific basis for natural hazards prevention, land use planning, and economic development in landslide-prone areas. To date, a large number of machine learning methods have been applied to LSM, and recently the advanced Convolutional Neural Network (CNN) has been gradually adopted to enhance the prediction accuracy of LSM. The objective of this study is to introduce a CNN-based model in LSM and systematically compare its overall performance with the conventional machine learning models of random forest, logistic regression, and support vector machine. Herein, we selected the Jiuzhaigou region in Sichuan Province, China, as the study area. A total of 710 landslides and 12 predisposing factors were stacked to form spatial datasets for LSM. The ROC analysis and several statistical metrics, such as accuracy, root mean square error (RMSE), Kappa coefficient, sensitivity, and specificity, were used to evaluate the performance of the models on the training and validation datasets. Finally, the trained models were applied and the landslide susceptibility zones were mapped. Results suggest that both CNN and conventional machine learning-based models have a satisfactory performance (AUC: 85.72%–90.17%). The CNN-based model exhibits excellent goodness-of-fit and prediction capability, achieves the highest performance (AUC: 90.17%), and also significantly reduces the salt-and-pepper effect, which indicates its great potential for application to LSM.


Landslide susceptibility mapping using machine learning for Wenchuan County, Sichuan province, China

E3S Web of Conferences ◽

10.1051/e3sconf/202019803023 ◽

2020 ◽

Vol 198 ◽

pp. 03023

Author(s):

Xin Yang ◽

Rui Liu ◽

Luyao Li ◽

Mei Yang ◽

Yuantao Yang

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Machine Learning Algorithms ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Roc Curve Analysis ◽

Learning Methods ◽

Machine Learning Methods ◽

Boosted Decision Tree

Landslide susceptibility mapping is a method used to assess the probability and spatial distribution of landslide occurrences. Machine learning methods have been widely used in landslide susceptibility assessment in recent years. In this paper, six popular machine learning algorithms, namely logistic regression, multi-layer perceptron, random forests, support vector machine, AdaBoost, and gradient boosted decision tree, were used to construct landslide susceptibility models with a total of 1365 landslide points and 14 predisposing factors. Subsequently, the landslide susceptibility maps (LSM) were generated by the trained models. The LSM show that the main landslide zone is concentrated in the southeastern area of Wenchuan County. The result of ROC curve analysis shows that all models fitted the training datasets and achieved satisfactory results on the validation datasets. The results of this paper reveal that machine learning methods are feasible for building robust landslide susceptibility models.


Landslide Susceptibility Mapping Using the Stacking Ensemble Machine Learning Method in Lushui, Southwest China

Applied Sciences ◽

10.3390/app10114016 ◽

2020 ◽

Vol 10 (11) ◽

pp. 4016 ◽

Cited By ~ 3

Author(s):

Xudong Hu ◽

Han Zhang ◽

Hongbo Mei ◽

Dunhui Xiao ◽

Yuanyuan Li ◽

...

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Southwest China ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Machine Learning Method ◽

Learning Method ◽

Statistical Measures ◽

Ensemble Machine Learning

Landslide susceptibility mapping is considered to be a prerequisite for landslide prevention and mitigation. However, delineating the spatial occurrence pattern of landslides remains a challenge. This study investigates the potential application of the stacking ensemble learning technique for landslide susceptibility assessment. In particular, support vector machine (SVM), artificial neural network (ANN), logistic regression (LR), and naive Bayes (NB) were selected as base learners for the stacking ensemble method. The resampling scheme and Pearson’s correlation analysis were jointly used to evaluate the importance level of these base learners. A total of 388 landslides and 12 conditioning factors in the Lushui area (Southwest China) were used as the dataset to develop the landslide models. The landslides were randomly separated into two parts, with 70% used for model training and 30% used for model validation. The models’ performance was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC) and statistical measures. The results showed that the stacking-based ensemble model achieved an improved predictive accuracy as compared to the single algorithms, while the SVM-ANN-NB-LR (SANL) model, the SVM-ANN-NB (SAN) model, and the ANN-NB-LR (ANL) model performed equally well, with AUC values of 0.931, 0.940, and 0.932, respectively, in the validation stage. The correlation coefficient between LR and SVM was the highest for all resampling rounds, with a value of 0.72 on average. This indicates that LR and SVM played an almost equal role when the SANL ensemble was applied for landslide susceptibility analysis. Therefore, it is feasible to use the SAN model or the ANL model for the study area. The findings of this study suggest that the stacking ensemble machine learning method is promising for landslide susceptibility mapping in the Lushui area and is capable of targeting areas prone to landslides.


Landslide susceptibility mapping using Forest by Penalizing Attributes (FPA) algorithm based machine learning approach

Vietnam Journal of Earth Sciences ◽

10.15625/0866-7187/42/3/15047 ◽

2020 ◽

Vol 42 (3) ◽

Cited By ~ 1

Author(s):

Tran Van Phong ◽

Hai-Bang Ly ◽

Phan Trong Trinh ◽

Indra Prakash

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Susceptibility Map ◽

Conditioning Factors ◽

Environmental Conditioning ◽

Landslide Modeling ◽

Machine Learning Approach ◽

First Time

Landslide susceptibility mapping is a helpful tool for the assessment and management of landslides in an area. In this study, we applied, for the first time, a Forest by Penalizing Attributes (FPA) algorithm-based Machine Learning (ML) approach for mapping landslide susceptibility in Muong Lay district (Vietnam). For this aim, 217 historical landslide locations were identified and analyzed for the development of the FPA model and the generation of the susceptibility map. Nine topographical and geo-environmental landslide conditioning factors (curvature, geology/lithology, aspect, distance from faults, rivers and roads, weathering crust, slope, and deep division) were utilized to construct the training and validation datasets for landslide modeling. Different quantitative statistical indices, including the Area Under the Receiver Operating Characteristic (ROC) curve (AUC), were used to evaluate the performance of the model. The results indicate that the predictive capability of the FPA is very good for landslide susceptibility mapping on both the training (AUC = 0.935) and validation (AUC = 0.882) datasets. Thus, the novel FPA-based ML model can be utilized for the development of an accurate landslide susceptibility map of the study area, and this approach can also be applied in other landslide-prone areas.


AN EVALUATION OF LANDSLIDE SUSCEPTIBILITY MAPPING USING REMOTE SENSING DATA AND MACHINE LEARNING ALGORITHMS IN IRAN

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-2-w5-503-2019 ◽

2019 ◽

Vol IV-2/W5 ◽

pp. 503-511 ◽

Cited By ~ 2

Author(s):

B. Kalantar ◽

N. Ueda ◽

H. A. H. Al-Najjar ◽

M. B. A. Gibril ◽

U. S. Lay ◽

...

Keyword(s):

Machine Learning ◽

Land Use ◽

Roc Curve ◽

Landslide Susceptibility ◽

Total Curvature ◽

Susceptibility Mapping ◽

Machine Learning Algorithms ◽

Landslide Susceptibility Mapping ◽

Inventory Data ◽

Prediction Rate

Landslides are regarded as one of the most prevalent and devastating forms of mass movement affecting people and their environment. The specific objective of this research paper is to investigate the application and performance of selected machine learning algorithms (MLA) in landslide susceptibility mapping in the Dodangeh watershed, Iran. An inventory of 112 past landslide sample points was generated from existing data and field observations. In addition, fourteen landslide-conditioning parameters were derived from a DEM and other topographic databases for the modelling process. These conditioning parameters include total curvature, profile curvature, plan curvature, slope, aspect, altitude, topographic wetness index (TWI), topographic roughness index (TRI), stream transport index (STI), stream power index (SPI), lithology, land use, distance to stream, and distance to fault. Meanwhile, factor analysis was employed to optimize the landslide conditioning parameters and the inventory data, by assessing multi-collinearity effects and detecting outliers, respectively. The inventory data were divided into a 70% (78) training dataset and a 30% (34) test dataset for model validation. The receiver operating characteristic (ROC) curve and the area under the curve (AUC) value were used for assessing the models' performance. The findings reveal that TRI has a 0.89 collinearity effect based on the variance inflation factor (VIF), and that, based on Gini factor optimization, total curvature is not significant in the model development; therefore, the two parameters were excluded from the modelling. All the selected MLAs (RF, BRT, and DT) showed promising performance in landslide susceptibility mapping in the Dodangeh watershed, Iran. For RF, the ROC curves give an 86% success rate for training and an 83% prediction rate for validation, the best model performance compared with BRT and DT, which have prediction rates of 72% and 70%, respectively. In conclusion, RF could be the best algorithm for producing a landslide susceptibility map, and such results could be adopted in the decision-making process to support land use planners in improving landslide risk assessment in similar environmental settings.
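
The 70%/30% partition of the 112 inventory points can be reproduced arithmetically with a minimal sketch. It assumes a simple seeded random shuffle and rounding down of the training count; the abstract does not say how the split was actually drawn, and the function name and integer point identifiers are hypothetical.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_inventory(
    points: Sequence[T], train_fraction: float = 0.7, seed: int = 0
) -> Tuple[List[T], List[T]]:
    """Shuffle a landslide inventory and split it into training and test subsets."""
    shuffled = list(points)
    # A dedicated Random instance keeps the split reproducible (assumed seed)
    random.Random(seed).shuffle(shuffled)
    # Round down: 112 * 0.7 = 78.4, so 78 training points and 34 test points
    n_train = int(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]


# Hypothetical identifiers standing in for the 112 inventory points
point_ids = list(range(112))
train_ids, test_ids = split_inventory(point_ids)
print(len(train_ids), len(test_ids))
```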


Conditioning Factors Determination for Landslide Susceptibility Mapping Using Support Vector Machine Learning

IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2019.8898340 ◽

2019 ◽

Author(s):

Bahareh Kalantar ◽

Naonori Ueda ◽

Usman Salihu Lay ◽

Husam Abdulrasool H. Al-Najjar ◽

Alfian Abdul Halin

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Conditioning Factors


Using landslide-inventory mapping for a combined bagged-trees and logistic-regression approach to determining landslide susceptibility in eastern Kentucky, United States

Quarterly Journal of Engineering Geology and Hydrogeology ◽

10.1144/qjegh2020-177 ◽

2021 ◽

pp. qjegh2020-177

Author(s):

Matthew M. Crawford ◽

Jason M. Dortch ◽

Hudson J. Koch ◽

Ashton A. Killen ◽

Junfeng Zhu ◽

...

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Standard Deviation ◽

Landslide Susceptibility ◽

Engineering Geology ◽

Susceptibility Mapping ◽

Landslide Inventory ◽

Landslide Susceptibility Mapping ◽

Susceptibility Map ◽

Landslide Occurrence

High-resolution LiDAR-derived datasets from a 1.5-m digital elevation model and a detailed landslide inventory (N ≥ 1,000) for Magoffin County, Kentucky, USA, were used to develop a combined machine-learning and statistical approach to improve geomorphic-based landslide-susceptibility mapping. An initial dataset of 36 variables was compiled to investigate the connection between slope morphology and landslide occurrence. Bagged trees, a machine-learning random-forest classifier, was used to evaluate the geomorphic variables, and 12 were identified as important: standard deviation of plan curvature, standard deviation of elevation, sum of plan curvature, minimum slope, mean plan curvature, range of elevation, sum of roughness, mean curvature, sum of curvature, mean roughness, minimum curvature, and standard deviation of curvature. These variables were further evaluated using logistic regression to determine the probability of landslide occurrence and then used to create a landslide-susceptibility map. The performance of the logistic-regression model was evaluated by the receiver operating characteristic curve, area under the curve, which was 0.83. Standard deviations from the probability mean were used to set landslide-susceptibility classifications: low (0–0.10), low–moderate (0.11–0.27), moderate (0.28–0.44), moderate–high (0.45–0.7), and high (0.7–1.0). Logistic-regression results were validated by using a separate landslide inventory for the neighboring Prestonsburg 7.5-minute quadrangle, and running the same regression function. Results indicate that 74.9 percent of the landslide deposits were identified as having moderate, moderate–high, or high landslide susceptibility. Combining inventory mapping with statistical modelling identified important geomorphic variables and produced a useful approach to landslide-susceptibility mapping.
Thematic collection: This article is part of the Digitization and Digitalization in engineering geology and hydrogeology collection available at: https://www.lyellcollection.org/cc/digitization-and-digitalization-in-engineering-geology-and-hydrogeology
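
The susceptibility classes listed in this abstract can be expressed as a small lookup, sketched here under stated assumptions. The class bounds come from the abstract; how values on a boundary or in the gaps between printed ranges (for example 0.105) are assigned is not stated, so this sketch assumes each upper bound is inclusive and anything above it falls into the next class. The function and constant names are hypothetical.

```python
from bisect import bisect_left

# Upper bounds of the first four classes, taken from the abstract
UPPER_BOUNDS = [0.10, 0.27, 0.44, 0.70]
CLASS_NAMES = ["low", "low-moderate", "moderate", "moderate-high", "high"]


def susceptibility_class(probability: float) -> str:
    """Map a logistic-regression probability in [0, 1] to a susceptibility class.

    Assumption: upper bounds are inclusive, so 0.10 is "low" and 0.70 is
    "moderate-high"; values just above a bound fall into the next class.
    """
    if not 0.0 <= probability <= 1.0:
        raise ValueError("probability must lie in [0, 1]")
    # bisect_left returns the index of the first bound >= probability
    return CLASS_NAMES[bisect_left(UPPER_BOUNDS, probability)]


for p in (0.05, 0.10, 0.30, 0.50, 0.95):
    print(p, susceptibility_class(p))
```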


Comparative Study of Convolutional Neural Network and Conventional Machine Learning Methods for Landslide Susceptibility Mapping

Remote Sensing ◽

10.3390/rs14020321 ◽

2022 ◽

Vol 14 (2) ◽

pp. 321

Author(s):

Rui Liu ◽

Xin Yang ◽

Chong Xu ◽

Liangshuai Wei ◽

Xiangqiang Zeng

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods ◽

Conventional Machine

Landslide susceptibility mapping (LSM) is a useful tool to estimate the probability of landslide occurrence, providing a scientific basis for natural hazards prevention, land use planning, and economic development in landslide-prone areas. To date, a large number of machine learning methods have been applied to LSM, and recently the advanced convolutional neural network (CNN) has been gradually adopted to enhance the prediction accuracy of LSM. The objective of this study is to introduce a CNN-based model in LSM and systematically compare its overall performance with the conventional machine learning models of random forest, logistic regression, and support vector machine. Herein, we selected Zhangzha Town in Sichuan Province, China, and Lantau Island in Hong Kong, China, as the study areas. Each landslide inventory and corresponding predisposing factors were stacked to form spatial datasets for LSM. The receiver operating characteristic analysis, area under the curve (AUC), and several statistical metrics, such as accuracy, root mean square error, Kappa coefficient, sensitivity, and specificity, were used to evaluate the performance of the models. Finally, the trained models were applied, and the landslide susceptibility zones were mapped. Results suggest that both CNN and conventional machine learning-based models have a satisfactory performance. The CNN-based model exhibits an excellent prediction capability, achieves the highest performance, and also significantly reduces the salt-and-pepper effect, which indicates its great potential for application to LSM.


Novel Ensemble-Based Machine Learning Models Based on The Bagging, Boosting and Random Subspace Methods for Landslide Susceptibility Mapping

10.21203/rs.3.rs-649364/v1 ◽

2021 ◽

Author(s):

Ali Nouh Mabdeh ◽

Akif Al-Fugara ◽

Mohammad Ahmadlou ◽

Biswajeet Pradhan

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Random Subspace ◽

Learning Models ◽

Ensemble Models ◽

Conditioning Factors ◽

Machine Learning Models

Individual machine learning models show different limitations, such as low generalization power for modeling nonlinear phenomena with complex behavior. In recent years, one of the best approaches to this issue has been to use ensemble models. The purpose of this paper is to investigate the predictive power and modeling of three novel ensemble models constructed with four machine learning models, Decision Tree (DT), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naive Bayes (NB), based on the three approaches of Bagging, Boosting, and Random Subspace (RS), in landslide susceptibility mapping (LSM) in the Province of Ajloun in Jordan. A total of 91 landslide locations along with 16 conditioning factors in LSM were identified and used. Before modeling, the effective conditioning factors in LSM were selected using a genetic algorithm and four single models, namely DT, KNN, NB, and SVM. The selected factors were used in modeling with the individual and ensemble models. The results show that the area under the receiver operating characteristic curve (AUROC) for the ensemble models is significantly higher than for the individual models, on average by 14%. Based on the results, the most accurate models were the RS ensemble model (AUROC = 0.850), Boosting (AUROC = 0.848), and Bagging (AUROC = 0.814), respectively. This study showed that by combining the results of simple machine learning models into ensemble models, models with the desired accuracy can be achieved.
