Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China

Yumiao Wang; Xueling Wu; Zhangjian Chen; Fu Ren; Luwei Feng; Qingyun Du

doi:10.3390/ijerph16030368

Optimizing the Predictive Ability of Machine Learning Methods for Landslide Susceptibility Mapping Using SMOTE for Lishui City in Zhejiang Province, China

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph16030368 ◽

2019 ◽

Vol 16 (3) ◽

pp. 368 ◽

Cited By ~ 15

Author(s):

Yumiao Wang ◽

Xueling Wu ◽

Zhangjian Chen ◽

Fu Ren ◽

Luwei Feng ◽

...

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Satellite Images ◽

Predictive Ability ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Training Dataset ◽

Learning Methods ◽

Machine Learning Methods ◽

Susceptibility Maps

The main goal of this study was to use the synthetic minority oversampling technique (SMOTE) to expand the quantity of landslide samples for machine learning methods (i.e., support vector machine (SVM), logistic regression (LR), artiﬁcial neural network (ANN), and random forest (RF)) to produce high-quality landslide susceptibility maps for Lishui City in Zhejiang Province, China. Landslide-related factors were extracted from topographic maps, geological maps, and satellite images. Twelve factors were selected as independent variables using correlation coefficient analysis and the neighborhood rough set (NRS) method. In total, 288 soil landslides were mapped using field surveys, historical records, and satellite images. The landslides were randomly divided into two datasets: 70% of all landslides were selected as the original training dataset and 30% were used for validation. Then, SMOTE was employed to generate datasets with sizes ranging from two to thirty times that of the training dataset to establish and compare the four machine learning methods for landslide susceptibility mapping. In addition, we used slope units to subdivide the terrain to determine the landslide susceptibility. Finally, the landslide susceptibility maps were validated using statistical indexes and the area under the curve (AUC). The results indicated that the performances of the four machine learning methods showed different levels of improvement as the sample sizes increased. The RF model exhibited a more substantial improvement (AUC improved by 24.12%) than did the ANN (18.94%), SVM (17.77%), and LR (3.00%) models. Furthermore, the ANN model achieved the highest predictive ability (AUC = 0.98), followed by the RF (AUC = 0.96), SVM (AUC = 0.94), and LR (AUC = 0.79) models. This approach significantly improves the performance of machine learning techniques for landslide susceptibility mapping, thereby providing a better tool for reducing the impacts of landslide disasters.

Get full-text (via PubEx)

Landslide susceptibility mapping based on convolutional neural network and conventional machine learning methods

10.21203/rs.3.rs-190195/v1 ◽

2021 ◽

Author(s):

Rui Liu ◽

Xin Yang ◽

Chong Xu ◽

Luyao Li ◽

Xiangqiang Zeng

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods ◽

Conventional Machine

Abstract Landslide susceptibility mapping (LSM) is a useful tool to estimate the probability of landslide occurrence, providing a scientific basis for natural hazards prevention, land use planning, and economic development in landslide-prone areas. To date, a large number of machine learning methods have been applied to LSM, and recently the advanced Convolutional Neural Network (CNN) has been gradually adopted to enhance the prediction accuracy of LSM. The objective of this study is to introduce a CNN based model in LSM and systematically compare its overall performance with the conventional machine learning models of random forest, logistic regression, and support vector machine. Herein, we selected the Jiuzhaigou region in Sichuan Province, China as the study area. A total number of 710 landslides and 12 predisposing factors were stacked to form spatial datasets for LSM. The ROC analysis and several statistical metrics, such as accuracy, root mean square error (RMSE), Kappa coefficient, sensitivity, and specificity were used to evaluate the performance of the models in the training and validation datasets. Finally, the trained models were calculated and the landslide susceptibility zones were mapped. Results suggest that both CNN and conventional machine-learning based models have a satisfactory performance (AUC: 85.72% − 90.17%). The CNN based model exhibits excellent good-of-fit and prediction capability, and achieves the highest performance (AUC: 90.17%) but also significantly reduces the salt-of-pepper effect, which indicates its great potential of application to LSM.

Get full-text (via PubEx)

Landslide susceptibility mapping using machine learning for Wenchuan County, Sichuan province, China

E3S Web of Conferences ◽

10.1051/e3sconf/202019803023 ◽

2020 ◽

Vol 198 ◽

pp. 03023

Author(s):

Xin Yang ◽

Rui Liu ◽

Luyao Li ◽

Mei Yang ◽

Yuantao Yang

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Machine Learning Algorithms ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Roc Curve Analysis ◽

Learning Methods ◽

Machine Learning Methods ◽

Boosted Decision Tree

Landslide susceptibility mapping is a method used to assess the probability and spatial distribution of landslide occurrences. Machine learning methods have been widely used in landslide susceptibility in recent years. In this paper, six popular machine learning algorithms namely logistic regression, multi-layer perceptron, random forests, support vector machine, Adaboost, and gradient boosted decision tree were leveraged to construct landslide susceptibility models with a total of 1365 landslide points and 14 predisposing factors. Subsequently, the landslide susceptibility maps (LSM) were generated by the trained models. LSM shows the main landslide zone is concentrated in the southeastern area of Wenchuan County. The result of ROC curve analysis shows that all models fitted the training datasets and achieved satisfactory results on validation datasets. The results of this paper reveal that machine learning methods are feasible to build robust landslide susceptibility models.

Get full-text (via PubEx)

GIS-Based Landslide Susceptibility Mapping Using Remote Sensing Data and Machine Learning Methods

Cartography from Pole to Pole - Lecture Notes in Geoinformation and Cartography ◽

10.1007/978-3-642-32618-9_23 ◽

2013 ◽

pp. 319-333

Author(s):

Fu Ren ◽

Xueling Wu

Keyword(s):

Machine Learning ◽

Remote Sensing ◽

Landslide Susceptibility ◽

Remote Sensing Data ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Learning Methods ◽

Sensing Data ◽

Machine Learning Methods

Get full-text (via PubEx)

Comparative Study of Convolutional Neural Network and Conventional Machine Learning Methods for Landslide Susceptibility Mapping

Remote Sensing ◽

10.3390/rs14020321 ◽

2022 ◽

Vol 14 (2) ◽

pp. 321

Author(s):

Rui Liu ◽

Xin Yang ◽

Chong Xu ◽

Liangshuai Wei ◽

Xiangqiang Zeng

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods ◽

Conventional Machine

Landslide susceptibility mapping (LSM) is a useful tool to estimate the probability of landslide occurrence, providing a scientific basis for natural hazards prevention, land use planning, and economic development in landslide-prone areas. To date, a large number of machine learning methods have been applied to LSM, and recently the advanced convolutional neural network (CNN) has been gradually adopted to enhance the prediction accuracy of LSM. The objective of this study is to introduce a CNN-based model in LSM and systematically compare its overall performance with the conventional machine learning models of random forest, logistic regression, and support vector machine. Herein, we selected Zhangzha Town in Sichuan Province, China, and Lantau Island in Hong Kong, China, as the study areas. Each landslide inventory and corresponding predisposing factors were stacked to form spatial datasets for LSM. The receiver operating characteristic analysis, area under the curve (AUC), and several statistical metrics, such as accuracy, root mean square error, Kappa coefficient, sensitivity, and specificity, were used to evaluate the performance of the models. Finally, the trained models were calculated, and the landslide susceptibility zones were mapped. Results suggest that both CNN and conventional machine learning-based models have a satisfactory performance. The CNN-based model exhibits an excellent prediction capability and achieves the highest performance but also significantly reduces the salt-of-pepper effect, which indicates its great potential for application to LSM.

Get full-text (via PubEx)

Landslide susceptibility mapping in three Upazilas of Rangamati hill district Bangladesh: application and comparison of GIS-based machine learning methods

Geocarto International ◽

10.1080/10106049.2020.1864026 ◽

2021 ◽

pp. 1-27

Author(s):

Yasin Wahid Rabby ◽

Md Belal Hossain ◽

Joynal Abedin

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Learning Methods ◽

Machine Learning Methods ◽

Hill District

Get full-text (via PubEx)

Comparison of Different Machine Learning Methods for Debris Flow Susceptibility Mapping: A Case Study in the Sichuan Province, China

Remote Sensing ◽

10.3390/rs12020295 ◽

2020 ◽

Vol 12 (2) ◽

pp. 295 ◽

Cited By ~ 6

Author(s):

Ke Xiong ◽

Basanta Raj Adhikari ◽

Constantine A. Stamatopoulos ◽

Yu Zhan ◽

Shaolin Wu ◽

...

Keyword(s):

Machine Learning ◽

Debris Flow ◽

Sichuan Province ◽

Susceptibility Mapping ◽

Boosted Regression Trees ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods ◽

Susceptibility Maps ◽

Debris Flow Susceptibility

Debris flow susceptibility mapping is considered to be useful for hazard prevention and mitigation. As a frequent debris flow area, many hazardous events have occurred annually and caused a lot of damage in the Sichuan Province, China. Therefore, this study attempted to evaluate and compare the performance of four state-of-the-art machine-learning methods, namely Logistic Regression (LR), Support Vector Machines (SVM), Random Forest (RF), and Boosted Regression Trees (BRT), for debris flow susceptibility mapping in this region. Four models were constructed based on the debris flow inventory and a range of causal factors. A variety of datasets was obtained through the combined application of remote sensing (RS) and geographic information system (GIS). The mean altitude, altitude difference, aridity index, and groove gradient played the most important role in the assessment. The performance of these modes was evaluated using predictive accuracy (ACC) and the area under the receiver operating characteristic curve (AUC). The results of this study showed that all four models were capable of producing accurate and robust debris flow susceptibility maps (ACC and AUC values were well above 0.75 and 0.80 separately). With an excellent spatial prediction capability and strong robustness, the BRT model (ACC = 0.781, AUC = 0.852) outperformed other models and was the ideal choice. Our results also exhibited the importance of selecting suitable mapping units and optimal predictors. Furthermore, the debris flow susceptibility maps of the Sichuan Province were produced, which can provide helpful data for assessing and mitigating debris flow hazards.

Get full-text (via PubEx)

Hybrid Computational Intelligence Methods for Landslide Susceptibility Mapping

Symmetry ◽

10.3390/sym12030325 ◽

2020 ◽

Vol 12 (3) ◽

pp. 325 ◽

Cited By ~ 20

Author(s):

Guirong Wang ◽

Xinxiang Lei ◽

Wei Chen ◽

Himan Shahabi ◽

Ataollah Shirzadi

Keyword(s):

Landslide Susceptibility ◽

Radial Basis Function Network ◽

Slope Angle ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Training Dataset ◽

Validation Dataset ◽

Slope Aspect ◽

Susceptibility Maps ◽

Landslide Susceptibility Maps

In this study, hybrid integration of MultiBoosting based on two artificial intelligence methods (the radial basis function network (RBFN) and credal decision tree (CDT) models) and geographic information systems (GIS) were used to establish landslide susceptibility maps, which were used to evaluate landslide susceptibility in Nanchuan County, China. First, the landslide inventory map was generated based on previous research results combined with GIS and aerial photos. Then, 298 landslides were identified, and the established dataset was divided into a training dataset (70%, 209 landslides) and a validation dataset (30%, 89 landslides) with ensured randomness, fairness, and symmetry of data segmentation. Sixteen landslide conditioning factors (altitude, profile curvature, plan curvature, slope aspect, slope angle, stream power index (SPI), topographical wetness index (TWI), sediment transport index (STI), distance to rivers, distance to roads, distance to faults, rainfall, NDVI, soil, land use, and lithology) were identified in the study area. Subsequently, the CDT, RBFN, and their ensembles with MultiBoosting (MCDT and MRBFN) were used in ArcGIS to generate the landslide susceptibility maps. The performances of the four landslide susceptibility maps were compared and verified based on the area under the curve (AUC). Finally, the verification results of the AUC evaluation show that the landslide susceptibility mapping generated by the MCDT model had the best performance.

Get full-text (via PubEx)

Application of Ensemble-Based Machine Learning Models to Landslide Susceptibility Mapping

Remote Sensing ◽

10.3390/rs10081252 ◽

2018 ◽

Vol 10 (8) ◽

pp. 1252 ◽

Cited By ~ 50

Author(s):

Prima Kadavi ◽

Chang-Wook Lee ◽

Saro Lee

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Learning Models ◽

Landslide Occurrence ◽

Susceptibility Maps ◽

Multiclass Classifier ◽

Landslide Susceptibility Maps ◽

Machine Learning Models

The main purpose of this study was to produce landslide susceptibility maps using various ensemble-based machine learning models (i.e., the AdaBoost, LogitBoost, Multiclass Classifier, and Bagging models) for the Sacheon-myeon area of South Korea. A landslide inventory map including a total of 762 landslides was compiled based on reports and aerial photograph interpretations. The landslides were randomly separated into two datasets: 70% of landslides were selected for the model establishment and 30% were used for validation purposes. Additionally, 20 landslide condition factors divided into five categories (topographic factors, hydrological factors, soil map, geological map, and forest map) were considered in the landslide susceptibility mapping. The relationships among landslide occurrence and landslide conditioning factors were analyzed and the landslide susceptibility maps were calculated and drawn using the AdaBoost, LogitBoost, Multiclass Classifier, and Bagging models. Finally, the maps were validated using the area under the curve (AUC) method. The Multiclass Classifier method had higher prediction accuracy (85.9%) than the Bagging (AUC = 85.4%), LogitBoost (AUC = 84.8%), and AdaBoost (84.0%) methods.

Get full-text (via PubEx)

A Modelling Tool for Rainfall-triggered Landslide Susceptibility Mapping and Hazard Warning based on GIS and Machine Learning

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/783/1/012074 ◽

2021 ◽

Vol 783 (1) ◽

pp. 012074

Author(s):

Haiwei Zhou ◽

Jianjun Yu ◽

Hangjian Feng ◽

Jie Huang

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Hazard Warning

Get full-text (via PubEx)

Machine learning in earthquake- and typhoon-triggered landslide susceptibility mapping and critical factor identification

Environmental Earth Sciences ◽

10.1007/s12665-021-09510-z ◽

2021 ◽

Vol 80 (6) ◽

Author(s):

Muhammad Zeeshan Ali ◽

Hone-Jay Chu ◽

Yi-Chin Chen ◽

Saleem Ullah

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Critical Factor ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping

Get full-text (via PubEx)