A hybrid optimization method of factor screening predicated on GeoDetector and Random Forest for Landslide Susceptibility Mapping

Geomorphology ◽  
2021 ◽  
Vol 379 ◽  
pp. 107623
Author(s):  
Deliang Sun ◽  
Shuxian Shi ◽  
Haijia Wen ◽  
Jiahui Xu ◽  
Xinzhi Zhou ◽  
...  
Author(s):  
Yue Wang ◽  
Deliang Sun ◽  
Haijia Wen ◽  
Hong Zhang ◽  
Fengtai Zhang

To compare the random forest (RF) model and the frequency ratio (FR) model for landslide susceptibility mapping (LSM), this research selected Yunyang Country as the study area for its frequent natural disasters; especially landslides. A landslide inventory was built by historical records; satellite images; and extensive field surveys. Subsequently; a geospatial database was established based on 987 historical landslides in the study area. Then; all the landslides were randomly divided into two datasets: 70% of them were used as the training dataset and 30% as the test dataset. Furthermore; under five primary conditioning factors (i.e., topography factors; geological factors; environmental factors; human engineering activities; and triggering factors), 22 secondary conditioning factors were selected to form an evaluation factor library for analyzing the landslide susceptibility. On this basis; the RF model training and the FR model mathematical analysis were performed; and the established models were used for the landslide susceptibility simulation in the entire area of Yunyang County. Next; based on the analysis results; the susceptibility maps were divided into five classes: very low; low; medium; high; and very high. In addition; the importance of conditioning factors was ranked and the influence of landslides was explored by using the RF model. The area under the curve (AUC) value of receiver operating characteristic (ROC) curve; precision; accuracy; and recall ratio were used to analyze the predictive ability of the above two LSM models. The results indicated a difference in the performances between the two models. The RF model (AUC = 0.988) performed better than the FR model (AUC = 0.716). Moreover; compared with the FR model; the RF model showed a higher coincidence degree between the areas in the high and the very low susceptibility classes; on the one hand; and the geographical spatial distribution of historical landslides; on the other hand. Therefore; it was concluded that the RF model was more suitable for landslide susceptibility evaluation in Yunyang County; because of its significant model performance; reliability; and stability. The outcome also provided a theoretical basis for application of machine learning techniques (e.g., RF) in landslide prevention; mitigation; and urban planning; so as to deliver an adequate response to the increasing demand for effective and low-cost tools in landslide susceptibility assessments.


2021 ◽  
Vol 9 ◽  
Author(s):  
Shibao Wang ◽  
Jianqi Zhuang ◽  
Jia Zheng ◽  
Hongyu Fan ◽  
Jiaxu Kong ◽  
...  

Landslides are widely distributed worldwide and often result in tremendous casualties and economic losses, especially in the Loess Plateau of China. Taking Wuqi County in the hinterland of the Loess Plateau as the research area, using Bayesian hyperparameters to optimize random forest and extreme gradient boosting decision trees model for landslide susceptibility mapping, and the two optimized models are compared. In addition, 14 landslide influencing factors are selected, and 734 landslides are obtained according to field investigation and reports from literals. The landslides were randomly divided into training data (70%) and validation data (30%). The hyperparameters of the random forest and extreme gradient boosting decision tree models were optimized using a Bayesian algorithm, and then the optimal hyperparameters are selected for landslide susceptibility mapping. Both models were evaluated and compared using the receiver operating characteristic curve and confusion matrix. The results show that the AUC validation data of the Bayesian optimized random forest and extreme gradient boosting decision tree model are 0.88 and 0.86, respectively, which showed an improvement of 4 and 3%, indicating that the prediction performance of the two models has been improved. However, the random forest model has a higher predictive ability than the extreme gradient boosting decision tree model. Thus, hyperparameter optimization is of great significance in the improvement of the prediction accuracy of the model. Therefore, the optimized model can generate a high-quality landslide susceptibility map.


Forests ◽  
2020 ◽  
Vol 11 (4) ◽  
pp. 421 ◽  
Author(s):  
Viet-Ha Nhu ◽  
Ataollah Shirzadi ◽  
Himan Shahabi ◽  
Wei Chen ◽  
John J Clague ◽  
...  

We generated high-quality shallow landslide susceptibility maps for Bijar County, Kurdistan Province, Iran, using Random Forest (RAF), an ensemble computational intelligence method and three meta classifiers—Bagging (BA, BA-RAF), Random Subspace (RS, RS-RAF), and Rotation Forest (RF, RF-RAF). Modeling and validation were done on 111 shallow landslide locations using 20 conditioning factors tested by the Information Gain Ratio (IGR) technique. We assessed model performance with statistically based indexes, including sensitivity, specificity, accuracy, kappa, root mean square error (RMSE), and area under the receiver operatic characteristic curve (AUC). All four machine learning models that we tested yielded excellent goodness-of-fit and prediction accuracy, but the RF-RAF ensemble model (AUC = 0.936) outperformed the BA-RAF, RS-RAF (AUC = 0.907), and RAF (AUC = 0.812) models. The results also show that the Random Forest model significantly improved the predictive capability of the RAF-based classifier and, therefore, can be considered as a useful and an effective tool in regional shallow landslide susceptibility mapping.


2021 ◽  
pp. 1-20
Author(s):  
Renata Pacheco Quevedo ◽  
Daniel Andrade Maciel ◽  
Tatiana Dias Tardelli Uehara ◽  
Matej Vojtek ◽  
Camilo Daleles Rennó ◽  
...  

2021 ◽  
Vol 9 ◽  
Author(s):  
Zhou Zhao ◽  
Zeng yuan Liu ◽  
Chong Xu

Landslide susceptibility mapping is very important for landslide risk evaluation and land use planning. Toward this end, this paper presents a case study in Ningqiang County, Shanxi Province, China. Slope units were selected as the basic mapping units. A traditional statistical certainty factor model (CF), a machine learning support vector machine model (SVM) and random forest model (RF), along with a hybrid CF-SVM model and a CF-RF model were applied to analyze landslide susceptibility. Firstly, 10 landslide conditioning factors were selected, namely slope-angle, altitude, slope aspect, degree of relief, lithology, distance to rivers, distance to faults, distance to roads, average annual rainfall and normalized difference vegetation index. The 23,169 slope units were generated from a Digital Elevation Model and the corresponding 10 conditioning factor layers were produced from both geological and geographical data. Then, landslide susceptibility mapping was carried out using the five models, respectively. Next, the landslide density (LD), frequency ratio (FR), the area under the curve (AUC) and other indicators were used to validate the rationality, performance and accuracy of the models. The results showed that the susceptibility maps produced from the different models were all reasonable. In each map, the LD and FR were greatest in the zones classed as having very high landslide susceptibility, followed by the high, moderate, low and very low landslide susceptibility classes, respectively. From the comparison of the different maps and ROC curves, the RF model based on slope units was the most appropriate for landslide susceptibility mapping in the study area. It was also found that the combination of weaker learner model (CF model here) with a stronger learner model (SVM and RF model here) can impact the applicability of the stronger model.


Sensors ◽  
2019 ◽  
Vol 19 (12) ◽  
pp. 2685 ◽  
Author(s):  
Fumeng Zhao ◽  
Xingmin Meng ◽  
Yi Zhang ◽  
Guan Chen ◽  
Xiaojun Su ◽  
...  

Geological conditions along the Karakorum Highway (KKH) promote the occurrence of frequent natural disasters, which pose a serious threat to its normal operation. Landslide susceptibility mapping (LSM) provides a basis for analyzing and evaluating the degree of landslide susceptibility of an area. However, there has been limited analysis of actual landslide activity processes in real-time. The SBAS-InSAR (Small Baseline Subsets-Interferometric Synthetic Aperture Radar) method can fully consider the current landslide susceptibility situation and, thus, it can be used to optimize the results of LSM. In this study, we compared the results of LSM using logistic regression and Random Forest models along the KKH. Both approaches produced a classification in terms of very low, low, moderate, high, and very high landslide susceptibility. The evaluation results of the two models revealed a high susceptibility of land sliding in the Gaizi Valley and the Tashkurgan Valley. The Receiver Operating Characteristic (ROC) curve and historical landslide verification points were used to compare the evaluation accuracy of the two models. The Area under Curve (AUC) value of the Random Forest model was 0.981, and 98.79% of the historical landslide points in the verification points fell within the range of high and very high landslide susceptibility degrees. The Random Forest evaluation results were found to be superior to those of the logistic regression and they were combined with the SBAS-InSAR results to conduct a new LSM. The results showed an increase in the landslide susceptibility degree for 2808 cells. We conclude that this optimized landslide susceptibility mapping can provide valuable decision support for disaster prevention and it also provides theoretical guidance for the maintenance and normal operation of KKH.


Sign in / Sign up

Export Citation Format

Share Document