Coupling logistic model tree and random subspace to predict the landslide susceptibility areas with considering the uncertainty of environmental features

Abstract Landslide disasters cause huge casualties and economic losses every year, how to accurately forecast the landslides has always been an important issue in geo-environment research. In this paper, a hybrid machine learning approach RSLMT is firstly proposed by coupling Random Subspace (RS) and Logistic Model Tree (LMT) for producing a landslide susceptibility map (LSM). With this method, the uncertainty introduced by input features is considered, the problem of overfitting is solved by reducing dimensions to increase the prediction rate of landslide occurrence. Moreover, the uncertainty of prediction will be deeply discussed with the rank probability score (RPS) series, which is an important evaluation of uncertainty but rarely used in LSM. Qingchuan county, China was taken as a study area. 12 landslide causal factors were selected and their contribution on landslide occurrence was evaluated by ReliefF method. In addition, Logistic Model Tree (LMT), Naive Bayes (NB) and Logistic Regression (LR) were researched for comparison. The results showed that RSLMT (AUC = 0.815) outperformed LMT (AUC = 0.805), NB (AUC = 0.771), LR (AUC = 0.785). LSM of Qingchuan county was produced using the novel model, it indicated that landslides tend to occur along with the fault belts and the middle-low mountain area that is strongly influenced by the large numbers of human engineering activities.

Download Full-text

A novel ensemble approach of bivariate statistical-based logistic model tree classifier for landslide susceptibility assessment

Geocarto International ◽

10.1080/10106049.2018.1425738 ◽

2018 ◽

Vol 33 (12) ◽

pp. 1398-1420 ◽

Cited By ~ 58

Author(s):

Wei Chen ◽

Himan Shahabi ◽

Ataollah Shirzadi ◽

Tao Li ◽

Chen Guo ◽

...

Keyword(s):

Landslide Susceptibility ◽

Logistic Model ◽

Susceptibility Assessment ◽

Landslide Susceptibility Assessment ◽

Ensemble Approach ◽

Model Tree ◽

Tree Classifier ◽

Logistic Model Tree

Download Full-text

Spatial prediction of landslide susceptibility by combining evidential belief function, logistic regression and logistic model tree

Geocarto International ◽

10.1080/10106049.2019.1588393 ◽

2019 ◽

Vol 34 (11) ◽

pp. 1177-1201 ◽

Cited By ~ 37

Author(s):

Wei Chen ◽

Xia Zhao ◽

Himan Shahabi ◽

Ataollah Shirzadi ◽

Khabat Khosravi ◽

...

Keyword(s):

Logistic Regression ◽

Landslide Susceptibility ◽

Logistic Model ◽

Spatial Prediction ◽

Belief Function ◽

Evidential Belief Function ◽

Model Tree ◽

Logistic Model Tree

Download Full-text

Landslide Detection and Susceptibility Modeling on Cameron Highlands (Malaysia): A Comparison between Random Forest, Logistic Regression and Logistic Model Tree Algorithms

Forests ◽

10.3390/f11080830 ◽

2020 ◽

Vol 11 (8) ◽

pp. 830 ◽

Cited By ~ 2

Author(s):

Viet-Ha Nhu ◽

Ayub Mohammadi ◽

Himan Shahabi ◽

Baharin Bin Ahmad ◽

Nadhir Al-Ansari ◽

...

Keyword(s):

Machine Learning ◽

Remote Sensing ◽

Logistic Regression ◽

Random Forest ◽

Landslide Susceptibility ◽

Predictive Value ◽

Logistic Model ◽

Model Tree ◽

Cameron Highlands ◽

Logistic Model Tree

We used remote sensing techniques and machine learning to detect and map landslides, and landslide susceptibility in the Cameron Highlands, Malaysia. We located 152 landslides using a combination of interferometry synthetic aperture radar (InSAR), Google Earth (GE), and field surveys. Of the total slide locations, 80% (122 landslides) were utilized for training the selected algorithms, and the remaining 20% (30 landslides) were applied for validation purposes. We employed 17 conditioning factors, including slope angle, aspect, elevation, curvature, profile curvature, stream power index (SPI), topographic wetness index (TWI), lithology, soil type, land cover, normalized difference vegetation index (NDVI), distance to river, distance to fault, distance to road, river density, fault density, and road density, which were produced from satellite imageries, geological map, soil maps, and a digital elevation model (DEM). We used these factors to produce landslide susceptibility maps using logistic regression (LR), logistic model tree (LMT), and random forest (RF) models. To assess prediction accuracy of the models we employed the following statistical measures: negative predictive value (NPV), sensitivity, positive predictive value (PPV), specificity, root-mean-squared error (RMSE), accuracy, and area under the receiver operating characteristic (ROC) curve (AUC). Our results indicated that the AUC was 92%, 90%, and 88% for the LMT, LR, and RF algorithms, respectively. To assess model performance, we also applied non-parametric statistical tests of Friedman and Wilcoxon, where the results revealed that there were no practical differences among the used models in the study area. While landslide mapping in tropical environment such as Cameron Highlands remains difficult, the remote sensing (RS) along with machine learning techniques, such as the LMT model, show promise for landslide susceptibility mapping in the study area.

Download Full-text

GIS-based comparative study of Bayes network, Hoeffding tree and logistic model tree for landslide susceptibility modeling

CATENA ◽

10.1016/j.catena.2021.105344 ◽

2021 ◽

Vol 203 ◽

pp. 105344

Author(s):

Wenwu Chen ◽

Shuai Zhang

Keyword(s):

Comparative Study ◽

Landslide Susceptibility ◽

Logistic Model ◽

Susceptibility Modeling ◽

Bayes Network ◽

Model Tree ◽

Hoeffding Tree ◽

Logistic Model Tree

Download Full-text

A comparative study of support vector machine and logistic model tree classifiers for shallow landslide susceptibility modeling

Environmental Earth Sciences ◽

10.1007/s12665-019-8562-z ◽

2019 ◽

Vol 78 (18) ◽

Cited By ~ 15

Author(s):

Mousa Abedini ◽

Bahareh Ghasemian ◽

Ataollah Shirzadi ◽

Dieu Tien Bui

Keyword(s):

Support Vector Machine ◽

Comparative Study ◽

Landslide Susceptibility ◽

Logistic Model ◽

Shallow Landslide ◽

Support Vector ◽

Susceptibility Modeling ◽

Model Tree ◽

Logistic Model Tree

Download Full-text

Shallow Landslide Susceptibility Mapping: A Comparison between Logistic Model Tree, Logistic Regression, Naïve Bayes Tree, Artificial Neural Network, and Support Vector Machine Algorithms

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph17082749 ◽

2020 ◽

Vol 17 (8) ◽

pp. 2749 ◽

Cited By ~ 18

Author(s):

Viet-Ha Nhu ◽

Ataollah Shirzadi ◽

Himan Shahabi ◽

Sushant K. Singh ◽

Nadhir Al-Ansari ◽

...

Keyword(s):

Support Vector Machine ◽

Logistic Regression ◽

Landslide Susceptibility ◽

Logistic Model ◽

Naive Bayes ◽

Shallow Landslide ◽

Naïve Bayes ◽

Support Vector ◽

Model Tree ◽

Logistic Model Tree

Shallow landslides damage buildings and other infrastructure, disrupt agriculture practices, and can cause social upheaval and loss of life. As a result, many scientists study the phenomenon, and some of them have focused on producing landslide susceptibility maps that can be used by land-use managers to reduce injury and damage. This paper contributes to this effort by comparing the power and effectiveness of five machine learning, benchmark algorithms—Logistic Model Tree, Logistic Regression, Naïve Bayes Tree, Artificial Neural Network, and Support Vector Machine—in creating a reliable shallow landslide susceptibility map for Bijar City in Kurdistan province, Iran. Twenty conditioning factors were applied to 111 shallow landslides and tested using the One-R attribute evaluation (ORAE) technique for modeling and validation processes. The performance of the models was assessed by statistical-based indexes including sensitivity, specificity, accuracy, mean absolute error (MAE), root mean square error (RMSE), and area under the receiver operatic characteristic curve (AUC). Results indicate that all the five machine learning models performed well for shallow landslide susceptibility assessment, but the Logistic Model Tree model (AUC = 0.932) had the highest goodness-of-fit and prediction accuracy, followed by the Logistic Regression (AUC = 0.932), Naïve Bayes Tree (AUC = 0.864), ANN (AUC = 0.860), and Support Vector Machine (AUC = 0.834) models. Therefore, we recommend the use of the Logistic Model Tree model in shallow landslide mapping programs in semi-arid regions to help decision makers, planners, land-use managers, and government agencies mitigate the hazard and risk.

Download Full-text

Zonation of Landslide Susceptibility in Ruijin, Jiangxi, China

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph18115906 ◽

2021 ◽

Vol 18 (11) ◽

pp. 5906

Author(s):

Xiaoting Zhou ◽

Weicheng Wu ◽

Ziyu Lin ◽

Guiliang Zhang ◽

Renxiang Chen ◽

...

Keyword(s):

Environmental Factors ◽

Landslide Susceptibility ◽

Urban Areas ◽

Support Vector ◽

Susceptibility Map ◽

Human Society ◽

Learning Approaches ◽

Prevention Measures ◽

Landslide Occurrence ◽

Better Than

Landslides are one of the major geohazards threatening human society. The objective of this study was to conduct a landslide hazard susceptibility assessment for Ruijin, Jiangxi, China, and to provide technical support to the local government for implementing disaster reduction and prevention measures. Machine learning approaches, e.g., random forests (RFs) and support vector machines (SVMs) were employed and multiple geo-environmental factors such as land cover, NDVI, landform, rainfall, lithology, and proximity to faults, roads, and rivers, etc., were utilized to achieve our purposes. For categorical factors, three processing approaches were proposed: simple numerical labeling (SNL), weight assignment (WA)-based and frequency ratio (FR)-based. Then 19 geo-environmental factors were respectively converted into raster to constitute three 19-band datasets, i.e., DS1, DS2, and DS3 from three different processes. Then, 155 observed landslides that occurred in the past decades were vectorized, among which 70% were randomly selected to compose a training set (TS1) and the remaining 30% to form a validation set (VS1). A number of non-landslide (no-risk) samples distributed in the whole study area were identified in low slope (<1–3°) zones such as urban areas and croplands, and also added to the TS1 and VS1 in the same ratio. For comparison, we used the FR approach to identify the no-risk samples in both flat and non-flat areas, and merged them into the field-observed landslides to constitute another pair of training and validation sets (TS2 and VS2) using the same ratio of 7:3. The RF algorithm was applied to model the probability of the landslide occurrence using DS1, DS2, and DS3 as predictive variables and TS1 and TS2 for training to obtain the SNL-based, WA-based, and FR-based RF models, respectively. Verified against VS1 and VS2, the three models have similar overall accuracy (OA) and Kappa coefficient (KC), which are 89.61%, 91.47%, and 94.54%, and 0.7926, 0.8299, and 0.8908, respectively. All of them are much better than the three models obtained by SVM algorithm with OA of 81.79%, 82.86%, and 83%, and KC of 0.6337, 0.655, and 0.660. New case verification with the recent 26 landslide events of 2017–2020 revealed that the landslide susceptibility map from WA-based RF modeling was able to properly identify the high and very high susceptibility zones where 23 new landslides had occurred, and performed better than the SNL-based and FR-based RF modeling, though the latter has a slightly higher OA and KC. Hence, we concluded that all three RF models achieve reasonable risk prediction, but WA-based and FR-based RF modeling deserves a recommendation for application elsewhere. The results of this study may serve as reference for the local authorities in prevention and early warning of landslide hazards.

Download Full-text

Landslides susceptibility modelling using Multivariate Logistic Regression Model in the Sahla Watershed in Northern Morocco

Sociedade & Natureza ◽

10.14393/sn-v33-2021-59124 ◽

2021 ◽

Vol 33 ◽

Author(s):

Mohammed El-Fengour ◽

Hanifa El Motaki ◽

Aissa El Bouzidi

Keyword(s):

Logistic Regression ◽

Landslide Susceptibility ◽

Slope Angle ◽

Google Earth ◽

Slope Aspect ◽

Multivariate Logistic Regression Model ◽

Susceptibility Map ◽

Potential Hazard ◽

Topographic Wetness Index ◽

Landslide Occurrence

This study aimed to assess landslide susceptibility in the Sahla watershed in northern Morocco. Landslides hazard is the most frequent phenomenon in this part of the state due to its mountainous precarious environment. The abundance of rainfall makes this area suffer mass movements led to a notable adverse impact on the nearby settlements and infrastructures. There were 93 identified landslide scars. Landslide inventories were collected from Google Earth image interpretations. They were prepared out of landslide events in the past, and future landslide occurrence was predicted by correlating landslide predisposing factors. In this paper, landslide inventories are divided into two groups, one for landslide training and the other for validation. The Landslide Susceptibility Map (LSM) is prepared by Logistic Regression (LR) Statistical Method. Lithology, stream density, land use, slope curvature, elevation, topographic wetness index, slope aspect, and slope angle were used as conditioning factors. The Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) was employed to examine the performance of the model. In the analysis, the LR model results in 96% accuracy in the AUC. The LSM consists of the predicted landslide area. Hence it can be used to reduce the potential hazard linked with the landslides in the Sahla watershed area in Rif Mountains in northern Morocco.

Download Full-text

Speeding Up Logistic Model Tree Induction

Knowledge Discovery in Databases: PKDD 2005 - Lecture Notes in Computer Science ◽

10.1007/11564126_72 ◽

2005 ◽

pp. 675-683 ◽

Cited By ~ 87

Author(s):

Marc Sumner ◽

Eibe Frank ◽

Mark Hall

Keyword(s):

Logistic Model ◽

Model Tree ◽

Logistic Model Tree

Download Full-text

Application of Advanced Machine Learning Algorithms to Assess Groundwater Potential Using Remote Sensing-Derived Data

Remote Sensing ◽

10.3390/rs12172742 ◽

2020 ◽

Vol 12 (17) ◽

pp. 2742

Author(s):

Ehsan Kamali Maskooni ◽

Seyed Amir Naghibi ◽

Hossein Hashemi ◽

Ronny Berndtsson

Keyword(s):

Machine Learning ◽

Remote Sensing ◽

Logistic Model ◽

Nearest Neighbors ◽

Driving Factors ◽

Machine Learning Algorithms ◽

Boosted Regression Trees ◽

K Nearest Neighbors ◽

Model Tree ◽

Logistic Model Tree

Groundwater (GW) is being uncontrollably exploited in various parts of the world resulting from huge needs for water supply as an outcome of population growth and industrialization. Bearing in mind the importance of GW potential assessment in reaching sustainability, this study seeks to use remote sensing (RS)-derived driving factors as an input of the advanced machine learning algorithms (MLAs), comprising deep boosting and logistic model trees to evaluate their efficiency. To do so, their results are compared with three benchmark MLAs such as boosted regression trees, k-nearest neighbors, and random forest. For this purpose, we firstly assembled different topographical, hydrological, RS-based, and lithological driving factors such as altitude, slope degree, aspect, slope length, plan curvature, profile curvature, relative slope position, distance from rivers, river density, topographic wetness index, land use/land cover (LULC), normalized difference vegetation index (NDVI), distance from lineament, lineament density, and lithology. The GW spring indicator was divided into two classes for training (434 springs) and validation (186 springs) with a proportion of 70:30. The training dataset of the springs accompanied by the driving factors were incorporated into the MLAs and the outputs were validated by different indices such as accuracy, kappa, receiver operating characteristics (ROC) curve, specificity, and sensitivity. Based upon the area under the ROC curve, the logistic model tree (87.813%) generated similar performance to deep boosting (87.807%), followed by boosted regression trees (87.397%), random forest (86.466%), and k-nearest neighbors (76.708%) MLAs. The findings confirm the great performance of the logistic model tree and deep boosting algorithms in modelling GW potential. Thus, their application can be suggested for other areas to obtain an insight about GW-related barriers toward sustainability. Further, the outcome based on the logistic model tree algorithm depicts the high impact of the RS-based factor, such as NDVI with 100 relative influence, as well as high influence of the distance from river, altitude, and RSP variables with 46.07, 43.47, and 37.20 relative influence, respectively, on GW potential.

Download Full-text