Using Statistical Measures and Machine Learning for Graph Reduction to Solve Maximum Weight Clique Problems

The efficiency of deep learning and tree-based machine learning approaches has gained immense popularity in various fields. One deep learning model viz. convolution neural network (CNN), artificial neural network (ANN) and four tree-based machine learning models, namely, alternative decision tree (ADTree), classification and regression tree (CART), functional tree and logistic model tree (LMT), were used for landslide susceptibility mapping in the East Sikkim Himalaya region of India, and the results were compared. Landslide areas were delimited and mapped as landslide inventory (LIM) after gathering information from historical records and periodic field investigations. In LIM, 91 landslides were plotted and classified into training (64 landslides) and testing (27 landslides) subsets randomly to train and validate the models. A total of 21 landslide conditioning factors (LCFs) were considered as model inputs, and the results of each model were categorised under five susceptibility classes. The receiver operating characteristics curve and 21 statistical measures were used to evaluate and prioritise the models. The CNN deep learning model achieved the priority rank 1 with area under the curve of 0.918 and 0.933 by using the training and testing data, quantifying 23.02% and 14.40% area as very high and highly susceptible followed by ANN, ADtree, CART, FTree and LMT models. This research might be useful in landslide studies, especially in locations with comparable geophysical and climatological characteristics, to aid in decision making for land use planning.

Download Full-text

A New DNA-Based Approach to Solve the Maximum Weight Clique Problem

Computational Intelligence and Bioinformatics - Lecture Notes in Computer Science ◽

10.1007/11816102_35 ◽

2006 ◽

pp. 320-327 ◽

Cited By ~ 3

Author(s):

Aili Han ◽

Daming Zhu

Keyword(s):

Maximum Weight ◽

Maximum Weight Clique ◽

Clique Problem

Download Full-text

Machine Learning-Based Gully Erosion Susceptibility Mapping: A Case Study of Eastern India

Sensors ◽

10.3390/s20051313 ◽

2020 ◽

Vol 20 (5) ◽

pp. 1313 ◽

Cited By ~ 15

Author(s):

Sunil Saha ◽

Jagabandhu Roy ◽

Alireza Arabameri ◽

Thomas Blaschke ◽

Dieu Tien Bui

Keyword(s):

Machine Learning ◽

Mean Squared Error ◽

Absolute Error ◽

Gully Erosion ◽

Machine Learning Techniques ◽

Weight Of Evidence ◽

Validation Dataset ◽

Boosted Regression Tree ◽

Area Index ◽

Statistical Measures

Gully erosion is a form of natural disaster and one of the land loss mechanisms causing severe problems worldwide. This study aims to delineate the areas with the most severe gully erosion susceptibility (GES) using the machine learning techniques Random Forest (RF), Gradient Boosted Regression Tree (GBRT), Naïve Bayes Tree (NBT), and Tree Ensemble (TE). The gully inventory map (GIM) consists of 120 gullies. Of the 120 gullies, 84 gullies (70%) were used for training and 36 gullies (30%) were used to validate the models. Fourteen gully conditioning factors (GCFs) were used for GES modeling and the relationships between the GCFs and gully erosion was assessed using the weight-of-evidence (WofE) model. The GES maps were prepared using RF, GBRT, NBT, and TE and were validated using area under the receiver operating characteristic (AUROC) curve, the seed cell area index (SCAI) and five statistical measures including precision (PPV), false discovery rate (FDR), accuracy, mean absolute error (MAE), and root mean squared error (RMSE). Nearly 7% of the basin has high to very high susceptibility for gully erosion. Validation results proved the excellent ability of these models to predict the GES. Of the analyzed models, the RF (AUROC = 0.96, PPV = 1.00, FDR = 0.00, accuracy = 0.87, MAE = 0.11, RMSE = 0.19 for validation dataset) is accurate enough for modeling and better suited for GES modeling than the other models. Therefore, the RF model can be used to model the GES areas not only in this river basin but also in other areas with the same geo-environmental conditions.

Download Full-text

Landslide Susceptibility Mapping Using the Stacking Ensemble Machine Learning Method in Lushui, Southwest China

Applied Sciences ◽

10.3390/app10114016 ◽

2020 ◽

Vol 10 (11) ◽

pp. 4016 ◽

Cited By ~ 3

Author(s):

Xudong Hu ◽

Han Zhang ◽

Hongbo Mei ◽

Dunhui Xiao ◽

Yuanyuan Li ◽

...

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Southwest China ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Machine Learning Method ◽

Learning Method ◽

Statistical Measures ◽

Ensemble Machine Learning

Landslide susceptibility mapping is considered to be a prerequisite for landslide prevention and mitigation. However, delineating the spatial occurrence pattern of the landslide remains a challenge. This study investigates the potential application of the stacking ensemble learning technique for landslide susceptibility assessment. In particular, support vector machine (SVM), artificial neural network (ANN), logical regression (LR), and naive Bayes (NB) were selected as base learners for the stacking ensemble method. The resampling scheme and Pearson’s correlation analysis were jointly used to evaluate the importance level of these base learners. A total of 388 landslides and 12 conditioning factors in the Lushui area (Southwest China) were used as the dataset to develop landslide modeling. The landslides were randomly separated into two parts, with 70% used for model training and 30% used for model validation. The models’ performance was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC) and statistical measures. The results showed that the stacking-based ensemble model achieved an improved predictive accuracy as compared to the single algorithms, while the SVM-ANN-NB-LR (SANL) model, the SVM-ANN-NB (SAN) model, and the ANN-NB-LR (ANL) models performed equally well, with AUC values of 0.931, 0.940, and 0.932, respectively, for validation stage. The correlation coefficient between the LR and SVM was the highest for all resampling rounds, with a value of 0.72 on average. This connotes that LR and SVM played an almost equal role when the ensemble of SANL was applied for landslide susceptibility analysis. Therefore, it is feasible to use the SAN model or the ANL model for the study area. The finding from this study suggests that the stacking ensemble machine learning method is promising for landslide susceptibility mapping in the Lushui area and is capable of targeting areas prone to landslides.

Download Full-text

A Machine Learning-Assisted Numerical Predictor for Compressive Strength of Geopolymer Concrete Based on Experimental Data and Sensitivity Analysis

Applied Sciences ◽

10.3390/app10217726 ◽

2020 ◽

Vol 10 (21) ◽

pp. 7726

Author(s):

An Thao Huynh ◽

Quang Dang Nguyen ◽

Qui Lieu Xuan ◽

Bryan Magee ◽

TaeChoong Chung ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Sensitivity Analysis ◽

Compressive Strength ◽

Fly Ash ◽

Sodium Hydroxide ◽

Engineering Properties ◽

Percentage Error ◽

Geopolymer Concrete ◽

Statistical Measures

Geopolymer concrete offers a favourable alternative to conventional Portland concrete due to its reduced embodied carbon dioxide (CO2) content. Engineering properties of geopolymer concrete, such as compressive strength, are commonly characterised based on experimental practices requiring large volumes of raw materials, time for sample preparation, and costly equipment. To help address this inefficiency, this study proposes machine learning-assisted numerical methods to predict compressive strength of fly ash-based geopolymer (FAGP) concrete. Methods assessed included artificial neural network (ANN), deep neural network (DNN), and deep residual network (ResNet), based on experimentally collected data. Performance of the proposed approaches were evaluated using various statistical measures including R-squared (R2), root mean square error (RMSE), and mean absolute percentage error (MAPE). Sensitivity analysis was carried out to identify effects of the following six input variables on the compressive strength of FAGP concrete: sodium hydroxide/sodium silicate ratio, fly ash/aggregate ratio, alkali activator/fly ash ratio, concentration of sodium hydroxide, curing time, and temperature. Fly ash/aggregate ratio was found to significantly affect compressive strength of FAGP concrete. Results obtained indicate that the proposed approaches offer reliable methods for FAGP design and optimisation. Of note was ResNet, which demonstrated the highest R2 and lowest RMSE and MAPE values.

Download Full-text

Spatial Modeling of Snow Avalanche Using Machine Learning Models and Geo-Environmental Factors: Comparison of Effectiveness in Two Mountain Regions

Remote Sensing ◽

10.3390/rs11242995 ◽

2019 ◽

Vol 11 (24) ◽

pp. 2995 ◽

Cited By ~ 10

Author(s):

Omid Rahmati ◽

Omid Ghorbanzadeh ◽

Teimur Teimurian ◽

Farnoush Mohammadi ◽

John P. Tiefenbacher ◽

...

Keyword(s):

Machine Learning ◽

Goodness Of Fit ◽

Snow Avalanche ◽

Slope Position ◽

Support Vector ◽

Hazard Mapping ◽

Ensemble Model ◽

Mountainous Regions ◽

Avalanche Hazard ◽

Statistical Measures

Although snow avalanches are among the most destructive natural disasters, and result in losses of life and economic damages in mountainous regions, far too little attention has been paid to the prediction of the snow avalanche hazard using advanced machine learning (ML) models. In this study, the applicability and efficiency of four ML models: support vector machine (SVM), random forest (RF), naïve Bayes (NB) and generalized additive model (GAM), for snow avalanche hazard mapping, were evaluated. Fourteen geomorphometric, topographic and hydrologic factors were selected as predictor variables in the modeling. This study was conducted in the Darvan and Zarrinehroud watersheds of Iran. The goodness-of-fit and predictive performance of the models was evaluated using two statistical measures: the area under the receiver operating characteristic curve (AUROC) and the true skill statistic (TSS). Finally, an ensemble model was developed based upon the results of the individual models. Results show that, among individual models, RF was best, performing well in both the Darvan (AUROC = 0.964, TSS = 0.862) and Zarrinehroud (AUROC = 0.956, TSS = 0.881) watersheds. The accuracy of the ensemble model was slightly better than all individual models for generating the snow avalanche hazard map, as validation analyses showed an AUROC = 0.966 and a TSS = 0.865 in the Darvan watershed, and an AUROC value of 0.958 and a TSS value of 0.877 for the Zarrinehroud watershed. The results indicate that slope length, lithology and relative slope position (RSP) are the most important factors controlling snow avalanche distribution. The methodology developed in this study can improve risk-based decision making, increases the credibility and reliability of snow avalanche hazard predictions and can provide critical information for hazard managers.

Download Full-text

Grenade Explosion Method for Maximum Weight Clique Problem

Communications in Computer and Information Science - Contemporary Computing ◽

10.1007/978-3-642-32129-0_8 ◽

2012 ◽

pp. 20-27

Author(s):

Manohar Pallantla ◽

Alok Singh

Keyword(s):

Maximum Weight ◽

Explosion Method ◽

Maximum Weight Clique ◽

Clique Problem

Download Full-text

Novel GIS Based Machine Learning Algorithms for Shallow Landslide Susceptibility Mapping

Sensors ◽

10.3390/s18113777 ◽

2018 ◽

Vol 18 (11) ◽

pp. 3777 ◽

Cited By ~ 61

Author(s):

Ataollah Shirzadi ◽

Karim Soliamani ◽

Mahmood Habibnejhad ◽

Ataollah Kavian ◽

Kamran Chapi ◽

...

Keyword(s):

Machine Learning ◽

Sample Size ◽

Prediction Accuracy ◽

Goodness Of Fit ◽

Learning Algorithm ◽

Machine Learning Algorithms ◽

Landslide Susceptibility Mapping ◽

Sample Sizes ◽

Promising Alternative ◽

Statistical Measures

The main objective of this research was to introduce a novel machine learning algorithm of alternating decision tree (ADTree) based on the multiboost (MB), bagging (BA), rotation forest (RF) and random subspace (RS) ensemble algorithms under two scenarios of different sample sizes and raster resolutions for spatial prediction of shallow landslides around Bijar City, Kurdistan Province, Iran. The evaluation of modeling process was checked by some statistical measures and area under the receiver operating characteristic curve (AUROC). Results show that, for combination of sample sizes of 60%/40% and 70%/30% with a raster resolution of 10 m, the RS model, while, for 80%/20% and 90%/10% with a raster resolution of 20 m, the MB model obtained a high goodness-of-fit and prediction accuracy. The RS-ADTree and MB-ADTree ensemble models outperformed the ADTree model in two scenarios. Overall, MB-ADTree in sample size of 80%/20% with a resolution of 20 m (area under the curve (AUC) = 0.942) and sample size of 60%/40% with a resolution of 10 m (AUC = 0.845) had the highest and lowest prediction accuracy, respectively. The findings confirm that the newly proposed models are very promising alternative tools to assist planners and decision makers in the task of managing landslide prone areas.

Download Full-text