scholarly journals Spatial prediction models for landslide hazards: review, comparison and evaluation

2005 ◽  
Vol 5 (6) ◽  
pp. 853-862 ◽  
Author(s):  
A. Brenning

Abstract. The predictive power of logistic regression, support vector machines and bootstrap-aggregated classification trees (bagging, double-bagging) is compared using misclassification error rates on independent test data sets. Based on a resampling approach that takes into account spatial autocorrelation, error rates for predicting "present" and "future" landslides are estimated within and outside the training area. In a case study from the Ecuadorian Andes, logistic regression with stepwise backward variable selection yields lowest error rates and demonstrates the best generalization capabilities. The evaluation outside the training area reveals that tree-based methods tend to overfit the data.

Author(s):  
Cemil Kuzey ◽  
Ali Uyar ◽  
Dursun Delen

Purpose The paper aims to identify and critically analyze the factors influencing cost system functionality (CSF) using several machine learning techniques including decision trees, support vector machines and logistic regression. Design/methodology/approach The study used a self-administered survey method to collect the necessary data from companies conducting business in Turkey. Several prediction models are developed and tested; a series of sensitivity analyses is performed on the developed prediction models to assess the ranked importance of factors/variables. Findings Certain factors/variables influence CSF much more than others. The findings of the study suggest that utilization of management accounting practices require a functional cost system, which is supported by a comprehensive cost data management process (i.e. acquisition, storage and utilization). Research limitations/implications The underlying data were collected using a questionnaire survey; thus, it is subjective which reflects the perceptions of the respondents. Ideally, it is expected to reflect the objective of the practices of the firms. Second, the authors have measured CSF it on a “Yes” or “No” basis which does not allow survey respondents reply in between them; thus, it might have limited the choices of the respondents. Third, the Likert scales adopted in the measurement of the other constructs might be limiting the answers of the respondents. Practical implications Information technology plays a very important role for the success of CSF practices. That is, successful implementation of a functional cost system relies heavily on a fully integrated information infrastructure capable of constantly feeding CSF with accurate, relevant and timely data. Originality/value In addition to providing evidence regarding the factors underlying CSF based on a broad range of industries interesting finding, this study also illustrates the viability of machine learning methods as a research framework to critically analyze domain specific data.


Author(s):  
Rebecca P. Ang ◽  
Dion H. Goh

In this study, the authors compared logistic regression and predictive data mining techniques such as decision trees (DTs), artificial neural networks (ANNs), and support vector machines (SVMs), and examined these methods on whether they could discriminate between adolescents who were charged or not charged for initial juvenile offending in a large Asian sample. Results were validated and tested in independent samples with logistic regression and DT, ANN, and SVM classifiers achieving accuracy rates of 95% and above. Findings from receiver operating characteristic analyses also supported these results. In addition, the authors examined distinct patterns of occurrences within and across classifiers. Proactive aggression and teacher-rated conflict consistently emerged as risk factors across validation and testing data sets of DT and ANN classifiers, and logistic regression. Reactive aggression, narcissistic exploitativeness, being male, and coming from a nonintact family were risk factors that emerged in one or more of these data sets across classifiers, while anxiety and poor peer relationships failed to emerge as predictors.


2012 ◽  
Vol 24 (4) ◽  
pp. 1047-1084 ◽  
Author(s):  
Xiao-Tong Yuan ◽  
Shuicheng Yan

We investigate Newton-type optimization methods for solving piecewise linear systems (PLSs) with nondegenerate coefficient matrix. Such systems arise, for example, from the numerical solution of linear complementarity problem, which is useful to model several learning and optimization problems. In this letter, we propose an effective damped Newton method, PLS-DN, to find the exact (up to machine precision) solution of nondegenerate PLSs. PLS-DN exhibits provable semiiterative property, that is, the algorithm converges globally to the exact solution in a finite number of iterations. The rate of convergence is shown to be at least linear before termination. We emphasize the applications of our method in modeling, from a novel perspective of PLSs, some statistical learning problems such as box-constrained least squares, elitist Lasso (Kowalski & Torreesani, 2008 ), and support vector machines (Cortes & Vapnik, 1995 ). Numerical results on synthetic and benchmark data sets are presented to demonstrate the effectiveness and efficiency of PLS-DN on these problems.


Complexity ◽  
2018 ◽  
Vol 2018 ◽  
pp. 1-10 ◽  
Author(s):  
Hai Wei ◽  
Mingming Wang ◽  
Bingyue Song ◽  
Xin Wang ◽  
Danlei Chen

An effective approach is introduced to predict the magnitude of reservoir-triggered earthquake (RTE), based on support vector machines (SVM) and fuzzy support vector machines (FSVM) methods. The main influence factors on RTE, including lithology, rock mass integrity, fault features, tectonic stress state, and seismic activity background in reservoir area, are categorized into 11 parameters and quantified by using analytical hierarchy process (AHP). Dataset on 100 reservoirs in China, including the 48 well-documented cases of RTE, are collected and used to train and validate the prediction models established with SVM and FSVM, respectively. Through numerical tests, it is found that both the SVM and FSVM models are effective in the prediction of the magnitude of RTE with high accuracy, provided that sufficient samples are collected. While the results of FSVM which is extended from SVM by introducing a fuzzy membership to reduce the influence of noises or outliers are found to be slightly less accurate than those of SVM in the current analysis of RTE cases. The reason might be attributed to the high discreteness of the sample data in the current study.


Author(s):  
Michaela Staňková ◽  
David Hampel

This article focuses on the problem of binary classification of 902 small- and medium‑sized engineering companies active in the EU, together with additional 51 companies which went bankrupt in 2014. For classification purposes, the basic statistical method of logistic regression has been selected, together with a representative of machine learning (support vector machines and classification trees method) to construct models for bankruptcy prediction. Different settings have been tested for each method. Furthermore, the models were estimated based on complete data and also using identified artificial factors. To evaluate the quality of prediction we observe not only the total accuracy with the type I and II errors but also the area under ROC curve criterion. The results clearly show that increasing distance to bankruptcy decreases the predictive ability of all models. The classification tree method leads us to rather simple models. The best classification results were achieved through logistic regression based on artificial factors. Moreover, this procedure provides good and stable results regardless of other settings. Artificial factors also seem to be a suitable variable for support vector machines models, but classification trees achieved better results using original data.


2019 ◽  
Vol 07 (02) ◽  
pp. 1950001
Author(s):  
THABANG MOKOALELI-MOKOTELI ◽  
SHAUN RAMSUMAR ◽  
HIMA VADAPALLI

The success of investors in obtaining huge financial rewards from the stock market depends on their ability to predict the direction of the stock market index. The purpose of this study is to evaluate the efficacy of several ensemble prediction models (Boosted, RUS-Boosted, Subspace Disc, Bagged, and Subspace KNN) in predicting the daily direction of the Johannesburg Stock Exchange (JSE) All-Share index compared to other commonly used machine learning techniques including support vector machines (SVM), logistic regression and [Formula: see text]-nearest neighbor (KNN). The findings in this study show that, among all ensemble models, Boosted algorithm is the best performer followed by RUS-Boosted. When compared to the other techniques, ensemble technique (represented by Boosted) outperformed these techniques, followed by KNN, logistic regression and SVM, respectively. These findings suggest that investors should include ensemble models among the index prediction models if they want to make huge profits in the stock markets. However, not all investors can benefit from this as models may suffer from alpha decay as more and more investors use them, implying that the successful algorithms have limited shelf life.


Sign in / Sign up

Export Citation Format

Share Document