Post-stratification as a tool for enhancing the predictive power of classification methods

2021 ◽  
pp. 125-130
Author(s):  
Francesco D. d'Ovidio ◽  
Angela Maria D'Uggento ◽  
Rossana Mancarella ◽  
Ernesto Toma

It is well known that, in classification problems, the predictive capacity of any decision-making model decreases rapidly with increasing asymmetry of the target variable (Sonquist et al., 1973; Fielding 1977). In particular, in segmentation analysis with a categorical target variable, very poor improvements of purity are obtained when the least represented modality counts less than 1/4 of the cases of the most represented modality. The same problem arises with other (theoretically more exhaustive) techniques such as Artificial Neural Networks. Actually, the optimal situation for classification analyses is the maximum uncertainty, that is, equidistribution of the target variable. Some classification techniques are more robust, by using, for example, the less sensitive logit transformation of the target variable (Fabbris & Martini 2002); however, also the logit transformation is strongly affected by the distributive asymmetry of the target variable. In this paper, starting from the results of a direct survey in which the target (binary) variable was extremely asymmetrical (10% vs. 90%, or greater asymmetry), we noted that also the logit model with the most significant parameters had very reduced fitting measures and almost zero predictive power. To solve this predictive issue, we tested post-stratification techniques, artificially symmetrizing a training sample. In this way, a substantially increase of fitting and predictive capacity was achieved, both in the symmetrized sample and, above all, in the original sample. In conclusion of the paper, an application of the same technique to a dataset of very different nature and size is described, demonstrating that the method is stable even in the case of analysis executed with all data of a population.

2021 ◽  
Vol 3 (1) ◽  
Author(s):  
Meisam Ghasedi ◽  
Maryam Sarfjoo ◽  
Iraj Bargegol

AbstractThe purpose of this study is to investigate and determine the factors affecting vehicle and pedestrian accidents taking place in the busiest suburban highway of Guilan Province located in the north of Iran and provide the most accurate prediction model. Therefore, the effective principal variables and the probability of occurrence of each category of crashes are analyzed and computed utilizing the factor analysis, logit, and Machine Learning approaches simultaneously. This method not only could contribute to achieving the most comprehensive and efficient model to specify the major contributing factor, but also it can provide officials with suggestions to take effective measures with higher precision to lessen accident impacts and improve road safety. Both the factor analysis and logit model show the significant roles of exceeding lawful speed, rainy weather and driver age (30–50) variables in the severity of vehicle accidents. On the other hand, the rainy weather and lighting condition variables as the most contributing factors in pedestrian accidents severity, underline the dominant role of environmental factors in the severity of all vehicle-pedestrian accidents. Moreover, considering both utilized methods, the machine-learning model has higher predictive power in all cases, especially in pedestrian accidents, with 41.6% increase in the predictive power of fatal accidents and 12.4% in whole accidents. Thus, the Artificial Neural Network model is chosen as the superior approach in predicting the number and severity of crashes. Besides, the good performance and validation of the machine learning is proved through performance and sensitivity analysis.


2006 ◽  
Vol 60 (6) ◽  
Author(s):  
C. Anghel ◽  
A. Ozunu

AbstractA novel technique based on artificial intelligence methods able to predict pollutant emission concentrations from industrial stacks is presented. This procedure combines regression and classification problems into a unified technique, named minimax decision procedure. The core of this procedure is based on the minimax probability machine regression model. Using experimental databases, the trend of pollutant emissions and the level of pollution for one industrial thermal power station stack were presented. Based on this unified technique, numerical experiments provided the estimates of concentrations of CO, NOx, NO, and SO2 confirming the predictive power of this procedure.


2018 ◽  
Vol 71 (5) ◽  
pp. 2353-2358
Author(s):  
Daniel Bruno Resende Chaves ◽  
Lívia Maia Pascoal ◽  
Beatriz Amorim Beltrão ◽  
Marília Mendes Nunes ◽  
Tânia Alteniza Leandro ◽  
...  

ABSTRACT Objective: to identify the defining characteristics of Ineffective airway clearance with better predictive power using classification trees. Method: the predictive power of the defining characteristics of Ineffective airway clearance was evaluated based on classification trees generated from the data of 249 children with acute respiratory infection. Results: Ineffective cough and adventitious breath sounds were identified as the main defining characteristics when screening for Ineffective airway clearance in accordance with trees based on three different computational algorithms. Conclusion: Ineffective coughing and adventitious breath sounds had better predictive capacity for Ineffective airway clearance in the sample.


2014 ◽  
Vol 20 (1) ◽  
pp. 65-82 ◽  
Author(s):  
María-Jesús Presentación ◽  
Rebeca Siegenthaler ◽  
Vicente Pinto ◽  
Jessica Mercader ◽  
Ana Miranda

<p>This study compares the relationship between executive functioning, analyzed with clinical and ecological tests, and math skills in preschoolers. The children (255 children 5 to 6 years old) were evaluated using neuropsychological tests of inhibition, and working memory and the TEDI-MATH to estimate basic mathematical skills. The ecological evaluation of the executive functioning by the parents and teachers was carried out with the Behavioral Rating Inventory of Executive Function (BRIEF). Compared to the ecological ratings, the neuropsychological measures show more correlations with math skills and a greater predictive capacity. The teachers’ BRIEF results were superior to those of the parents. In all cases, working memory is the process that shows the greatest predictive power.</p>


2017 ◽  
Vol 18 (5) ◽  
pp. 523-540 ◽  
Author(s):  
Andreas Behr ◽  
Jurij Weinblat

Purpose The purpose of this paper is to do a performance comparison of three different data mining techniques. Design/methodology/approach Logit model, decision tree and random forest are applied in this study on British, French, German, Italian, Portuguese and Spanish balance sheet data from 2006 to 2012, which covers 446,464 firms. Because of the strong imbalance with regard to the solvency status, classification trees and random forests are modified to adapt to this imbalance. All three model specifications are optimized extensively using resampling techniques, relying on the training sample only. Model performance is assessed, strictly, based on out-of-sample predictions. Findings Random forest is found to strongly outperform the classification tree and the logit model in almost all considered years and countries, according to the quality measure in this study. Originality/value Obtaining reliable estimates of default propensity scores is of immense importance for potential credit grantors, portfolio managers and regulatory authorities. As the overwhelming majority of firms are not listed on stock exchanges, annual balance sheets still provide the most important source of information. The obtained ranking of the three models according to their predictive performance is relatively robust, due to the consideration of several countries and a relatively long time period.


Author(s):  
Ali Shamshiripour ◽  
Nima Golshani ◽  
Ramin Shabanpour ◽  
Abolfazl (Kouros) Mohammadian

Modeling travelers’ mode choice behavior is an important component of travel demand studies. In an effort to account for day-to-day dynamics of travelers’ mode choice behavior, the current study develops a dynamic random effects logit model to endogenously incorporate the mode chosen for a day into the utility function of the mode chosen for the following day. A static multinomial logit model is also estimated to examine the performance of the dynamic model. Per the results, the dynamic random effects model outperforms the static model in relation to predictive power. According to the accuracy indices, the dynamic random effects model offers the predictive power of 60.0% for members of car-deficient households, whereas the static model is limited to 43.1%. Also, comparison of F1-scores indicates that the predictive power of the dynamic random effects model with respect to active travels is 47.1% whereas that of the static model is as low as 15.0%. The results indicate a significant day-to-day dynamic behavior of transit users and active travelers. This pattern is found to be true in general, but not for members of car-deficient households, who are found more likely to choose the same mode for two successive days.


2021 ◽  
Vol 33 (1) ◽  
pp. 174-193
Author(s):  
Yu-Wei Kao ◽  
Hung-Hsuan Chen

Backpropagation (BP) is the cornerstone of today's deep learning algorithms, but it is inefficient partially because of backward locking, which means updating the weights of one layer locks the weight updates in the other layers. Consequently, it is challenging to apply parallel computing or a pipeline structure to update the weights in different layers simultaneously. In this letter, we introduce a novel learning structure, associated learning (AL), that modularizes the network into smaller components, each of which has a local objective. Because the objectives are mutually independent, AL can learn the parameters in different layers independently and simultaneously, so it is feasible to apply a pipeline structure to improve the training throughput. Specifically, this pipeline structure improves the complexity of the training time from [Formula: see text], which is the time complexity when using BP and stochastic gradient descent (SGD) for training, to [Formula: see text], where [Formula: see text] is the number of training instances and [Formula: see text] is the number of hidden layers. Surprisingly, even though most of the parameters in AL do not directly interact with the target variable, training deep models by this method yields accuracies comparable to those from models trained using typical BP methods, in which all parameters are used to predict the target variable. Consequently, because of the scalability and the predictive power demonstrated in the experiments, AL deserves further study to determine the better hyperparameter settings, such as activation function selection, learning rate scheduling, and weight initialization, to accumulate experience, as we have done over the years with the typical BP method. In addition, perhaps our design can also inspire new network designs for deep learning. Our implementation is available at https://github.com/SamYWK/Associated_Learning .


Sign in / Sign up

Export Citation Format

Share Document