scholarly journals Development of Predictive Models of Socio-Economic Systems Based on Decision Trees with Multivariate Response

Author(s):  
A.N. Kislyakov ◽  
N.M. Filimonova ◽  
N.Yu. Omarova
2016 ◽  
Vol 8 (4) ◽  
pp. 73-77
Author(s):  
Стародубцев ◽  
Viktor Starodubtsev ◽  
Асташов ◽  
Ya. Astashov

The article is the use of self-organizing methods to identify processes in the socio-economic systems of transport companies. Through the prism of a systematic approach is considered a sequence make effective management decisions based on predictive models of the processes of socio-economic system of motor transport enterprise for sustainable development of production.


2019 ◽  
Vol 27 (5) ◽  
pp. 1053-1063 ◽  
Author(s):  
David M. Barnard ◽  
Matthew J. Germino ◽  
David S. Pilliod ◽  
Robert S. Arkle ◽  
Cara Applestein ◽  
...  

2019 ◽  
Vol 27 (3) ◽  
pp. 381-387 ◽  
Author(s):  
Aaron Russell Kaufman ◽  
Peter Kraft ◽  
Maya Sen

Though used frequently in machine learning, boosted decision trees are largely unused in political science, despite many useful properties. We explain how to use one variant of boosted decision trees, AdaBoosted decision trees (ADTs), for social science predictions. We illustrate their use by examining a well-known political prediction problem, predicting U.S. Supreme Court rulings. We find that our ADT approach outperforms existing predictive models. We also provide two additional examples of the approach, one predicting the onset of civil wars and the other predicting county-level vote shares in U.S. presidential elections.


2018 ◽  
Vol 6 (1) ◽  
pp. 377-407 ◽  
Author(s):  
Zhiyu Quan ◽  
Emiliano A. Valdez

AbstractBecause of its many advantages, the use of decision trees has become an increasingly popular alternative predictive tool for building classification and regression models. Its origins date back for about five decades where the algorithm can be broadly described by repeatedly partitioning the regions of the explanatory variables and thereby creating a tree-based model for predicting the response. Innovations to the original methods, such as random forests and gradient boosting, have further improved the capabilities of using decision trees as a predictive model. In addition, the extension of using decision trees with multivariate response variables started to develop and it is the purpose of this paper to apply multivariate tree models to insurance claims data with correlated responses. This extension to multivariate response variables inherits several advantages of the univariate decision tree models such as distribution-free feature, ability to rank essential explanatory variables, and high predictive accuracy, to name a few. To illustrate the approach, we analyze a dataset drawn from the Wisconsin Local Government Property Insurance Fund (LGPIF)which offers multi-line insurance coverage of property, motor vehicle, and contractors’ equipments.With multivariate tree models, we are able to capture the inherent relationship among the response variables and we find that the marginal predictive model based on multivariate trees is an improvement in prediction accuracy from that based on simply the univariate trees.


2021 ◽  
Author(s):  
Aleksandr Medvedev ◽  
Satyarth Mishra Sharma ◽  
Evgenii Tsatsorin ◽  
Elena Nabieva ◽  
Dmitry Yarotsky

Genotype-to-phenotype prediction is a central problem of human genetics. In recent years, it has become possible to construct complex predictive models for phenotypes, thanks to the availability of large genome data sets as well as efficient and scalable machine learning tools. In this paper, we make a three-fold contribution to this problem. First, we ask if state-of-the-art nonlinear predictive models, such as boosted decision trees, can be more efficient for phenotype prediction than conventional linear models. We find that this is indeed the case if model features include a sufficiently rich set of covariates, but probably not otherwise. Second, we ask if the conventional selection of single nucleotide polymorphisms (SNPs) by genome wide association studies (GWAS) can be replaced by a more efficient procedure, taking into account information in previously selected SNPs. We propose such a procedure, based on a sequential feature importance estimation with decision trees, and show that this approach indeed produced informative SNP sets that are much more compact than when selected with GWAS. Finally, we show that the highest prediction accuracy can ultimately be achieved by ensembling individual linear and nonlinear models. To the best of our knowledge, for some of the phenotypes that we consider (asthma, hypothyroidism), our results are a new state-of-the-art.


2019 ◽  
pp. 123-144
Author(s):  
Richard V. McCarthy ◽  
Mary M. McCarthy ◽  
Wendy Ceccucci ◽  
Leila Halawi

Sign in / Sign up

Export Citation Format

Share Document