Obtaining predicted values of the demographic process using machine learning methods
Research and analysis of demographic processes play an important role in many areas. For this, the population size and key factors from 1994 to 2019 were selected on the statistical website of the Republic of Kazakhstan. Demographics were population size, fertility, mortality, divorce, and migration. The factors of the standard of living were the number of unemployed and the average monthly salary, while the medical factors were the hospital organizations, the number of hospital beds and the number of doctors of all specialties. In the course of regression analysis, a correlation was obtained and multicollinear factors were identified. We used four different machine learning models from the Scikit-Learn library to generate population estimates. Regression models were evaluated using the quality score. As a result, linear regression and random forest models performed well.