The effect of feature selection on financial distress prediction

2015 ◽  
Vol 73 ◽  
pp. 289-297 ◽  
Author(s):  
Deron Liang ◽  
Chih-Fong Tsai ◽  
Hsin-Ting Wu
Mathematics ◽  
2020 ◽  
Vol 8 (8) ◽  
pp. 1275
Author(s):  
Dawen Yan ◽  
Guotai Chi ◽  
Kin Keung Lai

In this paper, we propose a new framework of a financial early warning system through combining the unconstrained distributed lag model (DLM) and widely used financial distress prediction models such as the logistic model and the support vector machine (SVM) for the purpose of improving the performance of an early warning system for listed companies in China. We introduce simultaneously the 3~5-period-lagged financial ratios and macroeconomic factors in the consecutive time windows t − 3, t − 4 and t − 5 to the prediction models; thus, the influence of the early continued changes within and outside the company on its financial condition is detected. Further, by introducing lasso penalty into the logistic-distributed lag and SVM-distributed lag frameworks, we implement feature selection and exclude the potentially redundant factors, considering that an original long list of accounting ratios is used in the financial distress prediction context. We conduct a series of comparison analyses to test the predicting performance of the models proposed by this study. The results show that our models outperform logistic, SVM, decision tree and neural network (NN) models in a single time window, which implies that the models incorporating indicator data in multiple time windows convey more information in terms of financial distress prediction when compared with the existing singe time window models.


2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Sen Zeng ◽  
Yaqin Li ◽  
Wanjun Yang ◽  
Yanru Li

Classification learning is a very important issue in machine learning, which has been widely used in the field of financial distress warning. Some researches show that the prediction model framework based on sparse algorithm has better performance than the traditional model. In this paper, we explore the financial distress prediction based on grouping sparsity. Feature selection of sparse algorithm plays an important role in classification learning, because many redundant and irrelevant features will degrade performance. A good feature selection algorithm would reduce computational complexity and improve classification accuracy. In this study, we propose an algorithm for feature selection classification prediction based on feature attributes and data source grouping. The existing financial distress prediction model usually only uses the data from financial statement and ignores the timeliness of company sample in practice. Therefore, we propose a corporate financial distress prediction model that is better in line with the practice and combines the grouping sparse principal component analysis of financial data, corporate governance characteristics, and market transaction data with support vector machine. Experimental results show that this method can improve the prediction efficiency of financial distress with fewer characteristic variables.


2014 ◽  
Vol 41 (5) ◽  
pp. 2472-2483 ◽  
Author(s):  
Fengyi Lin ◽  
Deron Liang ◽  
Ching-Chiang Yeh ◽  
Jui-Chieh Huang

2021 ◽  
Vol 14 (7) ◽  
pp. 333
Author(s):  
Shilpa H. Shetty ◽  
Theresa Nithila Vincent

The study aimed to investigate the role of non-financial measures in predicting corporate financial distress in the Indian industrial sector. The proportion of independent directors on the board and the proportion of the promoters’ share in the ownership structure of the business were the non-financial measures that were analysed, along with ten financial measures. For this, sample data consisted of 82 companies that had filed for bankruptcy under the Insolvency and Bankruptcy Code (IBC). An equal number of matching financially sound companies also constituted the sample. Therefore, the total sample size was 164 companies. Data for five years immediately preceding the bankruptcy filing was collected for the sample companies. The data of 120 companies evenly drawn from the two groups of companies were used for developing the model and the remaining data were used for validating the developed model. Two binary logistic regression models were developed, M1 and M2, where M1 was formulated with both financial and non-financial variables, and M2 only had financial variables as predictors. The diagnostic ability of the model was tested with the aid of the receiver operating curve (ROC), area under the curve (AUC), sensitivity, specificity and annual accuracy. The results of the study show that inclusion of the two non-financial variables improved the efficacy of the financial distress prediction model. This study made a unique attempt to provide empirical evidence on the role played by non-financial variables in improving the efficiency of corporate distress prediction models.


Symmetry ◽  
2021 ◽  
Vol 13 (3) ◽  
pp. 443
Author(s):  
Chyan-long Jan

Because of the financial information asymmetry, the stakeholders usually do not know a company’s real financial condition until financial distress occurs. Financial distress not only influences a company’s operational sustainability and damages the rights and interests of its stakeholders, it may also harm the national economy and society; hence, it is very important to build high-accuracy financial distress prediction models. The purpose of this study is to build high-accuracy and effective financial distress prediction models by two representative deep learning algorithms: Deep neural networks (DNN) and convolutional neural networks (CNN). In addition, important variables are selected by the chi-squared automatic interaction detector (CHAID). In this study, the data of Taiwan’s listed and OTC sample companies are taken from the Taiwan Economic Journal (TEJ) database during the period from 2000 to 2019, including 86 companies in financial distress and 258 not in financial distress, for a total of 344 companies. According to the empirical results, with the important variables selected by CHAID and modeling by CNN, the CHAID-CNN model has the highest financial distress prediction accuracy rate of 94.23%, and the lowest type I error rate and type II error rate, which are 0.96% and 4.81%, respectively.


Sign in / Sign up

Export Citation Format

Share Document