Research on enterprise financial economics early warning based on machine learning method

Author(s):  
Jian Yi

The stability of the economic market is an important factor for the rapid development of the economy, especially for the listed companies, whose financial and economic stability affects the stability of the financial market. It is helpful for the healthy development of enterprises and financial markets to make an accurate early warning of the financial economy of listed enterprises. This paper briefly introduced the support vector machine (SVM) and back-propagation neural network (BPNN) algorithms in the machine learning method. To make up for the defects of the two algorithms, they were combined and applied to the enterprise financial economics early warning. A simulation experiment was carried out on the single SVM algorithm-based, single BPNN algorithm-based, and SVM algorithm and BPNN algorithm combined model with the MATLAB software. The results show that the SVM algorithm and BP algorithm combined model converges faster and has higher precision and recall rate and larger area under the curve (AUC) than the single SVM algorithm-based model and the single BPNN algorithm-based model.

2021 ◽  
Vol 13 (1) ◽  
pp. 133
Author(s):  
Hao Sun ◽  
Yajing Cui

Downscaling microwave remotely sensed soil moisture (SM) is an effective way to obtain spatial continuous SM with fine resolution for hydrological and agricultural applications on a regional scale. Downscaling factors and functions are two basic components of SM downscaling where the former is particularly important in the era of big data. Based on machine learning method, this study evaluated Land Surface Temperature (LST), Land surface Evaporative Efficiency (LEE), and geographical factors from Moderate Resolution Imaging Spectroradiometer (MODIS) products for downscaling SMAP (Soil Moisture Active and Passive) SM products. This study spans from 2015 to the end of 2018 and locates in the central United States. Original SMAP SM and in-situ SM at sparse networks and core validation sites were used as reference. Experiment results indicated that (1) LEE presented comparative performance with LST as downscaling factors; (2) adding geographical factors can significantly improve the performance of SM downscaling; (3) integrating LST, LEE, and geographical factors got the best performance; (4) using Z-score normalization or hyperbolic-tangent normalization methods did not change the above conclusions, neither did using support vector regression nor feed forward neural network methods. This study demonstrates the possibility of LEE as an alternative of LST for downscaling SM when there is no available LST due to cloud contamination. It also provides experimental evidence for adding geographical factors in the downscaling process.


2020 ◽  
Vol 10 (11) ◽  
pp. 4016 ◽  
Author(s):  
Xudong Hu ◽  
Han Zhang ◽  
Hongbo Mei ◽  
Dunhui Xiao ◽  
Yuanyuan Li ◽  
...  

Landslide susceptibility mapping is considered to be a prerequisite for landslide prevention and mitigation. However, delineating the spatial occurrence pattern of the landslide remains a challenge. This study investigates the potential application of the stacking ensemble learning technique for landslide susceptibility assessment. In particular, support vector machine (SVM), artificial neural network (ANN), logical regression (LR), and naive Bayes (NB) were selected as base learners for the stacking ensemble method. The resampling scheme and Pearson’s correlation analysis were jointly used to evaluate the importance level of these base learners. A total of 388 landslides and 12 conditioning factors in the Lushui area (Southwest China) were used as the dataset to develop landslide modeling. The landslides were randomly separated into two parts, with 70% used for model training and 30% used for model validation. The models’ performance was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC) and statistical measures. The results showed that the stacking-based ensemble model achieved an improved predictive accuracy as compared to the single algorithms, while the SVM-ANN-NB-LR (SANL) model, the SVM-ANN-NB (SAN) model, and the ANN-NB-LR (ANL) models performed equally well, with AUC values of 0.931, 0.940, and 0.932, respectively, for validation stage. The correlation coefficient between the LR and SVM was the highest for all resampling rounds, with a value of 0.72 on average. This connotes that LR and SVM played an almost equal role when the ensemble of SANL was applied for landslide susceptibility analysis. Therefore, it is feasible to use the SAN model or the ANL model for the study area. The finding from this study suggests that the stacking ensemble machine learning method is promising for landslide susceptibility mapping in the Lushui area and is capable of targeting areas prone to landslides.


2014 ◽  
Vol 2014 ◽  
pp. 1-7 ◽  
Author(s):  
Xiaoyong Liu ◽  
Hui Fu

Disease diagnosis is conducted with a machine learning method. We have proposed a novel machine learning method that hybridizes support vector machine (SVM), particle swarm optimization (PSO), and cuckoo search (CS). The new method consists of two stages: firstly, a CS based approach for parameter optimization of SVM is developed to find the better initial parameters of kernel function, and then PSO is applied to continue SVM training and find the best parameters of SVM. Experimental results indicate that the proposed CS-PSO-SVM model achieves better classification accuracy and F-measure than PSO-SVM and GA-SVM. Therefore, we can conclude that our proposed method is very efficient compared to the previously reported algorithms.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Rulan Wang ◽  
Zhuo Wang ◽  
Hongfei Wang ◽  
Yuxuan Pang ◽  
Tzong-Yi Lee

AbstractLysine crotonylation (Kcr) is a type of protein post-translational modification (PTM), which plays important roles in a variety of cellular regulation and processes. Several methods have been proposed for the identification of crotonylation. However, most of these methods can predict efficiently only on histone or non-histone protein. Therefore, this work aims to give a more balanced performance in different species, here plant (non-histone) and mammalian (histone) are involved. SVM (support vector machine) and RF (random forest) were employed in this study. According to the results of cross-validations, the RF classifier based on EGAAC attribute achieved the best predictive performance which performs competitively good as existed methods, meanwhile more robust when dealing with imbalanced datasets. Moreover, an independent test was carried out, which compared the performance of this study and existed methods based on the same features or the same classifier. The classifiers of SVM and RF could achieve best performances with 92% sensitivity, 88% specificity, 90% accuracy, and an MCC of 0.80 in the mammalian dataset, and 77% sensitivity, 83% specificity, 70% accuracy and 0.54 MCC in a relatively small dataset of mammalian and a large-scaled plant dataset respectively. Moreover, a cross-species independent testing was also carried out in this study, which has proved the species diversity in plant and mammalian.


2021 ◽  
Author(s):  
Bu-Yo Kim ◽  
Joo Wan Cha ◽  
Ki-Ho Chang

Abstract. In this study, image data features and machine learning methods were used to calculate 24-h continuous cloud cover from image data obtained by a camera-based imager on the ground. The image data features were the time (Julian day and hour), solar zenith angle, and statistical characteristics of the red-blue ratio, blue–red difference, and luminance. These features were determined from the red, green, and blue brightness of images subjected to a pre-processing process involving masking removal and distortion correction. The collected image data were divided into training, validation, and test sets and were used to optimize and evaluate the accuracy of each machine learning method. The cloud cover calculated by each machine learning method was verified with human-eye observation data from a manned observatory. Supervised machine learning models suitable for nowcasting, namely, support vector regression, random forest, gradient boosting machine, k-nearest neighbor, artificial neural network, and multiple linear regression methods, were employed and their results were compared. The best learning results were obtained by the support vector regression model, which had an accuracy, recall, and precision of 0.94, 0.70, and 0.76, respectively. Further, bias, root mean square error, and correlation coefficient values of 0.04 tenth, 1.45 tenths, and 0.93, respectively, were obtained for the cloud cover calculated using the test set. When the difference between the calculated and observed cloud cover was allowed to range between 0, 1, and 2 tenths, high agreement of approximately 42 %, 79 %, and 91 %, respectively, were obtained. The proposed system involving a ground-based imager and machine learning methods is expected to be suitable for application as an automated system to replace human-eye observations.


Author(s):  
Qiaoman Yang ◽  
Chunyu Liu

Classification modeling is one of the key issues in sentiment analysis. Support vector machine (SVM) has been widely used in classification as an effective machine learning method. Generally, a common SVM is only for decision-making that sacrifices the distribution of data. In practice, sentiment data are big and mazy, which results in the deficiency of accuracy and stability when common SVM is used. The study investigates sentiment analysis by applying the twin objective function SVM, including nonparallel SVM(NPSVM) and twin SVM (TWSVM). From the experiments, we concluded that twin objective function SVMs are superior to NB and single objective function SVM in accuracy and stability.


2020 ◽  
Author(s):  
Yinxue Liu ◽  
Paul Bates ◽  
Jeffery Neal ◽  
Dai Yamazaki

<p>Precise representation of global terrain is of great significance for estimating global flood risk. As the most vulnerable areas to flooding, urban areas need GDEMs of high quality. However, current Global Digital Elevation Models (GDEMs) are all Digital Surface Models (DSMs) in urban areas, which will cause substantial blockages of flow pathways within flood inundation models. By taking GPS and LIDAR data as terrain observations, errors of popular GDEMs (including SRTM 1” void-filled version DEM - SRTM, Multi-Error-Removed Improved-Terrain DEM - MERIT and TanDEM-X 3” resolution DEM -TDM3) were analysed in seven varied types of cities. It was found that the RMSE of GDEMs errors are in the range of 2.3 m – 7.9 m, and that MERIT and TDM3 both outperformed SRTM. The error comparison between MERIT and TDM3 showed that the most accurate model varied among the studied cities. Generally, error of TDM3 is slightly lower than MERIT, but TDM3 has more extreme errors (absolute value exceeds 15 m). For cities which have experienced rapid development in the past decade, the RMSE of MERIT is lower than that of TDM3, which is mainly caused by the acquisition time difference between these two models. A machine learning method was adopted to estimate MERIT error. Night Time Light, world population density data, Openstreetmap building data, slope, elevation and neighbourhood elevation values from widely available datasets, comprising 14 factors in total, were used in the regression. Models were trained based on single city and combinations of cities, respectively, and then used to estimate error in a target city. By this approach, the RMSE of corrected MERIT can decline by up to 75% with target city trained model, though less significant a reduction of 35% -68% was shown in the combined model with target city excluded in the training data. Further validation via flood simulation showed improvements in terms of both flood extent and inundation depth by the corrected MERIT over the original MERIT, with a validation in small sized city. However, the corrected MERIT was not as good as TDM3 in this case. This method has the potential to generate a better bare-earth global DEM in urban areas, but the sensitive level about the model extrapolative application needs investigation in more study sites.</p>


2020 ◽  
Vol 44 (6) ◽  
pp. 415-427
Author(s):  
Jung Ho Yang ◽  
Jae Hyeon Park ◽  
Seong-Ho Jang ◽  
Jaesung Cho

Objective To present new classification methods of knee osteoarthritis (KOA) using machine learning and compare its performance with conventional statistical methods as classification techniques using machine learning have recently been developed.Methods A total of 84 KOA patients and 97 normal participants were recruited. KOA patients were clustered into three groups according to the Kellgren-Lawrence (K-L) grading system. All subjects completed gait trials under the same experimental conditions. Machine learning-based classification using the support vector machine (SVM) classifier was performed to classify KOA patients and the severity of KOA. Logistic regression analysis was also performed to compare the results in classifying KOA patients with machine learning method.Results In the classification between KOA patients and normal subjects, the accuracy of classification was higher in machine learning method than in logistic regression analysis. In the classification of KOA severity, accuracy was enhanced through the feature selection process in the machine learning method. The most significant gait feature for classification was flexion and extension of the knee in the swing phase in the machine learning method.Conclusion The machine learning method is thought to be a new approach to complement conventional logistic regression analysis in the classification of KOA patients. It can be clinically used for diagnosis and gait correction of KOA patients.


2020 ◽  
Vol 8 (6) ◽  
pp. 3806-3810

While using non-stop advancement of correspondences industry, almost all clients steadily appreciate various interchanges companies. To accomplish persuasive and moderate identification with regard to telecom deceit clients, all of us propose an effective and suitable extortion customer discovery method dependent on customer's Call detail Record (CDR). The suggested strategy contains two segments, specific device learning component and file format discovery element. In the equipment wisdom component, a support Vector machine (SVM) computation dependent on aimed knowledge is actually utilized to team clients making use of outline characteristics. Detail evaluation is similarly completed regarding separating the actual detail associated with networks. Outcomes show that these strategies will help rapidly character the ad calls. The actual investigations display that the technique can achieve high reputation precision regarding 97.56%, which exhibit that the proposed technique has progressively brilliant execution in examination with the best in class draws near


Sign in / Sign up

Export Citation Format

Share Document