Modify Random Forest Algorithm Using Hybrid Feature Selection Method

2018 ◽  
Vol 4 (2) ◽  
pp. 1-6
Author(s):  
Ahmed T. Sadiq‎ ◽  
Karrar Shareef Musawi

Random Forest (RF) is one of the most powerful decision-tree-based methods in machine learning. The proposed hybrid feature selection for Random Forest depends on two measures, Information Gain and Gini Index, combined in varying percentages according to a weight. In this paper, we propose a modified Random Forest algorithm, named Random Forest algorithm using hybrid feature selection, that uses hybrid feature selection instead of a single feature selection measure. The main idea is to compute the Information Gain for all randomly selected features and then search for the best split point in the node, that is, the one that gives the best value for a hybrid equation with the Gini Index. Experimental results on the dataset showed that the proposed modification outperforms the classic Random Forest: compared to the standard static Random Forest, the hybrid feature selection Random Forest shows a significant improvement in accuracy.
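The abstract does not give the hybrid equation itself, so the following is only a minimal sketch of one way a weighted combination of Information Gain and Gini decrease could score a split. The function names, the linear form `weight * IG + (1 - weight) * Gini decrease`, and the default `weight=0.5` are assumptions made for illustration, not the authors' formula.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def impurity_decrease(impurity, parent, left, right):
    """Parent impurity minus the size-weighted impurity of the two children."""
    n = len(parent)
    weighted = (len(left) / n) * impurity(left) + (len(right) / n) * impurity(right)
    return impurity(parent) - weighted


def hybrid_score(parent, left, right, weight=0.5):
    """Assumed hybrid criterion: a linear blend of the two measures."""
    info_gain = impurity_decrease(entropy, parent, left, right)
    gini_decrease = impurity_decrease(gini, parent, left, right)
    return weight * info_gain + (1.0 - weight) * gini_decrease


def best_threshold(values, labels, weight=0.5):
    """Try every candidate threshold on one feature and keep the best hybrid score."""
    best = (None, -1.0)
    # The largest value is skipped so the right child is never empty.
    for t in sorted(set(values))[:-1]:
        left = [y for x, y in zip(values, labels) if x <= t]
        right = [y for x, y in zip(values, labels) if x > t]
        score = hybrid_score(labels, left, right, weight)
        if score > best[1]:
            best = (t, score)
    return best


threshold, score = best_threshold([1, 2, 3, 4], [0, 0, 1, 1])
```

In a full forest, a search like this would run at each node over the randomly drawn feature subset. Varying `weight` shifts the criterion between pure Information Gain (`weight=1.0`) and pure Gini decrease (`weight=0.0`).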

Sensors ◽  
2021 ◽  
Vol 21 (16) ◽  
pp. 5654
Author(s):  
Guo Li ◽  
Chensheng Wang ◽  
Di Zhang ◽  
Guang Yang

Feature selection and dimensionality reduction are important for the performance of wind turbine condition monitoring models using supervisory control and data acquisition (SCADA) data. In this paper, an improved random forest algorithm, namely Feature Simplification Random Forest (FS_RF), is proposed, which is capable of identifying features closely correlated with wind turbine working conditions. Euclidean distances are employed to distinguish the weight of the same feature among different samples, and the feature's importance is measured by means of the random forest algorithm. The selected features are finally verified by a two-layer gated recurrent unit (GRU) neural network used for condition monitoring. The experimental results demonstrate the capacity and effectiveness of the proposed method for wind turbine condition monitoring.


2021 ◽  
Vol 1208 (1) ◽  
pp. 012039
Author(s):  
Vedran Grgić ◽  
Denis Mušić ◽  
Elmir Babović

The paper analyzes the cardiovascular parameters of patients with heart disease. The aim of this study was to predict death in patients with cardiovascular disease based on 12 parameters, using Random Forest and Logistic Regression algorithms. Parameters were tuned for both algorithms to determine the best settings. The most significant predictive factors were identified using the feature selection method of each algorithm. A comparative analysis of the results showed that the Random Forest algorithm achieved the highest accuracy, 90%.


2020 ◽  
Vol 59 (04/05) ◽  
pp. 151-161
Author(s):  
Yuchen Fei ◽  
Fengyu Zhang ◽  
Chen Zu ◽  
Mei Hong ◽  
Xingchen Peng ◽  
...  

Background: An accurate and reproducible method to delineate tumor margins is of great importance in clinical diagnosis and treatment. In nasopharyngeal carcinoma (NPC), due to limitations such as high variability, low contrast, and discontinuous boundaries in the presentation of soft tissues, the tumor margin can be extremely difficult to identify in magnetic resonance imaging (MRI), which makes the NPC segmentation task more challenging. Objectives: The purpose of this work is to develop a semiautomatic algorithm for NPC image segmentation that requires minimal human intervention while delineating tumor margins with high accuracy and reproducibility. Methods: In this paper, we propose a novel feature selection algorithm for identifying the margin in NPC images, named modified random forest recursive feature selection (MRF-RFS). Specifically, to obtain a more discriminative feature subset for segmentation, a modified recursive feature selection method is applied to the original handcrafted feature set. Moreover, we combine the proposed feature selection method with the classical random forest (RF) in the training stage to take full advantage of its intrinsic property (i.e., its feature importance measure). Results: To evaluate the segmentation performance, we verify our method on the T1-weighted MRI images of 18 NPC patients. The experimental results demonstrate that the proposed MRF-RFS method outperforms the baseline methods and deep learning methods on the task of segmenting NPC images. Conclusion: The proposed method could be effective in NPC diagnosis and useful for guiding radiation therapy.
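The abstract does not say how the recursive selection is modified, so the sketch below shows only the plain recursive elimination loop that such a method presumably starts from: score the remaining features, drop the weakest, repeat. The function name, the `importance_fn` callback, and the toy feature names and scores are all hypothetical. In practice the callback would retrain a random forest on the current subset and return its feature importances.

```python
from typing import Callable, Dict, List, Sequence


def recursive_feature_elimination(
    features: Sequence[str],
    importance_fn: Callable[[List[str]], Dict[str, float]],
    n_keep: int,
) -> List[str]:
    """Repeatedly drop the least important feature until n_keep remain."""
    remaining = list(features)
    while len(remaining) > n_keep:
        # Re-score on every round: importances can change as features are removed.
        scores = importance_fn(remaining)
        weakest = min(remaining, key=lambda f: scores[f])
        remaining.remove(weakest)
    return remaining


# Hypothetical fixed importances standing in for a retrained random forest.
toy_scores = {"intensity": 0.5, "texture": 0.3, "gradient": 0.15, "noise": 0.05}


def toy_importance(subset: List[str]) -> Dict[str, float]:
    """Stand-in importance function: look up the fixed toy score of each feature."""
    return {f: toy_scores[f] for f in subset}


selected = recursive_feature_elimination(list(toy_scores), toy_importance, n_keep=2)
```

Passing the importance measure as a callback keeps the elimination loop independent of any particular forest implementation.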


2014 ◽  
Vol 1030-1032 ◽  
pp. 1709-1712
Author(s):  
Kai Min Song ◽  
Xun Yi Ren

Building on research into flow identification algorithms based on statistical features, this paper puts forward a statistical feature selection algorithm to reduce the number of features used in identification and to increase the speed of flow identification. The experimental results show that the algorithm can effectively reduce the number of features and improve the efficiency of identification.


Author(s):  
A. Shamsoddini ◽  
M. R. Aboodi ◽  
J. Karami

Air pollution, one of the most serious forms of environmental pollution, poses a huge threat to human life. It leads to environmental instability and has harmful and undesirable effects on the environment. Modern methods for predicting pollutant concentrations can improve decision making and provide appropriate solutions. This study examines the performance of Random Forest feature selection in combination with multiple linear regression and Multilayer Perceptron Artificial Neural Network methods, in order to obtain an efficient model for estimating carbon monoxide, nitrogen dioxide, sulfur dioxide, and PM2.5 concentrations in the air. The results indicated that Artificial Neural Networks fed with the attributes selected by the Random Forest feature selection method performed more accurately than the other models for all pollutants. The estimation accuracy for sulfur dioxide was lower than for the other air contaminants, whereas nitrogen dioxide was predicted more accurately than the other pollutants.

