Performance Improvement of Heart Disease Prediction by Identifying Optimal Feature Sets Using Feature Selection Technique

Author(s):  
Asif Newaz ◽  
Sabiq Muhtadi
2021 ◽  
Vol 16 ◽  
Author(s):  
Dan Lin ◽  
Jialin Yu ◽  
Ju Zhang ◽  
Huan He ◽  
Xinyun Guo ◽  
...  

Background: Anti-inflammatory peptides (AIPs) are potent therapeutic agents for inflammatory and autoimmune disorders due to their high specificity and minimal toxicity under normal conditions. Therefore, it is greatly significant and beneficial to identify AIPs for further discovering novel and efficient AIPs-based therapeutics. Recently, three computational approaches, which can effectively identify potential AIPs, have been developed based on machine learning algorithms. However, there are several challenges with the existing three predictors. Objective: A novel machine learning algorithm needs to be proposed to improve the AIPs prediction accuracy. Methods: This study attempts to improve the recognition of AIPs by employing multiple primary sequence-based feature descriptors and an efficient feature selection strategy. By sorting features through four enhanced minimal redundancy maximal relevance (emRMR) methods, and then attaching seven different classifiers wrapper methods based on the sequential forward selection algorithm (SFS), we proposed a hybrid feature selection technique emRMR-SFS to optimize feature vectors. Furthermore, by evaluating seven classifiers trained with the optimal feature subset, we developed the extremely randomized tree (ERT) based predictor named PREDAIP for identifying AIPs. Results: We systematically compared the performance of PREDAIP with the existing tools on an independent test dataset. It demonstrates the effectiveness and power of the PREDAIP. The correlation criteria used in emRMR would affect the selection results of the optimal feature subset at the SFS-wrapper stage, which justifies the necessity for considering different correlation criteria in emRMR. Conclusion: We expect that PREDAIP will be useful for the high-throughput prediction of AIPs and the development of AIPs therapeutics.


Author(s):  
Rozlini Mohamed ◽  
Munirah Mohd Yusof ◽  
Noorhaniza Wahid ◽  
Norhanifah Murli ◽  
Muhaini Othman

This paper presents Bat Algorithm and K-Means techniques for classification performance improvement. The objective of this study is to investigate efficiency of Bat Algorithm in discrete dataset and to find the optimum feature in discrete dataset. In this study, one technique that comprise the discretization technique and feature selection technique have been proposed. Our contribution is in two process of classification: pre-processing and feature selection process. First, to proposed discretization techniques called as BkMD, where we hybrid Bat Algorithm technique and K-Means classifier. Second, to proposed BkMDFS as feature selection technique where Bat Algorithm is embed into BkMD. In order to evaluate our proposed techniques, 14 continuous dataset from various applications are used in experiment. From the experiment, results show that BkMDFS outperforms in most performance measures. Hence it shows that, Bat Algorithm have potential to be one of the discretization technique and feature selection technique.


Sign in / Sign up

Export Citation Format

Share Document