Multi-scoring feature selection method based on SVM-RFE for prostate cancer diagnosis

Author(s):  
Dheeb Albashish ◽  
Shahnorbanun Sahran ◽  
Azizi. Abdullah ◽  
Afzan Adam ◽  
Nordashima Abd Shukor ◽  
...  
2016 ◽  
Vol 13 (10) ◽  
pp. 7303-7309 ◽  
Author(s):  
Xiaonan Fang ◽  
Cheng Liang ◽  
Huaxiang Zhang

Ovarian cancer is the most lethal cancer of female reproductive system. Although it only ranks tenth of female malignancy tumors, its death rate is the highest among the female reproductive system tumors. Therefore, there is a pressing and consistent need to better comprehend its pathogenesis. However, the early diagnosis and survival predictions of ovarian cancer patients still remains a challenging problem today. Microarray technology has been widely accepted in early cancer diagnosis and prediction of outcome. Nevertheless, the high-dimension and imbalanced class distribution always disturb the effect of classification. In this paper, we proposed a new imbalanced feature selection method based on Random Forest called IFSRF for ovarian cancer classification. Our method selects AUC as the evaluation criterion when performing feature selection, which can relieve the negative effect of imbalanced classes. We select three manually curated ovarian cancer datasets and five widely used classifiers to show the improvement after using IFSRF. Furthermore, to demonstrate the effectiveness of the proposed method, we compare IFSRF with another widely used feature selection method Relieff. Experiments results on three ovarian cancer diagnosis and survival prediction data sets show that our feature selection method can significantly improve the AUC performance of all classifiers, especially on Random Forest. Meanwhile, the overall prediction accuracy could maintain as well.


2021 ◽  
Vol 12 ◽  
Author(s):  
Fei Yuan ◽  
Zhandong Li ◽  
Lei Chen ◽  
Tao Zeng ◽  
Yu-Hang Zhang ◽  
...  

Cancer is one of the most threatening diseases to humans. It can invade multiple significant organs, including lung, liver, stomach, pancreas, and even brain. The identification of cancer biomarkers is one of the most significant components of cancer studies as the foundation of clinical cancer diagnosis and related drug development. During the large-scale screening for cancer prevention and early diagnosis, obtaining cancer-related tissues is impossible. Thus, the identification of cancer-associated circulating biomarkers from liquid biopsy targeting has been proposed and has become the most important direction for research on clinical cancer diagnosis. Here, we analyzed pan-cancer extracellular microRNA profiles by using multiple machine-learning models. The extracellular microRNA profiles on 11 cancer types and non-cancer were first analyzed by Boruta to extract important microRNAs. Selected microRNAs were then evaluated by the Max-Relevance and Min-Redundancy feature selection method, resulting in a feature list, which were fed into the incremental feature selection method to identify candidate circulating extracellular microRNA for cancer recognition and classification. A series of quantitative classification rules was also established for such cancer classification, thereby providing a solid research foundation for further biomarker exploration and functional analyses of tumorigenesis at the level of circulating extracellular microRNA.


Sign in / Sign up

Export Citation Format

Share Document