Author(s):  
Mohd.Ashique Ridwan Nayeem ◽  
Md.A.Mannan Joadder ◽  
Shahrin Ahammad Shetu ◽  
Farzin Raeeda Jamil ◽  
Abdullah Al Helal

2019 ◽  
Vol 26 (1) ◽  
pp. 107327481987659 ◽  
Author(s):  
Flavio S. Fogliatto ◽  
Michel J. Anzanello ◽  
Felipe Soares ◽  
Priscila G. Brust-Renck

Several statistical-based approaches have been developed to support medical personnel in early breast cancer detection. This article presents a method for feature selection aimed at classifying cases into categories based on patients’ breast tissue measures and protein microarray. The effectiveness of this feature selection strategy was evaluated against the commonly used Wisconsin Breast Cancer Database—WBCD (with several patients and fewer features) and a new protein microarray data set (with several features and fewer patients). Features were ranked according to a feature importance index that combines parameters emerging from the unsupervised method of principal component analysis and the supervised method of Bhattacharyya distance. Observations of a training set were iteratively categorized into malignant and benign cases through 3 classification techniques: k-Nearest Neighbor, linear discriminant analysis, and probabilistic neural network. After each classification, the feature with the smallest importance index was removed, and a new categorization was carried out until there was only one feature left. The subset yielding maximum accuracy was used to classify observations in the testing set. Our method yielded average 99.17% accurate classifications in the testing set while retaining average 4.61 out of 9 features in the WBCD, which is comparable to the best results reported by the literature on that data set, with the advantage of relying on simple and widely available multivariate techniques. When applied to the microarray data, the method yielded average accuracy of 98.30% while retaining average 2.17% of the original features. Our results can aid health-care professionals during early diagnosis of breast cancer.


2016 ◽  
Author(s):  
Dariusz Jagodziński ◽  
Mateusz Matysiewicz ◽  
Łukasz Neumann ◽  
Robert M. Nowak ◽  
Rafał Okuniewski ◽  
...  

2015 ◽  
Vol 67 (1) ◽  
pp. 13-18
Author(s):  
Smaranda Belciug ◽  
Mircea-Sebastian Serbanescu

Abstract Feature selection is considered a key factor in classifications/decision problems. It is currently used in designing intelligent decision systems to choose the best features which allow the best performance. This paper proposes a regression-based approach to select the most important predictors to significantly increase the classification performance. Application to breast cancer detection and recurrence using publically available datasets proved the efficiency of this technique.


Sign in / Sign up

Export Citation Format

Share Document