Efficient breast cancer detection using sequential feature selection techniques

Several statistical-based approaches have been developed to support medical personnel in early breast cancer detection. This article presents a method for feature selection aimed at classifying cases into categories based on patients’ breast tissue measures and protein microarray. The effectiveness of this feature selection strategy was evaluated against the commonly used Wisconsin Breast Cancer Database—WBCD (with several patients and fewer features) and a new protein microarray data set (with several features and fewer patients). Features were ranked according to a feature importance index that combines parameters emerging from the unsupervised method of principal component analysis and the supervised method of Bhattacharyya distance. Observations of a training set were iteratively categorized into malignant and benign cases through 3 classification techniques: k-Nearest Neighbor, linear discriminant analysis, and probabilistic neural network. After each classification, the feature with the smallest importance index was removed, and a new categorization was carried out until there was only one feature left. The subset yielding maximum accuracy was used to classify observations in the testing set. Our method yielded average 99.17% accurate classifications in the testing set while retaining average 4.61 out of 9 features in the WBCD, which is comparable to the best results reported by the literature on that data set, with the advantage of relying on simple and widely available multivariate techniques. When applied to the microarray data, the method yielded average accuracy of 98.30% while retaining average 2.17% of the original features. Our results can aid health-care professionals during early diagnosis of breast cancer.

Download Full-text

Classification in Thermograms for Breast Cancer Detection using Texture Features with Feature Selection Method and Ensemble Classifier

2019 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT) ◽

10.1109/icict46931.2019.8977652 ◽

2019 ◽

Author(s):

Asim Ali Khan ◽

Ajat Shatru Arora

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Cancer Detection ◽

Feature Selection Method ◽

Texture Features ◽

Ensemble Classifier ◽

Selection Method ◽

Breast Cancer Detection

Download Full-text

A hybrid breast cancer detection system via neural network and feature selection based on SBS, SFS and PCA

Neural Computing and Applications ◽

10.1007/s00521-012-0982-6 ◽

2012 ◽

Vol 23 (3-4) ◽

pp. 719-728 ◽

Cited By ~ 10

Author(s):

Mustafa Serter Uzer ◽

Onur Inan ◽

Nihat Yılmaz

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Feature Selection ◽

Cancer Detection ◽

Detection System ◽

Breast Cancer Detection

Download Full-text

Feature selection and definition for contours classification of thermograms in breast cancer detection

10.1117/12.2249064 ◽

2016 ◽

Author(s):

Dariusz Jagodziński ◽

Mateusz Matysiewicz ◽

Łukasz Neumann ◽

Robert M. Nowak ◽

Rafał Okuniewski ◽

...

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Cancer Detection ◽

Breast Cancer Detection

Download Full-text

Breast cancer detection using feature selection and active learning

Computer, Communication and Electrical Technology ◽

10.1201/9781315400624-9 ◽

2017 ◽

pp. 43-48 ◽

Cited By ~ 2

Author(s):

S. Begum ◽

S.P. Bera ◽

D. Chakraborty ◽

R. Sarkar

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Active Learning ◽

Cancer Detection ◽

Breast Cancer Detection

Download Full-text

Regression-Based Approach For Feature Selection In Classification Issues. Application To Breast Cancer Detection And Recurrence

ACTA Universitatis Cibiniensis ◽

10.1515/aucts-2015-0057 ◽

2015 ◽

Vol 67 (1) ◽

pp. 13-18

Author(s):

Smaranda Belciug ◽

Mircea-Sebastian Serbanescu

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Cancer Detection ◽

Classification Performance ◽

Breast Cancer Detection ◽

Decision Problems ◽

Decision Systems ◽

Intelligent Decision ◽

Key Factor ◽

Intelligent Decision Systems

Abstract Feature selection is considered a key factor in classifications/decision problems. It is currently used in designing intelligent decision systems to choose the best features which allow the best performance. This paper proposes a regression-based approach to select the most important predictors to significantly increase the classification performance. Application to breast cancer detection and recurrence using publically available datasets proved the efficiency of this technique.

Download Full-text