Detection of Breast Cancer Through Clinical Data Using Supervised and Unsupervised Feature Selection Techniques

In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks where only sparse training data are available. Unsupervised feature selection combines the advantages of standard dimensionality reduction techniques (which rely only on the given feature vectors and not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Feature selection thus becomes independent of the given classification task, and a subset of generally versatile features is retained. We present different techniques that rely on the topology of the given sparse training data, where the topology is described by an ultrametricity index. Specifically, we consider the Murtagh Ultrametricity Index (MUI), which is defined on the basis of triangles within the given data, and the Topological Ultrametricity Index (TUI), which is defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we compare the proposed unsupervised feature selection techniques with standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The classification results reveal that supervised and unsupervised feature selection techniques lead to similar classification results, while the latter perform feature selection independently of the given classification task and thus deliver generally versatile features.
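A triangle-based ultrametricity index of the kind the abstract describes can be sketched in a few lines. The abstract does not give the exact definition, so the criterion below is an assumption in the spirit of Murtagh's angle-based formulation: a triangle counts as ultrametric when its smallest angle is at most 60 degrees and its two larger angles differ by less than a tolerance (2 degrees here). The function names `triangle_angles` and `ultrametricity_index` and the parameter `tol_deg` are hypothetical, and how the paper turns the index into a feature selection criterion is not shown.

```python
import itertools
import math


def triangle_angles(p, q, r):
    """Return the interior angles (degrees) of triangle pqr, or None if two points coincide."""
    a = math.dist(q, r)  # side opposite p
    b = math.dist(p, r)  # side opposite q
    c = math.dist(p, q)  # side opposite r
    if min(a, b, c) == 0.0:
        return None

    def angle(opp, s1, s2):
        # Law of cosines, clamped to guard against rounding outside [-1, 1].
        cos_val = (s1 * s1 + s2 * s2 - opp * opp) / (2 * s1 * s2)
        return math.degrees(math.acos(max(-1.0, min(1.0, cos_val))))

    return angle(a, b, c), angle(b, a, c), angle(c, a, b)


def ultrametricity_index(points, tol_deg=2.0):
    """Fraction of point triplets forming an (approximately) isosceles triangle with small base.

    Assumed criterion: smallest angle <= 60 degrees and the two larger
    angles differ by less than tol_deg degrees.
    """
    total = 0
    ultrametric = 0
    for p, q, r in itertools.combinations(points, 3):
        angles = triangle_angles(p, q, r)
        if angles is None:
            continue  # skip degenerate triplets with coinciding points
        total += 1
        small, mid, large = sorted(angles)
        if small <= 60.0 + 1e-9 and (large - mid) < tol_deg:
            ultrametric += 1
    return ultrametric / total if total else 0.0
```

The index is computed from the feature vectors alone, which matches the unsupervised setting described above: no class labels enter the calculation.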

Using Feature Selection Techniques to Improve the Accuracy of Breast Cancer Classification

Innovations in Smart Cities Applications Edition 2 - Lecture Notes in Intelligent Transportation and Infrastructure ◽

10.1007/978-3-030-11196-0_28 ◽

2019 ◽

pp. 307-315 ◽

Cited By ~ 2

Author(s):

Hajar Saoud ◽

Abderrahim Ghadi ◽

Mohamed Ghailani ◽

Boudhir Anouar Abdelhakim

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Cancer Classification ◽

Breast Cancer Classification ◽

Feature Selection Techniques

Analysis of breast cancer data: a comparative study on different feature selection techniques

2020 International Multi-Conference on: “Organization of Knowledge and Advanced Technologies” (OCTA) ◽

10.1109/octa49274.2020.9151824 ◽

2020 ◽

Author(s):

Kaouther Nouira ◽

Zainab Maalej ◽

Fahmi Ben Rejab ◽

Linda Ouerfelly ◽

Ahmed Ferchichi

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Comparative Study ◽

Breast Cancer Data ◽

Cancer Data ◽

Feature Selection Techniques

Augmentation of Classifier Accuracy through Implication of Feature Selection for Breast Cancer Prediction

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b2216.078219 ◽

2019 ◽

Vol 8 (2) ◽

pp. 6396-6399

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Random Forest ◽

Multilayer Perceptrons ◽

Accuracy Rate ◽

Cancer Prediction ◽

Malignant Breast ◽

Selection For ◽

Breast Lumps ◽

Feature Selection Techniques

Breast cancer examination and prediction pose major challenges for researchers in medical applications. Breast cancer examination distinguishes benign from malignant breast lumps, while breast cancer prediction aims to foretell when breast cancer is likely to recur in patients whose cancers have been excised. Feature selection is a preliminary step used to find the best subsets of attributes. In this paper, the authors discuss the performance of five classifiers, Sequential Minimal Optimization (SMO), Multilayer Perceptrons, Kstar, Decision Table, and Random Forest, with and without feature selection. The results show that applying two feature selection techniques, correlation-based and information-based, each with the Ranker algorithm, increases the accuracy rate of the classifiers. After feature selection, the accuracy of classifiers such as SMO, Multilayer Perceptrons, Kstar, Decision Trees, and Random Forest is enhanced.
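The correlation-based ranking mentioned in this abstract could be sketched as follows. This is a minimal illustration and not the authors' setup: it assumes numeric features, a 0/1 class label, and the absolute Pearson correlation with the label as the ranking score. The function names `pearson` and `rank_by_correlation` and the parameter `keep` are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def rank_by_correlation(columns, labels, keep):
    """Return the names of the `keep` features most correlated (in absolute value) with the labels.

    `columns` maps a feature name to its list of values, one per sample.
    """
    scored = sorted(
        ((abs(pearson(values, labels)), name) for name, values in columns.items()),
        reverse=True,
    )
    return [name for _, name in scored[:keep]]
```

A classifier would then be trained twice, once on all columns and once on the retained subset, to compare accuracy with and without feature selection.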

Predictive Modeling for Classification of Breast Cancer Dataset Using Feature Selection Techniques

Handbook of Research on Innovations and Applications of AI, IoT, and Cognitive Technologies - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-7998-6870-5.ch015 ◽

2021 ◽

pp. 204-215

Author(s):

Leena Nesamani S. ◽

S. Nirmala Sigirtha Rajini

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Feature Selection ◽

Mutual Information ◽

Predictive Modeling ◽

Breast Cancer Dataset ◽

Cancer Dataset ◽

Chi Squared ◽

Feature Selection Techniques

Predictive modeling, or predictive analysis, is the process of trying to predict an outcome from data using machine learning models. The quality of the output depends predominantly on the quality of the data provided to the model. The process of selecting the best input to a machine learning model depends on a variety of criteria and is referred to as feature engineering. This work classifies breast cancer patients into either the recurrence or the non-recurrence category. A categorical breast cancer dataset is used, from which the best set of features is selected to make accurate predictions. Two feature selection techniques, the chi-squared technique and the mutual information technique, were applied. The selected features were then used by a logistic regression model to make the final prediction. The mutual information technique proved to be more efficient and produced higher prediction accuracy.
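The two scoring criteria named in this abstract can be illustrated for categorical features with a short sketch. It is not the chapter's implementation, which the abstract does not describe: the chi-squared score below is the classical Pearson statistic computed from a contingency table, and mutual information is estimated from empirical frequencies (in nats). The function names `chi_squared`, `mutual_information`, and `rank_features` are hypothetical.

```python
import math
from collections import Counter


def chi_squared(feature, labels):
    """Pearson chi-squared statistic between a categorical feature and the class labels."""
    n = len(labels)
    joint = Counter(zip(feature, labels))
    f_counts = Counter(feature)
    l_counts = Counter(labels)
    stat = 0.0
    for f, fc in f_counts.items():
        for l, lc in l_counts.items():
            expected = fc * lc / n  # count expected under independence
            observed = joint.get((f, l), 0)
            stat += (observed - expected) ** 2 / expected
    return stat


def mutual_information(feature, labels):
    """Empirical mutual information (nats) between a categorical feature and the class labels."""
    n = len(labels)
    joint = Counter(zip(feature, labels))
    f_counts = Counter(feature)
    l_counts = Counter(labels)
    mi = 0.0
    for (f, l), c in joint.items():
        # p(f, l) * log( p(f, l) / (p(f) * p(l)) ), written in terms of counts
        mi += (c / n) * math.log(c * n / (f_counts[f] * l_counts[l]))
    return mi


def rank_features(columns, labels, score):
    """Return feature names ordered from highest to lowest score."""
    return sorted(columns, key=lambda name: score(columns[name], labels), reverse=True)
```

Both scores are zero when a feature is independent of the label and grow as the feature becomes more predictive; the top-ranked features would then be passed to the downstream classifier, logistic regression in the work described above.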

A Review Paper on Feature Selection Techniques and Artificial Neural Networks Architectures Used in Thermography for Early Stage Detection of Breast Cancer

Advances in Intelligent Systems and Computing - Soft Computing: Theories and Applications ◽

10.1007/978-981-15-4032-5_42 ◽

2020 ◽

pp. 455-465

Author(s):

Kumod Kumar Gupta ◽

Ritu Vijay ◽

Pallavi Pahadiya

Keyword(s):

Breast Cancer ◽

Neural Networks ◽

Feature Selection ◽

Artificial Neural Networks ◽

Review Paper ◽

Early Stage ◽

Artificial Neural ◽

Feature Selection Techniques

Feature Selection Techniques on Thyroid, Hepatitis, and Breast Cancer Datasets

International Journal on Data Mining and Intelligent Information Technology Applications ◽

10.4156/ijmia.vol3.issue1.1 ◽

2013 ◽

Vol 3 (1) ◽

pp. 1-8 ◽

Cited By ~ 3

Author(s):

Mohammad Ashraf ◽

Girija Chetty ◽

Dat Tran

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Feature Selection Techniques

Efficient breast cancer detection using sequential feature selection techniques

2015 IEEE Seventh International Conference on Intelligent Computing and Information Systems (ICICIS) ◽

10.1109/intelcis.2015.7397261 ◽

2015 ◽

Cited By ~ 1

Author(s):

Taha Mahdy Mohamed

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Cancer Detection ◽

Breast Cancer Detection ◽

Sequential Feature Selection ◽

Feature Selection Techniques

A Review on Feature Selection Techniques in Digital Mammograms

Turkish Journal of Computer and Mathematics Education (TURCOMAT) ◽

10.17762/turcomat.v12i2.2392 ◽

2021 ◽

Vol 12 (2) ◽

pp. 3329-3338

Author(s):

L Kanya kumara et al.

Keyword(s):

Breast Cancer ◽

Feature Selection ◽

Accurate Method ◽

Optimization Techniques ◽

Computer Aided Detection ◽

Resonance Imaging ◽

Cad Systems ◽

The World ◽

Feature Selection Techniques ◽

Low Dosage

Many women around the world suffer from breast cancer (BC), a deadly disease. Breast cancer is analyzed using imaging modalities such as mammograms, magnetic resonance imaging, ultrasound, and thermograms. Among these, mammography is a low-dose, low-cost, effective, and accurate method for detecting BC in its early stages. There are many Computer-Aided Detection (CAD) systems for the automatic detection of masses in mammograms. These systems help radiologists and physicians diagnose the disease. The objective of this paper is to give an overview of different CAD systems, with a focus on feature selection, since feature selection techniques are used to reduce the complexity of the classifiers and to increase their accuracy. We conclude that suitable optimization techniques should be chosen to increase the accuracy of the classifier and thereby improve the survival rate of patients.
