scholarly journals Detection of Breast Cancer Through Clinical Data Using Supervised and Unsupervised Feature Selection Techniques

IEEE Access ◽  
2021 ◽  
Vol 9 ◽  
pp. 22090-22105
Author(s):  
Amin Ul Haq ◽  
Jian Ping Li ◽  
Abdus Saboor ◽  
Jalaluddin Khan ◽  
Samad Wali ◽  
...  
2018 ◽  
Vol 10 (10) ◽  
pp. 1564 ◽  
Author(s):  
Patrick Bradley ◽  
Sina Keller ◽  
Martin Weinmann

In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks, where only sparse training data are available. This is motivated by the fact that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction techniques (which only rely on the given feature vectors and not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Thus, feature selection becomes independent of the given classification task and, consequently, a subset of generally versatile features is retained. We present different techniques relying on the topology of the given sparse training data. Thereby, the topology is described with an ultrametricity index. For the latter, we take into account the Murtagh Ultrametricity Index (MUI) which is defined on the basis of triangles within the given data and the Topological Ultrametricity Index (TUI) which is defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we demonstrate the performance of the proposed unsupervised feature selection techniques in comparison to standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The achieved classification results reveal that involving supervised feature selection techniques leads to similar classification results as involving unsupervised feature selection techniques, while the latter perform feature selection independently from the given classification task and thus deliver generally versatile features.


2019 ◽  
Vol 8 (2) ◽  
pp. 6396-6399

Breast Cancer Examination and Prediction are great provocations to the researchers in the medical applications. Breast Cancer Examination distinguishes benign from malignant breast lumps, Breast Cancer Prediction has great deal in foretelling when Breast Cancer is expected to reoccur in patients that have had their cancers excised. Feature Selection is considered to be the preliminary step used in process to find best subsets of attributes. In this paper authors confer about the performance of five classifiers Sequential minimal optimization (SMO), Multilayer Perceptrons, Kstar, Decision Table and Random Forest with and without feature selection. The results manifest that after implying two feature selection techniques such as Correlation based and information based with ranker algorithm there is an augmentation in the accuracy rate of the classifier. It has been observed that after through implication feature selection techniques accuracy of the classifiers such as SMO, Multilayer Perceptrons, Kstar, Decision Trees, and Random Forest are enhanced.


Author(s):  
Leena Nesamani S. ◽  
S. Nirmala Sigirtha Rajini

Predictive modeling or predict analysis is the process of trying to predict the outcome from data using machine learning models. The quality of the output predominantly depends on the quality of the data that is provided to the model. The process of selecting the best choice of input to a machine learning model depends on a variety of criteria and is referred to as feature engineering. The work is conducted to classify the breast cancer patients into either the recurrence or non-recurrence category. A categorical breast cancer dataset is used in this work from which the best set of features is selected to make accurate predictions. Two feature selection techniques, namely the chi-squared technique and the mutual information technique, have been used. The selected features were then used by the logistic regression model to make the final prediction. It was identified that the mutual information technique proved to be more efficient and produced higher accuracy in the predictions.


Author(s):  
L Kanya kumara, Et. al.

The most of the women in the world are suffering from a deadly disease called Breast Cancer (BC). Breast cancer is analyzed by using imaging modalities such as mammograms, magnetic resonance imaging, ultrasound, and thermograms. Among all, mammograms are the low dosage, less cost, more effective, and accurate method to detect BC in early stages. There are many Computer-Aided Detection (CAD) systems for the automatic detection of masses in mammograms. These techniques are helping radiologists and physicians in diagnosing disease. The objective of this paper is to overview different CAD systems in which mainly we focused on feature selection, as feature selection techniques are used to reduce the complexity of the classifiers and also increase the accuracy. We conclude that suitable optimization techniques should be chosen to increase the accuracy of the classifier so that we can increase the survival rate of the patient.


Sign in / Sign up

Export Citation Format

Share Document