THE EFFECT OF DATASETS ON BREAST CANCER DETECTION MODELS

Kumawuese Jennifer Kurugh; Muhammad Aminu Ahmad; Awwal Ahmad Babajo

doi:10.33003/fjs-2020-0404-487

FUDMA Journal of Sciences ◽

10.33003/fjs-2020-0404-487 ◽

2021 ◽

Vol 4 (4) ◽

pp. 309-315

Author(s):

Kumawuese Jennifer Kurugh ◽

Muhammad Aminu Ahmad ◽

Awwal Ahmad Babajo

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Learning Algorithms ◽

Cancer Classification ◽

Machine Learning Algorithms ◽

Support Vector ◽

Breast Cancer Dataset ◽

Machine Learning Classifiers ◽

Breast Cancer Classification ◽

Learning Classifiers

Datasets are a major requirement in the development of breast cancer classification/detection models using machine learning algorithms. These models can provide an effective, accurate, and less expensive method of diagnosis and reduce loss of life. However, using the same machine learning algorithms on different datasets yields different results. This research developed several machine learning models for breast cancer classification/detection using Random Forest, Support Vector Machine, K-Nearest Neighbors, Gaussian Naïve Bayes, Perceptron, and Logistic Regression. Three widely used test datasets were used: Wisconsin Breast Cancer (WBC) Original, Wisconsin Diagnostic Breast Cancer (WDBC), and Wisconsin Prognostic Breast Cancer (WPBC). The results show that the choice of dataset affects the performance of machine learning classifiers, and that the classifiers perform differently on any given breast cancer dataset.

Diagnosis of breast cancer using machine learning algorithms based on features selected by Genetic Algorithm: Assessed on five datasets

Journal of University of Shanghai for Science and Technology ◽

10.51201/jusst/21/11963 ◽

2021 ◽

Vol 23 (11) ◽

pp. 749-758

Author(s):

Saranya N ◽

Kavi Priya S

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Genetic Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Cancer Prognosis ◽

Support Vector ◽

Breast Cancer Dataset ◽

Human Beings ◽

Original Dataset

Breast cancer is one of the chronic diseases that affect human beings throughout the world. Early detection of this disease is the most promising way to improve patients’ chances of survival. The strategy employed in this paper is to select the best features from various breast cancer datasets using a genetic algorithm and then apply a machine learning algorithm to predict the outcomes. Two machine learning algorithms, Support Vector Machine (SVM) and Decision Tree (DT), are used along with the Genetic Algorithm (GA). The proposed work is evaluated on five datasets: the Wisconsin Breast Cancer-Diagnosis Dataset, the Wisconsin Breast Cancer-Original Dataset, the Wisconsin Breast Cancer-Prognosis Dataset, the ISPY1 Clinical Trial Dataset, and the Breast Cancer Dataset. The results show that SVM-GA achieves a higher accuracy (98.16%) than DT-GA (97.44%).
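The selection step described in this abstract can be illustrated with a minimal genetic-algorithm sketch. It is not the authors' implementation: the names `ga_select_features` and `toy_fitness`, the parameter defaults, and the choice of tournament selection with one-point crossover are all assumptions made for illustration. In the paper's setting, the fitness would presumably be the accuracy of an SVM or decision tree trained on the selected features; here a toy fitness stands in for it.

```python
import random

# Hypothetical stand-in for classifier accuracy: rewards selecting the
# "informative" features 0, 3 and 5 and lightly penalizes mask size.
INFORMATIVE = {0, 3, 5}

def toy_fitness(mask):
    hits = sum(1 for i, bit in enumerate(mask) if bit and i in INFORMATIVE)
    return hits - 0.1 * sum(mask)

def _tournament(pop, fitness, rng):
    """Pick two distinct individuals at random and return the fitter one."""
    a, b = rng.sample(pop, 2)
    return a if fitness(a) >= fitness(b) else b

def ga_select_features(n_features, fitness, pop_size=20, generations=30,
                       mutation_rate=0.05, seed=0):
    """Return a 0/1 feature mask found by a simple genetic algorithm."""
    if n_features < 2 or pop_size < 2:
        raise ValueError("need at least 2 features and 2 individuals")
    rng = random.Random(seed)
    # Each individual is a bit mask: 1 keeps the feature, 0 drops it.
    pop = [[rng.randint(0, 1) for _ in range(n_features)]
           for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        children = [best[:]]  # elitism: the best mask always survives
        while len(children) < pop_size:
            p1 = _tournament(pop, fitness, rng)
            p2 = _tournament(pop, fitness, rng)
            cut = rng.randint(1, n_features - 1)  # one-point crossover
            child = p1[:cut] + p2[cut:]
            # Flip each bit independently with probability mutation_rate.
            child = [1 - g if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        pop = children
        best = max(pop, key=fitness)
    return best

if __name__ == "__main__":
    mask = ga_select_features(8, toy_fitness)
    print("selected feature indices:", [i for i, b in enumerate(mask) if b])
```

Replacing `toy_fitness` with a function that trains a classifier on the masked columns and returns its cross-validated accuracy would bring the sketch closer to the SVM-GA and DT-GA setup the abstract names.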

Breast Cancer Classification Using Machine Learning Algorithms

Machine Learning for Predictive Analysis - Lecture Notes in Networks and Systems ◽

10.1007/978-981-15-7106-0_56 ◽

2020 ◽

pp. 571-578

Author(s):

Simran Sharma ◽

Sachin Deshpande

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Learning Algorithms ◽

Cancer Classification ◽

Machine Learning Algorithms ◽

Breast Cancer Classification

Breast Cancer: Classification of Tumors Using Machine Learning Algorithms

2021 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA) ◽

10.1109/civemsa52099.2021.9493583 ◽

2021 ◽

Author(s):

David Hettich ◽

Megan Olson ◽

Andie Jackson ◽

Naima Kaabouch

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Learning Algorithms ◽

Cancer Classification ◽

Machine Learning Algorithms ◽

Breast Cancer Classification ◽

Classification Of Tumors

Breast cancer classification using machine learning techniques: a comparative study

Medical Technologies Journal ◽

10.26415/2572-004x-vol4iss2p535-544 ◽

2020 ◽

Vol 4 (2) ◽

pp. 535-544

Author(s):

Djihane HOUFANI ◽

Sihem SLATNIA ◽

Okba KAZAR ◽

Noureddine ZERHOUNI ◽

...

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Logistic Regression ◽

Multilayer Perceptron ◽

Learning Algorithms ◽

Cancer Classification ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Breast Cancer Classification ◽

Learning Techniques

Background: Breast cancer is the second deadliest disease affecting women worldwide, after lung cancer. Traditional approaches to breast cancer diagnosis are time-consuming and prone to human error in classification. To deal with these problems, many research works based on machine learning techniques have been proposed. These approaches have shown their effectiveness in data classification in many fields, especially in healthcare. Methods: In this cross-sectional study, we conducted a practical comparison of the machine learning algorithms most used in the literature. We applied kernel and linear support vector machines, random forest, decision tree, multilayer perceptron, logistic regression, and k-nearest neighbors to breast cancer tumor classification. The dataset used is the Wisconsin Diagnostic Breast Cancer dataset. Results: After comparing the efficiency of the machine learning algorithms, we found that multilayer perceptron and logistic regression gave the best results, with an accuracy of 98% for breast cancer classification. Conclusion: Machine learning approaches are extensively used in medical prediction and decision support systems. This study showed that the multilayer perceptron and logistic regression algorithms perform well (good accuracy, specificity, and sensitivity) compared to the other evaluated algorithms.
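The three measures named in the conclusion can all be derived from a binary confusion matrix. The following is a minimal sketch, assuming labels are encoded as 1 for malignant and 0 for benign; that encoding and the function names are illustrative and not taken from the study.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, tn, fn) for a binary classification task."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1
            else:
                fp += 1
        else:
            if truth == positive:
                fn += 1
            else:
                tn += 1
    return tp, fp, tn, fn

def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, sensitivity (recall) and specificity."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + tn + fn
    return {
        # Share of all cases classified correctly.
        "accuracy": (tp + tn) / total if total else 0.0,
        # Share of malignant cases that were detected.
        "sensitivity": tp / (tp + fn) if (tp + fn) else 0.0,
        # Share of benign cases that were correctly cleared.
        "specificity": tn / (tn + fp) if (tn + fp) else 0.0,
    }

if __name__ == "__main__":
    # Made-up labels, for illustration only.
    truth = [1, 1, 1, 0, 0, 0, 0, 1]
    preds = [1, 1, 0, 0, 0, 1, 0, 1]
    print(classification_metrics(truth, preds))
```

Reporting sensitivity and specificity alongside accuracy matters for diagnosis because a missed malignant case and a false alarm carry very different costs, and accuracy alone does not separate the two.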

Malignant and Benign Breast Cancer Classification using Machine Learning Algorithms

2021 International Conference on Artificial Intelligence (ICAI) ◽

10.1109/icai52203.2021.9445249 ◽

2021 ◽

Author(s):

Sharmin Ara ◽

Annesha Das ◽

Ashim Dey

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Learning Algorithms ◽

Cancer Classification ◽

Machine Learning Algorithms ◽

Breast Cancer Classification

Assessment of Machine Learning Algorithms for Prediction of Breast Cancer Malignancy Based on Mammogram Numeric Data

10.1101/2020.01.08.20016949 ◽

2020 ◽

Cited By ~ 1

Author(s):

Peter T. Habib ◽

Alsamman M. Alsamman ◽

Sameh E. Hassnein ◽

Ghada A. Shereif ◽

Aladdin Hamwieh

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Cross Validation ◽

Mean Squared Error ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Adjusted Rand Index ◽

Support Vector ◽

Cancer Information ◽

Term Care

With an estimated 268,600 new cases in 2019, breast cancer is one of the most common cancers and one of the world’s leading causes of death for women. Classification and data mining are an efficient way to classify information, particularly in the medical field, where prediction techniques are commonly used for early detection and effective treatment in diagnosis and research. This paper tests 23 of the more widely used machine learning algorithms, such as Decision Tree, Random Forest, K-Nearest Neighbors, and Support Vector Machine, on mammogram-derived breast cancer data. The results are obtained from randomly split data using a repeated 10-fold cross-validation method. Performance is calculated with regression metrics such as Mean Absolute Error, Mean Squared Error, and R2 Score, and with clustering metrics such as Adjusted Rand Index, Homogeneity, and V-measure; accuracy is also checked with F-Measure, AUC, and cross-validation. Proper identification of patients with breast cancer would create care opportunities; for example, supervision and the implementation of intervention plans could improve the quality of long-term care. Experimental results reveal that the maximum precision of 100%, with the lowest error rate, is obtained with the AdaBoost classifier.
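The validation scheme mentioned in this abstract can be sketched as follows, on the assumption that "replicated 10-fold cross-validation" means reshuffling the data and repeating a 10-fold split several times. The function name `repeated_kfold_indices` and its default of three repeats are illustrative choices, not details from the paper.

```python
import random

def repeated_kfold_indices(n_samples, n_splits=10, n_repeats=3, seed=0):
    """Yield (train_indices, test_indices) pairs for repeated k-fold CV.

    Every repeat reshuffles the sample indices, then cuts them into
    n_splits folds whose sizes differ by at most one.
    """
    if not 2 <= n_splits <= n_samples:
        raise ValueError("n_splits must be between 2 and n_samples")
    rng = random.Random(seed)
    for _ in range(n_repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)
        base, extra = divmod(n_samples, n_splits)
        start = 0
        for fold in range(n_splits):
            # The first `extra` folds take one additional sample.
            size = base + (1 if fold < extra else 0)
            test = indices[start:start + size]
            train = indices[:start] + indices[start + size:]
            start += size
            yield train, test

if __name__ == "__main__":
    for train, test in repeated_kfold_indices(25, n_splits=10, n_repeats=1):
        print(len(train), len(test))
```

Averaging a score over all yielded splits gives an estimate that depends less on any single random partition than one 10-fold run does.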

Comparative Study of Machine Learning Algorithms using a Breast Cancer Dataset

2020 IEEE International Conference on Electro Information Technology (EIT) ◽

10.1109/eit48999.2020.9208315 ◽

2020 ◽

Author(s):

Zaid A. El-Shair ◽

Luis A. Sanchez-Perez ◽

Samir A. Rawashdeh

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Comparative Study ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Breast Cancer Dataset ◽

Cancer Dataset

Performance Analysis of Supervised Machine Learning Algorithms for Diabetes and Breast Cancer Dataset

2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS) ◽

10.1109/icais50930.2021.9396043 ◽

2021 ◽

Author(s):

Aayushi Bansal ◽

Anita Singhrova

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Performance Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Breast Cancer Dataset ◽

Cancer Dataset

BCD-WERT: a novel approach for breast cancer detection using whale optimization based efficient features and extremely randomized tree algorithm

PeerJ Computer Science ◽

10.7717/peerj-cs.390 ◽

2021 ◽

Vol 7 ◽

pp. e390

Author(s):

Shafaq Abbas ◽

Zunera Jalil ◽

Abdul Rehman Javed ◽

Iqra Batool ◽

Mohammad Zubair Khan ◽

...

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Support Vector Machine ◽

Feature Selection ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Experimental Results ◽

Support Vector ◽

Novel Approach ◽

Whale Optimization

Breast cancer is one of the leading causes of death in the current age. It often results in subpar living conditions for patients, who have to go through expensive and painful treatments to fight this cancer. One in eight women all over the world is affected by this disease, and almost half a million women die from it each year. Machine learning algorithms have proven to outperform all existing solutions for the prediction of breast cancer using models built on previously available data. In this paper, a novel approach named BCD-WERT is proposed that utilizes the Extremely Randomized Tree and the Whale Optimization Algorithm (WOA) for efficient feature selection and classification. WOA reduces the dimensionality of the dataset and extracts the relevant features for accurate classification. Experimental results on a state-of-the-art comprehensive dataset demonstrated improved performance in comparison with eight other machine learning algorithms: Support Vector Machine (SVM), Random Forest, Kernel Support Vector Machine, Decision Tree, Logistic Regression, Stochastic Gradient Descent, Gaussian Naive Bayes, and k-Nearest Neighbor. BCD-WERT outperformed all of them with the highest accuracy rate of 99.30%, followed by SVM with 98.60% accuracy. Experimental results also reveal the effectiveness of feature selection techniques in improving prediction accuracy.

A Comparative Analysis of Feature Selection Methods and Associated Machine Learning Algorithms on Wisconsin Breast Cancer Dataset (WBCD)

Advances in Intelligent Systems and Computing - Proceedings of International Conference on ICT for Sustainable Development ◽

10.1007/978-981-10-0129-1_23 ◽

2016 ◽

pp. 215-224 ◽

Cited By ~ 3

Author(s):

Nileshkumar Modi ◽

Kaushar Ghanchi

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Feature Selection ◽

Comparative Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Breast Cancer Dataset ◽

Selection Methods ◽

Cancer Dataset