Using new artificial bee colony as probabilistic neural network for breast cancer data classification

Purpose: Breast cancer is a major medical disorder; it is not a single disease but a cluster of more than 200 different serious medical complications.

Design/methodology/approach: A new artificial bee colony (ABC) implementation is applied to a probabilistic neural network (PNN), for training and testing, to classify the breast cancer data set.

Findings: The new ABC algorithm, together with the PNN, was successfully applied to the breast cancer data set for prediction, using a minimal number of iterations.

Originality/value: The new implementation of ABC with PNN can readily be applied to time series problems for accurate prediction or classification.
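To make the PNN part of this abstract concrete, here is a minimal sketch of a Parzen-window style probabilistic neural network classifier. It is not the authors' implementation: the function name `pnn_predict`, the smoothing parameter `sigma`, and the toy two-feature data are all assumptions made for illustration. The abstract does not say which PNN parameters the ABC algorithm tunes; a smoothing width such as `sigma` is simply one plausible candidate.

```python
import math

def pnn_predict(train_X, train_y, x, sigma=1.0):
    """Classify x by the class with the highest mean Gaussian kernel activation."""
    sums, counts = {}, {}
    for xi, yi in zip(train_X, train_y):
        # Pattern layer: Gaussian kernel centred on each training sample.
        d2 = sum((a - b) ** 2 for a, b in zip(xi, x))
        k = math.exp(-d2 / (2.0 * sigma ** 2))
        # Summation layer: accumulate activations per class.
        sums[yi] = sums.get(yi, 0.0) + k
        counts[yi] = counts.get(yi, 0) + 1
    # Decision layer: pick the class with the largest average activation.
    scores = {c: sums[c] / counts[c] for c in sums}
    return max(scores, key=scores.get)

# Toy data (hypothetical, not from the breast cancer data set).
train_X = [[1.0, 1.0], [1.2, 0.8], [5.0, 5.0], [5.2, 4.9]]
train_y = ["benign", "benign", "malignant", "malignant"]
print(pnn_predict(train_X, train_y, [1.1, 0.9], sigma=0.5))
```

A PNN of this kind has no iterative weight training; the choice of `sigma` largely determines its behaviour, which is why pairing it with a search heuristic is a natural design.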

An Intelligent Artificial Bee Colony and Adaptive Bacterial Foraging Optimization Scheme for reliable breast cancer diagnosis

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813999200618143705 ◽

2020 ◽

Vol 13 ◽

Author(s):

S. Punitha ◽

A. Amuthan ◽

K. Suresh Joseph

Keyword(s):

Breast Cancer ◽

Cancer Detection ◽

Cancer Diagnosis ◽

Artificial Bee Colony ◽

Breast Cancer Diagnosis ◽

Bacterial Foraging Optimization ◽

Data Set ◽

Cancer Data ◽

Bacterial Foraging ◽

Bee Colony

Breast cancer must be detected at an early, localized stage to improve the chance of survival, since it is a major threat to women around the globe. Most intelligent approaches devised for breast cancer require expertise to reliably identify the patterns that indicate the presence of cancerous cells and to determine possible treatments that improve patients' chances of survival. Moreover, most existing schemes in the literature are labor- and time-intensive, which substantially increases the time needed to detect breast cancer cells. An Intelligent Artificial Bee Colony and Adaptive Bacterial Foraging Optimization (IABC-ABFO) scheme is proposed to improve local and global search ability when selecting the optimal feature subsets and the optimal parameters of the ANN used for breast cancer diagnosis. In the proposed IABC-ABFO approach, the traditional ABC algorithm used for cancer detection is improved by integrating an adaptive bacterial foraging process into the onlooker bee and employed bee phases, which improves the balance between exploitation and exploration. Experiments with the proposed IABC-ABFO approach on the Wisconsin breast cancer data set confirmed an enhanced mean classification accuracy of 99.52% when compared with the existing baseline cancer detection schemes.
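For orientation, the employed bee, onlooker bee and scout bee phases of a standard artificial bee colony search can be sketched as below. This is the plain textbook ABC on a toy objective, not the IABC-ABFO hybrid: the bacterial foraging step is not included, and the names `abc_minimize`, `sphere`, `n_sources` and `limit` are assumptions chosen for this illustration.

```python
import random

def sphere(x):
    """Toy objective (hypothetical stand-in for a classifier's error)."""
    return sum(v * v for v in x)

def abc_minimize(f, dim, bounds, n_sources=10, limit=20, iters=200, seed=0):
    rng = random.Random(seed)
    lo, hi = bounds
    new_source = lambda: [rng.uniform(lo, hi) for _ in range(dim)]
    sources = [new_source() for _ in range(n_sources)]
    costs = [f(s) for s in sources]
    trials = [0] * n_sources          # stagnation counters per food source
    best_cost = min(costs)
    best = list(sources[costs.index(best_cost)])

    def try_neighbour(i):
        # Perturb one coordinate relative to a randomly chosen other source.
        k = rng.choice([j for j in range(n_sources) if j != i])
        d = rng.randrange(dim)
        cand = list(sources[i])
        cand[d] += rng.uniform(-1, 1) * (cand[d] - sources[k][d])
        cand[d] = min(hi, max(lo, cand[d]))
        c = f(cand)
        if c < costs[i]:              # greedy selection
            sources[i], costs[i], trials[i] = cand, c, 0
        else:
            trials[i] += 1

    for _ in range(iters):
        for i in range(n_sources):    # employed bee phase
            try_neighbour(i)
        # Onlooker bee phase: better sources attract more visits.
        fitness = [1.0 / (1.0 + c) for c in costs]   # assumes costs >= 0
        for _ in range(n_sources):
            i = rng.choices(range(n_sources), weights=fitness)[0]
            try_neighbour(i)
        for i in range(n_sources):    # scout bee phase: abandon stale sources
            if trials[i] > limit:
                sources[i] = new_source()
                costs[i] = f(sources[i])
                trials[i] = 0
        i_best = min(range(n_sources), key=costs.__getitem__)
        if costs[i_best] < best_cost:
            best_cost, best = costs[i_best], list(sources[i_best])
    return best, best_cost

best, cost = abc_minimize(sphere, dim=2, bounds=(-5.0, 5.0))
print(best, cost)
```

In a feature-selection or ANN-tuning setting, `f` would be replaced by a validation error computed for the candidate feature subset or parameter vector.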

Breast cancer data classification using deep neural network

International Journal of Intelligent Systems Design and Computing ◽

10.1504/ijisdc.2020.10037864 ◽

2020 ◽

Vol 3 (2) ◽

pp. 133

Author(s):

Mihir Narayan Mohanty ◽

Vipul Sharma ◽

Saumendra Kumar Mohapatra

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Deep Neural Network ◽

Data Classification ◽

Breast Cancer Data ◽

Cancer Data

Predicting Breast Cancer Using Logistic Regression and Multi-Class Classifiers

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i4.20.22115 ◽

2018 ◽

Vol 7 (4.20) ◽

pp. 22 ◽

Cited By ~ 4

Author(s):

Jabeen Sultana ◽

Abdul Khader Jilani ◽

. .

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Logistic Regression ◽

Regression Method ◽

Breast Cancer Dataset ◽

Breast Cancer Data ◽

Data Set ◽

Cancer Data ◽

Logistic Regression Method ◽

Simple Logistic

Early identification and prediction of the type of cancer has become a necessity in cancer research, in order to assist and manage patients. The importance of classifying cancer patients into high- or low-risk groups has led many research teams, from the biomedical and bioinformatics fields, to study the application of machine learning (ML) approaches. Logistic regression and multi-class classifiers are proposed to predict breast cancer. This paper explores different classification-based data mining approaches that can be applied to breast cancer data to build deep predictions in a new environment. In addition, the study identifies the best-performing model by evaluating the data set on various classifiers. The breast cancer data set, collected from the UCI machine learning repository, has 569 instances with 31 attributes. The data set is first pre-processed and then fed to various classifiers: Simple Logistic regression, IBK, K-star, Multi-Layer Perceptron (MLP), Random Forest, Decision Table, Decision Trees (DT), PART, Multi-Class Classifiers and REP Tree. 10-fold cross-validation is applied so that new models are trained and tested. The results are evaluated on accuracy, RMSE, sensitivity, specificity, F-measure, ROC curve area, Kappa statistic and the time taken to build the model. The analysis shows that, among all the classifiers, Simple Logistic Regression yields the best model with the most accurate results, followed by IBK (nearest neighbor classifier), K-Star (instance-based classifier) and MLP (neural network). The other methods obtained lower accuracy than logistic regression.
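Several of the evaluation measures listed in this abstract follow directly from a binary confusion matrix. The sketch below shows the arithmetic for accuracy, sensitivity, specificity, F-measure and Cohen's kappa. The function name `binary_metrics` and the example counts are hypothetical; they are not results from the paper.

```python
def binary_metrics(tp, fp, fn, tn):
    """Compute common binary classification metrics from confusion-matrix counts."""
    n = tp + fp + fn + tn
    accuracy = (tp + tn) / n
    sensitivity = tp / (tp + fn)      # true positive rate (recall)
    specificity = tn / (tn + fp)      # true negative rate
    precision = tp / (tp + fp)
    f_measure = 2 * precision * sensitivity / (precision + sensitivity)
    # Cohen's kappa: observed agreement corrected for chance agreement p_e.
    p_yes = ((tp + fp) / n) * ((tp + fn) / n)
    p_no = ((fn + tn) / n) * ((fp + tn) / n)
    p_e = p_yes + p_no
    kappa = (accuracy - p_e) / (1 - p_e)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f_measure": f_measure,
        "kappa": kappa,
    }

# Made-up counts for illustration only.
metrics = binary_metrics(tp=40, fp=5, fn=10, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Under 10-fold cross-validation, counts like these would typically be accumulated over the ten held-out folds before the metrics are computed.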

A Markov chain model of a longitudinal breast cancer data set.

Journal of Clinical Oncology ◽

10.1200/jco.2014.32.15_suppl.11040 ◽

2014 ◽

Vol 32 (15_suppl) ◽

pp. 11040-11040

Author(s):

Paul K. Newton ◽

Jorge J. Nieva ◽

Peter Kuhn ◽

Larry Norton ◽

Elizabeth Anne Comen ◽

...

Keyword(s):

Breast Cancer ◽

Markov Chain ◽

Markov Chain Model ◽

Chain Model ◽

Breast Cancer Data ◽

Data Set ◽

Cancer Data

Supervised Learning Breast Cancer Data Set Analysis in MATLAB Using Novel SVM Classifier

Advances in Intelligent Systems and Computing - Machine Intelligence and Soft Computing ◽

10.1007/978-981-15-9516-5_22 ◽

2021 ◽

pp. 255-263

Author(s):

Prasanna Priya Golagani ◽

Tummala Sita Mahalakshmi ◽

Shaik Khasim Beebi

Keyword(s):

Breast Cancer ◽

Supervised Learning ◽

Svm Classifier ◽

Breast Cancer Data ◽

Data Set ◽

Cancer Data

Synthesising artificial patient-level data for Open Science - an evaluation of five methods

10.1101/2020.10.09.20210138 ◽

2020 ◽

Author(s):

Michael Allen ◽

Andrew Salmon

Keyword(s):

Breast Cancer ◽

Logistic Regression ◽

Synthetic Data ◽

Original Data ◽

Classification Model ◽

Data Sets ◽

List Type ◽

Breast Cancer Data ◽

Data Set ◽

Cancer Data

Background: Open science is a movement seeking to make scientific research accessible to all, including publication of code and data. Publishing patient-level data may, however, compromise the confidentiality of that data if there is any significant risk that data may later be associated with individuals. Use of synthetic data offers the potential to release data that may be used to evaluate methods or perform preliminary research without risk to patient confidentiality.

Methods: We have tested five synthetic data methods: (1) a technique based on Principal Component Analysis (PCA), which samples data from distributions derived from the transformed data; (2) Synthetic Minority Oversampling Technique (SMOTE), which is based on interpolation between near neighbours; (3) Generative Adversarial Network (GAN), an artificial neural network approach with competing networks: a discriminator network trained to distinguish between synthetic and real data, and a generator network trained to produce data that can fool the discriminator network; (4) CT-GAN, a refinement of GANs specifically for the production of structured tabular synthetic data; (5) Variational Auto Encoders (VAE), a method of encoding data in a reduced number of dimensions and sampling from distributions based on the encoded dimensions.

Two data sets are used to evaluate the methods: (1) the Wisconsin Breast Cancer data set, a histology data set where all features are continuous variables; (2) a stroke thrombolysis pathway data set, describing characteristics for patients where a decision is made whether to treat with clot-busting medication. Features are mostly categorical, binary, or integers.

Methods are evaluated in three ways: (1) the ability of synthetic data to train a logistic regression classification model; (2) a comparison of means and standard deviations between original and synthetic data; (3) a comparison of covariance between features in the original and synthetic data.

Results: Using the Wisconsin Breast Cancer data set, the original data gave 98% accuracy in a logistic regression classification model. Synthetic data sets gave between 93% and 99% accuracy. Performance (best to worst) was SMOTE > PCA > GAN > CT-GAN = VAE. All methods produced a high accuracy in reproducing original data means and standard deviations (all R-square > 0.96 for all methods and data classes). CT-GAN and VAE suffered a significant loss of covariance between features in the synthetic data sets.

Using the Stroke Pathway data set, the original data gave 82% accuracy in a logistic regression classification model. Synthetic data sets gave between 66% and 82% accuracy. Performance (best to worst) was SMOTE > PCA > CT-GAN > GAN > VAE. CT-GAN and VAE suffered loss of covariance between features in the synthetic data sets, though less pronounced than with the Wisconsin Breast Cancer data set.

Conclusions: The pilot work described here shows, as proof of concept, that synthetic data may be produced which is of sufficient quality to publish with open methodology, to allow people to better understand and test methodology. The quality of the synthetic data also gives promise of data sets that may be used for screening of ideas, or for research projects (perhaps especially in an education setting). More work is required to further refine and test methods across a broader range of patient-level data sets.
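The SMOTE idea mentioned in this abstract, interpolation between near neighbours, can be illustrated with a short sketch. This is a simplified version written for illustration, not the authors' code or a library implementation: `smote_like`, its parameters, and the toy samples are all assumptions.

```python
import random

def smote_like(samples, n_new, k=2, seed=0):
    """Create n_new synthetic points, each lying between a sample and a near neighbour."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(samples)
        # Rank the other samples by squared Euclidean distance to the base point.
        others = [s for s in samples if s is not base]
        others.sort(key=lambda s: sum((a - b) ** 2 for a, b in zip(s, base)))
        neighbour = rng.choice(others[:k])   # one of the k nearest neighbours
        gap = rng.random()                   # interpolation factor in [0, 1)
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic

# Toy continuous features (hypothetical, not from either data set).
samples = [[1.0, 2.0], [1.5, 2.5], [2.0, 2.0], [1.2, 1.8]]
for point in smote_like(samples, n_new=3):
    print(point)
```

Because every synthetic point lies on a line segment between two real points, this kind of interpolation suits continuous features; categorical or binary features, as in the stroke pathway data set, would need separate handling.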

Breast cancer data classification using deep neural network

International Journal of Intelligent Systems Design and Computing ◽

10.1504/ijisdc.2020.115169 ◽

2020 ◽

Vol 3 (2) ◽

pp. 133

Author(s):

Vipul Sharma ◽

Saumendra Kumar Mohapatra ◽

Mihir Narayan Mohanty

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Deep Neural Network ◽

Data Classification ◽

Breast Cancer Data ◽

Cancer Data

A Comparative Analysis of Breast Cancer Data Set Using Different Classification Methods

Smart Intelligent Computing and Applications - Smart Innovation, Systems and Technologies ◽

10.1007/978-981-13-1921-1_17 ◽

2018 ◽

pp. 175-181 ◽

Cited By ~ 2

Author(s):

M. Navya Sri ◽

J. S. V. S. Hari Priyanka ◽

D. Sailaja ◽

M. Ramakrishna Murthy

Keyword(s):

Breast Cancer ◽

Comparative Analysis ◽

Classification Methods ◽

Breast Cancer Data ◽

Data Set ◽

Cancer Data

Detection of Breast Cancer Using Machine Learning Support Vector Machine Algorithm

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2019.7747 ◽

2019 ◽

Vol 16 (2) ◽

pp. 441-444

Author(s):

D. V. Soundari ◽

R. Padmapriya ◽

C. Thirumariselvi ◽

N. Nanthini ◽

K. Priyadharsini

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Support Vector Machine ◽

Support Vector ◽

Learning Support ◽

Support Vector Machine Algorithm ◽

Breast Cancer Data ◽

Data Set ◽

Cancer Data ◽

Hormone Imbalance

Women suffer greatly from breast cancer, which is associated with hormone imbalance, and it has caused a large number of deaths in recent years. Early detection of breast cancer is important for saving lives. Image processing plays an important role in classifying and detecting it. This paper therefore proposes machine learning based cancer classification using a support vector machine with the Wisconsin breast cancer data set.

Breast Cancer Data Prediction by Dimensionality Reduction Using PCA and Adaptive Neuro Evolution

International Journal of Information Systems and Social Change ◽

10.4018/jissc.2012010101 ◽

2012 ◽

Vol 3 (1) ◽

pp. 1-9

Author(s):

R. R. Janghel ◽

Ritu Tiwari ◽

Rahul Kala ◽

Anupam Shukla

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Dimensionality Reduction ◽

Principal Component ◽

Cancer Disease ◽

Data Set ◽

Computing Technique ◽

Cancer Data ◽

Evolutionary Neural Network ◽

Hidden Layer

This paper presents a new approach to breast cancer prediction: the features of the data set are reduced using principal component analysis (PCA), and predictions are obtained by simulating different models, namely SANE (Symbiotic, Adaptive Neuro-evolution), a modular neural network, a fixed architecture evolutionary neural network (F-ENN), and a variable architecture evolutionary neural network (V-ENN). PCA reduces the dimensionality of the inputs by 33%, and the different soft computing models are then simulated and tested for efficiency to find the optimum model. The SANE model uses a maximum of 24 connections per neuron, an evolutionary population size of 1000, a maximum of 12 neurons in the hidden layer, a SANE elite value of 200, a mutation rate of 0.2, and 100 generations. The simulated results show that SANE is the best model for predicting breast cancer among those considered in the experiment, and that it can effectively assist doctors in diagnosis, as its accuracy of 98.52% is the highest.
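As a rough illustration of the PCA step, the sketch below finds the first principal component of a small data matrix by power iteration on the covariance matrix and projects the data onto it. The abstract does not describe how PCA was computed, so the power-iteration approach, the function name `top_principal_component`, and the toy data are assumptions.

```python
import math

def top_principal_component(rows, iters=200):
    """Return the leading principal direction and the 1-D projection of each row."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centred = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix of the centred data.
    cov = [[sum(c[i] * c[j] for c in centred) / (n - 1) for j in range(d)]
           for i in range(d)]
    # Power iteration: repeated multiplication converges to the top eigenvector.
    v = [1.0] * d
    for _ in range(iters):
        w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    scores = [sum(c[j] * v[j] for j in range(d)) for c in centred]
    return v, scores

# Toy data lying on the line y = 2x (hypothetical, for illustration only).
rows = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0]]
direction, scores = top_principal_component(rows)
print(direction, scores)
```

Here two correlated features collapse to a single score per row; keeping only the leading components in this way is what reduces the number of inputs passed to the downstream network.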
