Data Mining and Principal Component Analysis on Coimbra Breast Cancer Dataset

Principal Component ◽

Component Analysis ◽

Breast Cancer Dataset ◽

Analysis Tool ◽

Machine Learning Classification

Machine Learning (ML) techniques play an important role in the medical field. Early diagnosis is required to improve the treatment of carcinoma. During this analysis Breast Cancer Coimbra dataset (BCCD) with ten predictors are analyzed to classify carcinoma. In this paper method for feature selection and Machine learning algorithms are applied to the dataset from the UCI repository. WEKA (“Waikato Environment for Knowledge Analysis”) tool is used for machine learning techniques. In this paper Principal Component Analysis (PCA) is used for feature extraction. Different Machine Learning classification algorithms are applied through WEKA such as Glmnet, Gbm, ada Boosting, Adabag Boosting, C50, Cforest, DcSVM, fnn, Ksvm, Node Harvest compares the accuracy and also compare values such as Kappa statistic, Mean Absolute Error (MAE), Root Mean Square Error (RMSE). Here the 10-fold cross validation method is used for training, testing and validation purposes.

Comparative Analysis of Machine Learning Techniques with Principal Component Analysis on Kidney and Heart Disease

10.1109/icesc51422.2021.9533011 ◽

2021 ◽

Author(s):

Reena Chandra ◽

Manoj Kapil ◽

Avinash Sharma

Keyword(s):

Machine Learning ◽

Heart Disease ◽

Comparative Analysis ◽

Principal Component ◽

Component Analysis ◽

Dimensionality Reduction using PCA and K-Means Clustering for Breast Cancer Prediction

Lontar Komputer Jurnal Ilmiah Teknologi Informasi ◽

10.24843/lkjiti.2018.v09.i03.p08 ◽

2018 ◽

pp. 192 ◽

Cited By ~ 2

Author(s):

Ade Jamal ◽

Annisa Handayani ◽

Ali Akbar Septiandri ◽

Endang Ripmiatin ◽

Yunus Effendi

Keyword(s):

Breast Cancer ◽

Dimensionality Reduction ◽

Principal Component ◽

Component Analysis ◽

Gradient Boosting ◽

Support Vector ◽

Breast Cancer Dataset ◽

Cancer Prediction ◽

Extreme Gradient Boosting

Breast cancer is the most important cause of death among women. A prediction of breast cancer in early stage provides a greater possibility of its cure. It needs a breast cancer prediction tool that can classify a breast tumor whether it was a harmful malignant tumor or un-harmful benign tumor. In this paper, two algorithms of machine learning, namely Support Vector Machine and Extreme Gradient Boosting technique will be compared for classification purpose. Prior to the classification, the number of data attribute will be reduced from the raw data by extracting features using Principal Component Analysis. A clustering method, namely K-Means is also used for dimensionality reduction besides the Principal Component Analysis. This paper will present a comparison among four models based on two dimensionality reduction methods combined with two classifiers which applied on Wisconsin Breast Cancer Dataset. The comparison will be measured by using accuracy, sensitivity and specificity metrics evaluated from the confusion matrices. The experimental results have indicated that the K-Means method, which is not usually used for dimensionality reduction can perform well compared to the popular Principal Component Analysis.

2015 3rd International Conference on Signal Processing, Communication and Networking (ICSCN) ◽

Comparative study of Principal Component Analysis based Intrusion Detection approach using machine learning algorithms

10.1109/icscn.2015.7219853 ◽

2015 ◽

Cited By ~ 5

Author(s):

Krupa Joel Chabathula ◽

C. D. Jaidhar ◽

M. A. Ajay Kumara

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Comparative Study ◽

Learning Algorithms ◽

Principal Component ◽

Component Analysis ◽

Detection Approach

Gap‐filling approaches for eddy covariance methane fluxes: A comparison of three machine learning algorithms and a traditional method with principal component analysis

Global Change Biology ◽

10.1111/gcb.14845 ◽

2019 ◽

Vol 26 (3) ◽

pp. 1499-1518 ◽

Cited By ~ 12

Author(s):

Yeonuk Kim ◽

Mark S. Johnson ◽

Sara H. Knox ◽

T. Andrew Black ◽

Higo J. Dalmagro ◽

...

Keyword(s):

Machine Learning ◽

Eddy Covariance ◽

Traditional Method ◽

Learning Algorithms ◽

Principal Component ◽

Component Analysis ◽

Gap Filling ◽

Methane Fluxes

Alzheimer's disease prediction using machine learning techniques and principal component analysis (PCA)

Materials Today Proceedings ◽

10.1016/j.matpr.2021.03.061 ◽

2021 ◽

Author(s):

M. Sudharsan ◽

G. Thailambal

Keyword(s):

Machine Learning ◽

Alzheimer’S Disease ◽

Alzheimer's Disease ◽

Principal Component ◽

Component Analysis ◽

Disease Prediction ◽

Hand Gesture recognition and classification by Discriminant and Principal Component Analysis using Machine Learning techniques

INTERNATIONAL JOURNAL OF ADVANCED RESEARCH IN ARTIFICIAL INTELLIGENCE ◽

10.14569/ijarai.2012.010908 ◽

2012 ◽

Vol 1 (9) ◽

Author(s):

Sauvik Das ◽

Souvik Kundu ◽

Rick Pandey ◽

Rahul Ghosh ◽

Rajesh Bag ◽

...

Keyword(s):

Machine Learning ◽

Gesture Recognition ◽

Principal Component ◽

Component Analysis ◽

Hand Gesture Recognition ◽

Hand Gesture ◽

The Discovery of Normality of Body Weight Using Principal Component Analysis: A Comparative Study on Machine Learning Techniques Using Different Data Pre-Processing Methods

International Journal of Knowledge Engineering and Data Mining ◽

10.1504/ijkedm.2019.10018092 ◽

2019 ◽

Vol 6 (1) ◽

pp. 1

Author(s):

Meharunnisa M ◽

Madasamy Sornam

Keyword(s):

Machine Learning ◽

Body Weight ◽

Comparative Study ◽

Principal Component ◽

Component Analysis ◽

Processing Methods ◽

Algorithms for Intelligent Systems - Applications of Artificial Intelligence in Engineering ◽

Churn Prediction in Telecom Industry Using Machine Learning Algorithms with K-Best and Principal Component Analysis

10.1007/978-981-33-4604-8_40 ◽

2021 ◽

pp. 499-507

Author(s):

K. V. Anjana ◽

Siddhaling Urolagin

Keyword(s):

Machine Learning ◽

Learning Algorithms ◽

Principal Component ◽

Component Analysis ◽

Churn Prediction ◽

Telecom Industry

Analysis of Supervised Machine Learning Algorithms for Heart Disease Prediction with Reduced Number of Attributes using Principal Component Analysis

International Journal of Computer Applications ◽

10.5120/ijca2016909231 ◽

2016 ◽

Vol 140 (2) ◽

pp. 27-31 ◽

Cited By ~ 1

Author(s):

Ayon Dey ◽

Jyoti Singh ◽

Neeta Singh

Keyword(s):

Machine Learning ◽

Heart Disease ◽

Learning Algorithms ◽

Principal Component ◽

Component Analysis ◽

Supervised Machine Learning ◽

Disease Prediction

Machine learning techniques to derive bioclimatic classifications for Colombia

10.1101/2021.09.05.459033 ◽

2021 ◽

Author(s):

Richard Rios ◽

Elkin A. Noguera-Urbano ◽

Jairo Espinosa ◽

Jose Manuael Ochoa

Keyword(s):

Machine Learning ◽

Logistic Regression ◽

Regression Models ◽

Principal Component ◽

Component Analysis ◽

Study Region ◽

Logistic Regression Models ◽

Bioclimatic classifications seek to divide a study region into geographic areas with similar bioclimatic characteristics. In this study we proposed two bioclimatic classifications for Colombia using machine learning techniques. We firstly characterized the precipitation space of Colombia using principal component analysis. Based on Lang classification, we then projected all background sites in the precipitation space with their corresponding categories. We sequentially fit logistic regression models to re-classify all background sites in the precipitation space with six redefined Lang categories. New categories were the used to define a new modified Lang and Caldas-Lang classifications.