The Discovery of Normality of Body Weight Using Principal Component Analysis: A Comparative Study on Machine Learning Techniques Using Different Data Pre-Processing Methods

Bioclimatic classifications seek to divide a study region into geographic areas with similar bioclimatic characteristics. In this study, we proposed two bioclimatic classifications for Colombia using machine learning techniques. We first characterized the precipitation space of Colombia using principal component analysis. Based on the Lang classification, we then projected all background sites into the precipitation space with their corresponding categories. We sequentially fit logistic regression models to reclassify all background sites in the precipitation space into six redefined Lang categories. The new categories were then used to define new modified Lang and Caldas-Lang classifications.
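The principal component analysis step mentioned in this abstract can be illustrated with a minimal sketch. It assumes NumPy is available and uses randomly generated placeholder data; the function name `pca_project` and the array shapes are hypothetical and are not taken from the study.

```python
import numpy as np

def pca_project(X, n_components=2):
    """Project the rows of X onto the leading principal components."""
    X = np.asarray(X, dtype=float)
    # Center each variable so the components describe variance around the mean.
    Xc = X - X.mean(axis=0)
    # SVD of the centered data: the rows of Vt are the principal axes.
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:n_components]
    scores = Xc @ components.T
    # Share of total variance carried by each retained component.
    explained = (S ** 2) / np.sum(S ** 2)
    return scores, components, explained[:n_components]

# Placeholder data (assumption): 6 sites described by 4 variables.
rng = np.random.default_rng(0)
X = rng.normal(size=(6, 4))
scores, components, ratio = pca_project(X, n_components=2)
```

Each row of `scores` gives a site's coordinates in the reduced space, which is where a classifier such as logistic regression could then be fitted.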


AN EXPERIMENTAL REVIEW ON EFFECT OF PRINCIPAL COMPONENT ANALYSIS ON MACHINE LEARNING TECHNIQUES

International Journal of Engineering Applied Sciences and Technology ◽

10.33564/ijeast.2020.v05i01.044 ◽

2020 ◽

Vol 5 (1) ◽

pp. 290-296

Author(s):

Hari Krishna Modalavalasa ◽

Madhavi Latha Makkena

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Techniques ◽

Learning Techniques


Comparative study of Principal Component Analysis based Intrusion Detection approach using machine learning algorithms

2015 3rd International Conference on Signal Processing, Communication and Networking (ICSCN) ◽

10.1109/icscn.2015.7219853 ◽

2015 ◽

Cited By ~ 5

Author(s):

Krupa Joel Chabathula ◽

C. D. Jaidhar ◽

M. A. Ajay Kumara

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Intrusion Detection ◽

Comparative Study ◽

Learning Algorithms ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Algorithms ◽

Detection Approach


Airbnb (Air Bed and Breakfast) Listing Analysis Through Machine Learning Techniques

10.4018/978-1-7998-8455-2.ch008 ◽

2022 ◽

pp. 209-232

Author(s):

Xiang Li ◽

Jingxi Liao ◽

Tianchuan Gao

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Data Science ◽

Principal Component ◽

Machine Learning Techniques ◽

Classification Models ◽

Performance Measurements ◽

Learning Techniques ◽

Source Data ◽

Bed And Breakfast

Machine learning is a broad field that spans multiple disciplines, including mathematics, computer science, and data science. Some of its concepts, like deep neural networks, can be complicated and difficult to explain in a few words. This chapter focuses on essential methods, such as classification from supervised learning, clustering, and dimensionality reduction, that can be easily interpreted and explained in a way accessible to beginners. In this chapter, data for Airbnb (Air Bed and Breakfast) listings in London are used as the source data to study the effect of each machine learning technique. K-means clustering, principal component analysis (PCA), random forest, and other methods are used to build classification models from the features, predict the classification results, and provide performance measurements to test the models.


Data Mining and Principal Component Analysis on Coimbra Breast Cancer Dataset

Proceedings of Intelligent Computing and Technologies Conference ◽

10.21467/proceedings.115.5 ◽

2021 ◽

Author(s):

Anupam Sen

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Principal Component Analysis ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Breast Cancer Dataset ◽

Analysis Tool ◽

Machine Learning Classification

Machine learning (ML) techniques play an important role in the medical field. Early diagnosis is required to improve the treatment of carcinoma. In this analysis, the Breast Cancer Coimbra dataset (BCCD), with ten predictors, is analyzed to classify carcinoma. Feature selection methods and machine learning algorithms are applied to the dataset, which comes from the UCI repository. The WEKA (“Waikato Environment for Knowledge Analysis”) tool is used for the machine learning techniques, and Principal Component Analysis (PCA) is used for feature extraction. Different machine learning classification algorithms are applied through WEKA, such as Glmnet, Gbm, ada Boosting, Adabag Boosting, C50, Cforest, DcSVM, fnn, Ksvm, and Node Harvest. The paper compares their accuracy as well as the Kappa statistic, Mean Absolute Error (MAE), and Root Mean Square Error (RMSE). The 10-fold cross-validation method is used for training, testing, and validation.
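The evaluation measures named in this abstract can be sketched in plain Python. This is an illustrative sketch, not the WEKA implementation: the function names, the unshuffled contiguous fold split, and the example labels are all assumptions made here.

```python
import math
from collections import Counter

def cohen_kappa(y_true, y_pred):
    """Kappa statistic: agreement between labels, corrected for chance."""
    n = len(y_true)
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if expected == 1.0:
        # Kappa is undefined here; 0.0 is a convention chosen for this sketch.
        return 0.0
    return (observed - expected) / (1.0 - expected)

def mean_absolute_error(y_true, y_pred):
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def root_mean_square_error(y_true, y_pred):
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def k_fold_indices(n_samples, k=10):
    """Yield (train, test) index lists for k contiguous folds, no shuffling."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size

# Hypothetical binary labels, for illustration only.
y_true = [1, 1, 0, 0, 1, 0, 1, 0]
y_pred = [1, 0, 0, 0, 1, 1, 1, 0]
kappa = cohen_kappa(y_true, y_pred)
mae = mean_absolute_error(y_true, y_pred)
rmse = root_mean_square_error(y_true, y_pred)
folds = list(k_fold_indices(25, k=10))
```

In practice the data would usually be shuffled (and often stratified by class) before splitting; the contiguous split is kept here only for brevity.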


Classification of Observations through Combination of the Dimension Reduction and the Cluster Analysis

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v7i8.13 ◽

2017 ◽

Vol 7 (8) ◽

pp. 30

Author(s):

Hyeuk Kim

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Cluster Analysis ◽

Unsupervised Learning ◽

Principal Component ◽

Component Analysis ◽

Baseball Players ◽

Partitioning Around Medoids ◽

Different Characteristics

Unsupervised learning in machine learning divides data into several groups. Observations in the same group have similar characteristics, and observations in different groups have different characteristics. In this paper, we classify data by partitioning around medoids, which has some advantages over k-means clustering. We apply it to baseball players in the Korea Baseball League. We also apply principal component analysis to the data and draw a graph using two components as the axes. Through this procedure, we interpret the meaning of the clustering graphically. The combination of partitioning around medoids and principal component analysis can be applied to any other data, and the approach makes it easy to identify the characteristics of each group.
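Partitioning around medoids can be sketched as a greedy swap search: a medoid is replaced by a non-medoid point whenever the swap lowers the total distance from points to their nearest medoid. The sketch below is a simplified illustration, not the paper's procedure. It assumes a naive initialization (the first k points) rather than PAM's BUILD phase, and the six 2-D points are hypothetical stand-ins for scores on two principal components.

```python
import math

def total_cost(points, medoids, dist):
    """Sum of distances from every point to its nearest medoid."""
    return sum(min(dist(p, points[m]) for m in medoids) for p in points)

def pam(points, k, dist=math.dist):
    # Naive initialization (assumption): use the first k points as medoids.
    medoids = list(range(k))
    best = total_cost(points, medoids, dist)
    improved = True
    while improved:
        improved = False
        for i in range(k):
            for candidate in range(len(points)):
                if candidate in medoids:
                    continue
                # Try swapping medoid i for the candidate point.
                trial = medoids[:i] + [candidate] + medoids[i + 1:]
                cost = total_cost(points, trial, dist)
                if cost < best:
                    medoids, best, improved = trial, cost, True
    # Assign each point to the cluster of its nearest medoid.
    labels = [min(range(k), key=lambda j: dist(p, points[medoids[j]]))
              for p in points]
    return medoids, labels, best

# Hypothetical 2-D points forming two well-separated groups.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
medoids, labels, cost = pam(points, k=2)
```

Because medoids are actual observations, each cluster can be summarized by a real data point, which is one of the practical differences from k-means centroids.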
