Early Breast Cancer Prediction Using Dermatoglyphics: Data Mining Pilot Study in a General Hospital in Iran (Preprint)

BACKGROUND Dermatoglyphics is the study of skin patterns on hands and feet. It has been shown in some studies that specific finger patterns could be a risk factor of breast cancer. There are several studies using data mining methods to evaluate the risk of breast cancer; while there is no or little study that evaluates finger patterns with data mining for breast cancer risk prediction. OBJECTIVE This study aims to evaluate fingerprint patterns along with other easy-to-obtain features in the risk of breast cancer. METHODS A dataset containing 462 records includes female patients in Imam Khomeini Hospital Complex, Tehran, Iran was obtained. The dataset has comprised of age, menstruation age, menopause age, and situation, has a child, age at first live birth, family history of breast cancer, and figure print patterns features of hands. The factors weight was determined by the Information Gain index. Predictive models were built once without fingerprint features and once with fingerprint features using Naïve Bayes, Decision Tree, Random Forest (RF), Support Vector Machine (SVM), and Deep Learning classifiers. RESULTS The most important factor determining breast cancer were age, having a child, menopause situation, and menopause age. The best performance belongs to the RF model with accuracy and AUC of 84.43% and 0.923 respectively. The fingerprint patterns feature increased the RF accuracy from 79.44% to 84.43%. CONCLUSIONS An early breast cancer screening model could be built with the use of data mining methods. The fingerprint patterns could increase the performance of these models. The Random Forest model could be used. The results of such models could be used in designing apps for self-screening breast cancer.

Download Full-text

A data mining approach to the diagnosis of failure modes for two serial fastened sandwich composite plates

Journal of Composite Materials ◽

10.1177/0021998316679720 ◽

2016 ◽

Vol 51 (20) ◽

pp. 2853-2862 ◽

Cited By ~ 2

Author(s):

Serkan Ballı

Keyword(s):

Data Mining ◽

Random Forest ◽

Failure Modes ◽

Composite Plates ◽

Study Data ◽

Sandwich Composite ◽

Support Vector ◽

Geometrical Parameters ◽

Mining Methods

The aim of this study is to diagnose and classify the failure modes for two serial fastened sandwich composite plates using data mining techniques. The composite material used in the study was manufactured using glass fiber reinforced layer and aluminum sheets. Obtained results of previous experimental study for sandwich composite plates, which were mechanically fastened with two serial pins or bolts were used for classification of failure modes. Furthermore, experimental data from previous study consists of different geometrical parameters for various applied preload moments as 0 (pinned), 2, 3, 4, and 5 Nm (bolted). In this study, data mining methods were applied by using these geometrical parameters and pinned/bolted joint configurations. Therefore, three geometrical parameters and 100 test data were used for classification by utilizing support vector machine, Naive Bayes, K-Nearest Neighbors, Logistic Regression, and Random Forest methods. According to experiments, Random Forest method achieved better results than others and it was appropriate for diagnosing and classification of the failure modes. Performances of all data mining methods used were discussed in terms of accuracy and error ratios.

Download Full-text

Analysis of prognostic factors on overall survival in elderly women treated for early breast cancer using data mining and machine learning

Annals of Oncology ◽

10.1093/annonc/mdz257.022 ◽

2019 ◽

Vol 30 ◽

pp. v580-v581

Author(s):

P. Heudel ◽

D. Hooijenga ◽

R. Phan ◽

V. Augusto ◽

X. Xie ◽

...

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Data Mining ◽

Overall Survival ◽

Prognostic Factors ◽

Early Breast Cancer ◽

Elderly Women ◽

Using Data

Download Full-text

Prognosis on Stratification of Breast Cancer using Data Mining Models

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.f3406.049620 ◽

2020 ◽

Vol 9 (6) ◽

pp. 650-653

Keyword(s):

Breast Cancer ◽

Data Mining ◽

Random Forest ◽

Prediction Model ◽

Prediction Models ◽

Sequential Minimal Optimization ◽

Fine Needle Aspirate ◽

Fine Needle ◽

Physical Attributes ◽

Using Data

Breast cancer classification can be useful for discovering the genetic behavior of tumors and envision the outcome of some diseases. Through this paper we are predicting the noxious behavior of a tumor. The prediction models used are Random Forest, Naïve Bayes, IBK (Instance Based Learner), SMO (Sequential minimal optimization), and Multi Class Classifier. This prediction model which can potentially be used as a biomarker of breast cancer is based on physical attributes of a breast mass and which is gathered from digitized image of Fine Needle Aspirate (FNA). These can be helpful in prediction and reduction of invasive tumors

Download Full-text

Predication of Parkinson′s disease using data mining methods: A comparative analysis of tree, statistical, and support vector machine classifiers

Indian Journal of Medical Sciences ◽

10.4103/0019-5359.107023 ◽

2011 ◽

Vol 65 (6) ◽

pp. 231 ◽

Cited By ~ 7

Author(s):

Yugal Kumar ◽

Gadadhar Sahoo ◽

Geeta Yadav

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Comparative Analysis ◽

Support Vector ◽

Mining Methods ◽

Using Data

Download Full-text

Estimating Daily Reference Evapotranspiration using Data Mining Methods of Support Vector Regression and M5 Model Tree

Journal of watershed management research ◽

10.29252/jwmr.9.18.157 ◽

2019 ◽

Vol 9 (18) ◽

pp. 157-167

Author(s):

Saeed Samadianfard ◽

Solmaz Panahi ◽

◽

Keyword(s):

Data Mining ◽

Support Vector Regression ◽

Reference Evapotranspiration ◽

Support Vector ◽

M5 Model Tree ◽

Model Tree ◽

Mining Methods ◽

Using Data ◽

M5 Model

Download Full-text

Proximate Breast Cancer Factors Using Data Mining Classification Techniques

International Journal of Big Data and Analytics in Healthcare ◽

10.4018/ijbdah.2019010104 ◽

2019 ◽

Vol 4 (1) ◽

pp. 47-56

Author(s):

Alice Constance Mensah ◽

Isaac Ofori Asare

Keyword(s):

Breast Cancer ◽

Data Mining ◽

Random Forest ◽

Cancer Patients ◽

Classification Model ◽

Tree Model ◽

Cancer Data ◽

Data Mining Approach ◽

Learning Techniques ◽

Using Data

Breast cancer is the most common of all cancers and is the leading cause of cancer deaths in women worldwide. The classification of breast cancer data can be useful to predict the outcome of some diseases or discover the genetic behavior of tumors. Data mining technology helps in classifying cancer patients and this technique helps to identify potential cancer patients by simply analyzing the data. This study examines the determinant factors of breast cancer and measures the breast cancer patient data to build a useful classification model using a data mining approach. In this study of 2397 women, 1022 (42.64%) were diagnosed with breast cancer. Among the four main learning techniques such as: Random Forest, Naive Bayes, Classification and Regression Model (CART), and Boosted Tree model were used for the study. The Random Forest technique had the better accuracy value of 0.9892(95%CI,0.9832 -0.9935) and a sensitivity value of about 92%. This means that the Random Forest learning model is the best model to classify and predict breast cancer based on associated factors.

Download Full-text

Predication of Parkinson's disease using data mining methods: A comparative analysis of tree, statistical and support vector machine classifiers

2012 NATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION SYSTEMS ◽

10.1109/ncccs.2012.6413034 ◽

2012 ◽

Cited By ~ 13

Author(s):

Geeta Yadav ◽

Yugal Kumar ◽

G. Sahoo

Keyword(s):

Data Mining ◽

Parkinson’S Disease ◽

Support Vector Machine ◽

Parkinson's Disease ◽

Comparative Analysis ◽

Support Vector ◽

Mining Methods ◽

Using Data

Download Full-text

A review of data mining methods in financial markets

Data Science in Finance and Economics ◽

10.3934/dsfe.2021020 ◽

2021 ◽

Vol 1 (4) ◽

pp. 362-392

Author(s):

Haihua Liu ◽

◽

Shan Huang ◽

Peng Wang ◽

Zejun Li ◽

...

Keyword(s):

Data Mining ◽

Data Analysis ◽

Financial Markets ◽

Social Life ◽

Development Trend ◽

Financial Data ◽

Support Vector ◽

Mining Methods ◽

Financial Data Analysis ◽

Using Data

<abstract><p>Financial activities are closely related to human social life. Data mining plays an important role in the analysis and prediction of financial markets, especially in the context of the current era of big data. However, it is not simple to use data mining methods in the process of analyzing financial data, due to the differences in the background of researchers in different disciplines. This review summarizes several commonly used data mining methods in financial data analysis. The purpose is to make it easier for researchers in the financial field to use data mining methods and to expand the application scenarios of it used by researchers in the computer field. This review introduces the principles and steps of decision trees, support vector machines, Bayesian, K-nearest neighbors, k-means, Expectation-maximization algorithm, and ensemble learning, and points out their advantages, disadvantages and applicable scenarios. After introducing the algorithms, it summarizes the use of the algorithm in the process of financial data analysis, hoping that readers can get specific examples of using the algorithm. In this review, the difficulties and countermeasures of using data mining methods are summarized, and the development trend of using data mining methods to analyze financial data is predicted.</p></abstract>

Download Full-text

Analysis of Heart Disorder by Using Machine Learning Methods and Data Mining Techniques

Deep Learning Applications and Intelligent Decision Making in Engineering - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-7998-2108-3.ch009 ◽

2021 ◽

pp. 212-221

Author(s):

Sarangam Kodati ◽

Jeeva Selvaraj

Keyword(s):

Machine Learning ◽

Data Mining ◽

Support Vector Machine ◽

Heart Disease ◽

Heart Diseases ◽

Support Vector ◽

Primary Reason ◽

The World ◽

Mining Methods ◽

Using Data

Data mining is the most famous knowledge extraction approach for knowledge discovery from data (KDD). Machine learning is used to enable a program to analyze data, recognize correlations, and make usage on insights to solve issues and/or enrich data and because of prediction. The chapter highlights the need for more research within the usage of robust data mining methods in imitation of help healthcare specialists between the diagnosis regarding heart diseases and other debilitating disease conditions. Heart disease is the primary reason of death of people in the world. Nearly 47% of death is caused by heart disease. The authors use algorithms including random forest, naïve Bayes, support vector machine to analyze heart disease. Accuracy on the prediction stage is high when using a greater number of attributes. The goal is to function predictive evaluation using data mining, using data mining to analyze heart disease, and show which methods are effective and efficient.

Download Full-text

Breast Cancer Diagnosis Using Data Mining Methods, Cumulative Histogram Features, and Gary Level Co-occurrence Matrix

Current Medical Imaging Formerly Current Medical Imaging Reviews ◽

10.2174/1573405613666161227162918 ◽

2017 ◽

Vol 13 (4) ◽

Author(s):

Asgarali Bouyer

Keyword(s):

Breast Cancer ◽

Data Mining ◽

Cancer Diagnosis ◽

Breast Cancer Diagnosis ◽

Mining Methods ◽

Using Data ◽

Occurrence Matrix

Download Full-text