Osteoporosis Risk Prediction Using Data Mining Algorithms

Journal of Community Health Research, 2020. DOI: 10.18502/jchr.v9i2.3401

Author(s): Efat Jabarpour, Amin Abedini, Abbasali Keshtkar

Keyword(s): Data Mining, Personal Information, Disease Diagnosis, Support Vector, Data Mining Algorithms, Industry Standard, Disease Information, Increased Risk, Using Data, Mining Algorithms

Introduction: Osteoporosis is a disease that reduces bone density and degrades the quality of bone microstructure, leading to an increased risk of fractures. It is one of the major causes of disability and death in elderly people. The current study aims to determine the factors influencing the incidence of osteoporosis and to provide a predictive model for diagnosing the disease, in order to increase diagnostic speed and reduce diagnostic costs. Methods: Individuals' data, including personal information, lifestyle, and disease information, were reviewed. A new model is presented based on the Cross-Industry Standard Process (CRISP) methodology. Support Vector Machine (SVM) and a Bayesian method, Tree Augmented Naïve Bayes (TAN), were applied, with Clementine12 used as the data mining tool. Results: Several features were found to affect the disease, and rules were extracted that can serve as a pattern for predicting a patient's status. Classification precision was 88.39% for SVM and 91.29% for TAN; the precision of TAN was higher than that of the other methods. Conclusion: The most influential factors in osteoporosis were identified and can be applied to a new sample with defined characteristics to predict the likelihood of osteoporosis in a person.

Data mining, fuzzy AHP and TOPSIS for optimizing taxpayer supervision

Indonesian Journal of Electrical Engineering and Computer Science, 2020, Vol 18 (1), pp. 75. DOI: 10.11591/ijeecs.v18.i1.pp75-87

Author(s): M. Jupri, Riyanarto Sarno

Keyword(s): Data Mining, Nearest Neighbor, Fuzzy AHP, Support Vector, K Nearest Neighbor, Data Mining Algorithms, Using Data, Time Required, Mining Algorithms

Achieving optimal tax revenue requires effective and efficient tax supervision, which can be supported by classifying taxpayers according to their compliance with tax regulations. This paper proposes classifying taxpayer compliance from taxpayer compliance data using data mining algorithms, namely C4.5, Support Vector Machine, K-Nearest Neighbor, Naive Bayes, and Multilayer Perceptron (MLP). Taxpayer compliance is classified into four classes: (1) formal and material compliant taxpayers, (2) formal compliant taxpayers, (3) material compliant taxpayers, and (4) formal and material non-compliant taxpayers. The results of the data mining algorithms are then compared using Fuzzy AHP and TOPSIS to determine the best-performing classifier on the criteria of Accuracy, F-Score, and Time required. The priority of taxpayers for more detailed supervision at each level of compliance is also ranked using Fuzzy AHP and TOPSIS, based on criteria drawn from the dataset variables. The results show that C4.5 is the best-performing classifier, with a preference value of 0.998, whereas the MLP algorithm yields the lowest preference value of 0.131. Alternative taxpayer A233 is the top-priority taxpayer with a preference value of 0.433, whereas alternative taxpayer A051 is the lowest-priority taxpayer with a preference value of 0.036.
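
The TOPSIS ranking step described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the decision matrix, the crisp weights (the paper derives its weights with Fuzzy AHP) and the function name `topsis` are all assumptions made for illustration, and the numbers are invented rather than taken from the paper.

```python
import math

def topsis(matrix, weights, benefit):
    """Return a TOPSIS closeness score in [0, 1] for each alternative (row)."""
    n_crit = len(weights)
    # Vector-normalise each criterion column, then apply the criterion weight.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_crit)]
    v = [[weights[j] * row[j] / norms[j] for j in range(n_crit)] for row in matrix]
    # Ideal point: max of benefit criteria, min of cost criteria (anti-ideal is the reverse).
    cols = list(zip(*v))
    best = [max(c) if benefit[j] else min(c) for j, c in enumerate(cols)]
    worst = [min(c) if benefit[j] else max(c) for j, c in enumerate(cols)]
    scores = []
    for row in v:
        d_best = math.dist(row, best)    # Euclidean distance to the ideal point
        d_worst = math.dist(row, worst)  # Euclidean distance to the anti-ideal point
        scores.append(d_worst / (d_best + d_worst))
    return scores

# Hypothetical figures, NOT results from the paper: accuracy, F-score, time in seconds.
classifiers = ["C4.5", "SVM", "MLP"]
matrix = [
    [0.95, 0.94, 0.5],
    [0.90, 0.89, 2.0],
    [0.88, 0.87, 9.0],
]
weights = [0.4, 0.4, 0.2]      # assumed crisp weights standing in for Fuzzy AHP output
benefit = [True, True, False]  # time required is a cost criterion (lower is better)

scores = topsis(matrix, weights, benefit)
ranking = sorted(zip(classifiers, scores), key=lambda pair: pair[1], reverse=True)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

A higher closeness score means the alternative lies nearer the ideal point, which is the sense in which the abstract's "preference value" orders the classifiers and the taxpayers.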

Performance Analysis of Data Mining Algorithms

Journal of Computational and Theoretical Nanoscience, 2019, Vol 16 (9), pp. 3849-3853. DOI: 10.1166/jctn.2019.8260

Author(s): Dar Masroof Amin, Atul Garg

Keyword(s): Data Mining, Big Data, Future Trend, Easy Access, Support Vector, Linear Discriminant, Data Mining Algorithms, Data Files, Using Data, Mining Algorithms

The globalisation of the Internet is creating an enormous amount of data on servers. The data created during the last two years alone is equivalent to the data created in all the years before. This exponential growth is due to easy access to devices based on the Internet of Things. The resulting information has become a source for predictive analysis of future events. The versatile use of computing devices is creating data of a diverse nature, and analysts are predicting future trends using data from their respective domains. The technology used to analyse the data has become a bottleneck over time, mainly because the rate at which data is created far exceeds the capacity of the technology used to access it. Various mining techniques are used to extract useful information. This research presents a detailed analysis of how data is used and perceived by various data mining algorithms. The algorithms analysed are Naïve Bayes, Support Vector Machines, Linear Discriminant Analysis, Artificial Neural Networks, C4.5, C5.0, and K-Nearest Neighbour. The input to these algorithms is big data files, and the research focuses mainly on how existing algorithms interact with such files. The research was carried out on Twitter comments.

A Review Study on Data Mining Algorithms for Prediction Diseases

International Journal for Research in Engineering Application & Management, 2020, pp. 504-507. DOI: 10.35291/2454-9150.2020.0340

Keyword(s): Neural Network, Data Mining, Support Vector Machine, Support Vector, Healthcare Industry, Network Support, Data Mining Algorithms, Review Study, Using Data, Mining Algorithms

The healthcare industry collects massive volumes of healthcare data, which must be turned into useful information. In everyday life, several factors affect human diseases, and hospitals produce large amounts of information related to patients. This paper describes various data mining algorithms, such as neural networks, support vector machines, KNN, and decision trees, and provides a brief overview of existing work. The major advantage of using data mining is its ability to identify structures in the data.

Predictive Factors of Infant Mortality Using Data Mining in Iran

Journal of Comprehensive Pediatrics, 2021, Vol 12 (1). DOI: 10.5812/compreped.108575

Author(s): Mahmoud Hajipour, Niloufar Taherpour, Haleh Fateh, Ebrahim Yousefi, Koorosh Etemad, ...

Keyword(s): Risk Factors, Data Mining, Infant Mortality, Rural Areas, Naive Bayes, Naïve Bayes, Support Vector, Data Mining Algorithms, Using Data, Mining Algorithms

Objectives: Reducing infant mortality worldwide is one of the Millennium Development Goals. The aim of this study was to determine the factors related to infant mortality using data mining algorithms. Methods: This population-based case-control study was conducted in eight provinces of Iran. A total of 2,386 mothers (1,076 cases and 1,310 controls) were enrolled. Data were extracted from the mothers' health records and recorded on checklists in health centers. We employed several data mining algorithms, namely AdaBoost classifier, Support Vector Machine, Artificial Neural Networks, Random Forests, K-nearest neighbor, and Naïve Bayes, to identify the important predictors of infant death; a binary logistic regression model was used to clarify the role of each selected predictor. Results: In this study, 58.7% of infant deaths occurred in rural areas, and 55.6% of these were boys. Among the data mining models, Naïve Bayes and Random Forest were highly capable of predicting related factors. The results also showed that events during pregnancy (dental disorders, high blood pressure, loss of parents), factors related to infants (low birth weight), and factors related to mothers (consanguineous marriage and a gap between pregnancies of < 3 years) were all risk factors, while age at pregnancy of 18-35 years and a high level of education were protective factors. Conclusions: Infant mortality is the consequence of a variety of factors, including factors related to the infants themselves, to their mothers, and to events during pregnancy. Owing to the high accuracy and capability of modern modeling compared with traditional modeling, machine learning tools are recommended for identifying risk factors of infant mortality.

Simple Correlation Between Weather and COVID-19 Pandemic Using Data Mining Algorithms

IOP Conference Series Materials Science and Engineering, 2020, Vol 982, pp. 012015. DOI: 10.1088/1757-899x/982/1/012015

Author(s): Ari Fadli, Azis Wisnu Widhi Nugraha, Muhammad Syaiful Aliim, Acep Taryana, Yogiek Indra Kurniawan, ...

Keyword(s): Data Mining, Simple Correlation, Data Mining Algorithms, Using Data, Mining Algorithms

Diagnosis Using Data Mining Algorithms for Malignant Breast Cancer Cell Detection

2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA), 2020. DOI: 10.1109/iceca49313.2020.9297481

Author(s): S. Saranya, S. Sasikala

Keyword(s): Breast Cancer, Data Mining, Cancer Cell, Breast Cancer Cell, Cell Detection, Data Mining Algorithms, Malignant Breast, Using Data, Cancer Cell Detection, Mining Algorithms

Social Media Analytics Using Data Mining Algorithms

Sustainable Communication Networks and Application - Lecture Notes on Data Engineering and Communications Technologies, 2019, pp. 12-23. DOI: 10.1007/978-3-030-34515-0_2

Author(s): Harnoor Anand, Sandeep Mathur

Keyword(s): Data Mining, Social Media, Social Media Analytics, Data Mining Algorithms, Using Data, Mining Algorithms

Prediction of Job Suitability of College Graduate Candidates Using Data Mining Algorithms

Proceedings of the 1st International Conference on Computer Science and Engineering Technology Universitas Muria Kudus, 2018. DOI: 10.4108/eai.24-10-2018.2280576

Author(s): Vanessa Stefanny, Arief Wibowo

Keyword(s): Data Mining, College Graduate, Data Mining Algorithms, Using Data, Mining Algorithms

Applying data mining algorithms to real estate appraisals: a comparative study

International Journal of Housing Markets and Analysis, 2021, Vol ahead-of-print (ahead-of-print). DOI: 10.1108/ijhma-07-2020-0080

Author(s): Thiago Cesar de Oliveira, Lúcio de Medeiros, Daniel Henrique Marco Detzel

Keyword(s): Data Mining, Real Estate, Support Vector, Predictive Capacity, Content Type, Data Mining Algorithms, Wide Range, Very Large Databases, Mining Algorithms, Statistical Results

Purpose: Real estate appraisals are becoming an increasingly important means of backing up financial operations based on the values of these kinds of assets. However, in very large databases, predictive capacity is reduced when traditional methods, such as multiple linear regression (MLR), are used. This paper aims to determine whether in these cases the application of data mining algorithms can achieve superior statistical results. Design/methodology/approach: First, real estate appraisal databases from five towns and cities in the State of Paraná, Brazil, were obtained from Caixa Econômica Federal bank. After initial validations, additional databases were generated with real, transformed and nominal values, in clean and raw data. A wide range of data mining algorithms was applied to each database (multilayer perceptron, support vector regression, K-star, M5Rules and random forest), either isolated or combined (regression by discretization – logistic, bagging and stacking), with the use of 10-fold cross-validation in Weka software. Findings: The algorithms produced incremental statistical gains of varying size over the results obtained by MLR, especially when combined algorithms were used. The largest increments were obtained in databases with a large amount of data and in those where only minor initial data cleaning was carried out. The paper also conducts a further analysis, including an algorithmic ranking based on the number of significant results obtained. Originality/value: The authors did not find similar studies conducted in Brazil.
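
The 10-fold cross-validation mentioned in this abstract can be sketched in plain Python as follows. This is only an illustration of the general technique, not the Weka procedure the authors ran: the helper names `k_fold_splits` and `cross_validated_mae`, the mean-price baseline model and the price list are all hypothetical.

```python
import random

def k_fold_splits(n_samples, k=10, seed=0):
    """Yield (train_indices, test_indices) for each of k folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)       # shuffle once, reproducibly
    folds = [indices[i::k] for i in range(k)]  # k nearly equal-sized folds
    for i, test in enumerate(folds):
        # Training set is every fold except the one held out for testing.
        train = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train, test

def cross_validated_mae(prices, k=10):
    """Mean absolute error of a mean-price baseline under k-fold cross-validation."""
    abs_errors = []
    for train, test in k_fold_splits(len(prices), k):
        baseline = sum(prices[j] for j in train) / len(train)  # "model" fit on train only
        abs_errors.extend(abs(prices[j] - baseline) for j in test)
    return sum(abs_errors) / len(abs_errors)

# Hypothetical appraisal values, NOT data from the paper.
prices = [100.0 + 5.0 * i for i in range(50)]
mae = cross_validated_mae(prices)
print(f"10-fold MAE of the mean baseline: {mae:.2f}")
```

Each sample is used for testing exactly once and for training in the other nine folds, so every error estimate comes from data the model did not see. Replacing the baseline with a regression model would give the kind of comparison against MLR that the abstract describes.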

Method for Processing Fluorescence Decay Kinetic Curves Using Data Mining Algorithms

Journal of Applied Spectroscopy, 2020, Vol 87 (2), pp. 333-344. DOI: 10.1007/s10812-020-01004-3

Author(s): M. M. Yatskou, V. V. Skakun, V. V. Apanasovich

Keyword(s): Data Mining, Fluorescence Decay, Decay Kinetic, Kinetic Curves, Data Mining Algorithms, Using Data, Mining Algorithms
