Air Temperature Prediction Using Different Datamining Approaches In Sulaymaniyah City In Iraq

Yosra Mohammed; Sherko Murad; Brzu Tahir

doi:10.24271/psr.21

Mapping Intimacies ◽

10.24271/psr.21 ◽

2021 ◽

Vol 3 (2) ◽

pp. 1-9

Author(s):

Yosra Mohammed ◽

Sherko Murad ◽

Brzu Tahir

Keyword(s):

Climate Change ◽

Data Mining ◽

Support Vector Machine ◽

Air Temperature ◽

Significant Feature ◽

Support Vector ◽

Temperature Prediction ◽

Data Mining Algorithms ◽

Air Temperature Prediction ◽

Mining Algorithms

Climate change has had an impact at global and local levels over the past era, and it is one of the most challenging issues in meteorological research. Air temperature estimation, in particular, has been regarded as a significant feature in studies of weather impact on the industrial, environmental, ecological, and agricultural sectors. Accurate air temperature prediction helps gauge living conditions and plays a key role for government, industry, and the public in development activities. In this paper, we investigate the use of various data mining approaches such as Support Vector Machine (SVM), Decision Tree (DT), and Naïve Bayes for air temperature prediction in Sulaymaniyah City in Kurdistan, Iraq. The meteorological data were collected from the local Weather Forecast Department in the city and cover 2013 to 2018 inclusive. A dataset was built from the meteorological data and used to train the data mining algorithms. The algorithms were then tested on the dataset to predict the air temperature, and their performance was compared using standard performance metrics. The support vector machine achieved promising performance among the algorithms used.
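
The abstract does not name the standard performance metrics used in the comparison. As a minimal sketch, assuming the prediction is framed as classification into temperature classes (an assumption; the function name and the class labels below are invented for illustration), accuracy and macro-averaged precision, recall and F1 could be computed like this:

```python
from collections import Counter


def classification_metrics(y_true, y_pred):
    """Return accuracy and macro-averaged precision, recall and F1."""
    labels = sorted(set(y_true) | set(y_pred))
    # pairs[(t, p)] counts samples with true class t predicted as class p.
    pairs = Counter(zip(y_true, y_pred))
    correct = sum(pairs[(c, c)] for c in labels)

    precisions, recalls, f1s = [], [], []
    for c in labels:
        tp = pairs[(c, c)]
        predicted = sum(pairs[(t, c)] for t in labels)  # column total
        actual = sum(pairs[(c, p)] for p in labels)     # row total
        p = tp / predicted if predicted else 0.0
        r = tp / actual if actual else 0.0
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        precisions.append(p)
        recalls.append(r)
        f1s.append(f)

    n = len(labels)
    return {
        "accuracy": correct / len(y_true),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Invented labels, not data from the paper.
y_true = ["cold", "cold", "mild", "mild", "hot", "hot"]
y_pred = ["cold", "mild", "mild", "mild", "hot", "cold"]
report = classification_metrics(y_true, y_pred)
print(report)
```

Running the same function on the predictions of each algorithm gives one row per algorithm for a side-by-side comparison.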

Assessment of land subsidence susceptibility in Semnan plain (Iran): a comparison of support vector machine and weights of evidence data mining algorithms

Natural Hazards ◽

10.1007/s11069-019-03785-z ◽

2019 ◽

Vol 99 (2) ◽

pp. 951-971 ◽

Cited By ~ 11

Author(s):

Majid Mohammady ◽

Hamid Reza Pourghasemi ◽

Mojtaba Amiri

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Land Subsidence ◽

Weights Of Evidence ◽

Support Vector ◽

Data Mining Algorithms ◽

Mining Algorithms

A Review Study on Data Mining Algorithms for Prediction Diseases

International Journal for Research in Engineering Application & Management ◽

10.35291/2454-9150.2020.0340 ◽

2020 ◽

pp. 504-507

Keyword(s):

Neural Network ◽

Data Mining ◽

Support Vector Machine ◽

Support Vector ◽

Healthcare Industry ◽

Network Support ◽

Data Mining Algorithms ◽

Review Study ◽

Using Data ◽

Mining Algorithms

The healthcare industry collects massive volumes of healthcare information that can be turned into useful data. In everyday life, several factors affect human diseases. Hospitals produce large amounts of information related to patients. This paper describes various data mining algorithms, such as neural networks, support vector machines, KNN, and decision trees, and provides a brief overview of existing work. The major advantage of using data mining is its ability to identify structures in the data.

Study of SVM algorithm for Data Mining in Big Data

Psychology and Education Journal ◽

10.17762/pae.v58i1.1498 ◽

2021 ◽

Vol 58 (1) ◽

pp. 4296-4301

Author(s):

V. Nanda Kumar, Vinoth N. A. S., Mohamed Sanjarkhan

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Big Data ◽

Elevated Level ◽

Support Vector ◽

Information Mining ◽

Data Mining Algorithms ◽

Svm Algorithm ◽

Mining Algorithms ◽

Excessive Number

Data mining is a process that finds useful patterns in large amounts of data. These days, a very large number of data mining algorithms exist. The Support Vector Machine (SVM) plays a crucial role, as it provides methods for obtaining results efficiently and with a high level of quality. In this paper, we examine the role of the SVM algorithm in big data from the perspective of data mining tasks such as classification, clustering, prediction, forecasting, and other applications. Today's world is made up of "big data". The main aim of this paper is to clearly understand the basis of SVM techniques in different areas. We have assessed the number of research publications on data mining applications that have appeared in various reputed journals, and we also suggest a number of potential issues with SVM.

Osteoporosis Risk Prediction Using Data Mining Algorithms

Journal of Community Health Research ◽

10.18502/jchr.v9i2.3401 ◽

2020 ◽

Author(s):

Efat Jabarpour ◽

Amin Abedini ◽

Abbasali Keshtkar

Keyword(s):

Data Mining ◽

Personal Information ◽

Disease Diagnosis ◽

Support Vector ◽

Data Mining Algorithms ◽

Industry Standard ◽

Disease Information ◽

Increased Risk ◽

Using Data ◽

Mining Algorithms

Introduction: Osteoporosis is a disease that reduces bone density and degrades the quality of bone microstructure, leading to an increased risk of fractures. It is one of the major causes of disability and death in elderly people. The current study aims to determine the factors influencing the incidence of osteoporosis and to provide a predictive model for diagnosing the disease, in order to increase diagnostic speed and reduce diagnostic costs. Methods: Individuals' data, including personal information, lifestyle, and disease information, were reviewed. A new model is presented based on the Cross-Industry Standard Process (CRISP) methodology. Support Vector Machine (SVM) and a Bayesian method, Tree Augmented Naïve Bayes (TAN), were applied, with Clementine 12 as the data mining tool. Results: Several features were found to affect the disease. Rules were extracted that can be used as a pattern for predicting patients' status. Classification precision was 88.39% for SVM and 91.29% for TAN; the precision of TAN was higher than that of the other methods. Conclusion: The most influential factors in osteoporosis were identified and can be used, for a new sample with defined characteristics, to predict the possibility of osteoporosis in a person.

Applying data mining algorithms to real estate appraisals: a comparative study

International Journal of Housing Markets and Analysis ◽

10.1108/ijhma-07-2020-0080 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Thiago Cesar de Oliveira ◽

Lúcio de Medeiros ◽

Daniel Henrique Marco Detzel

Keyword(s):

Data Mining ◽

Real Estate ◽

Support Vector ◽

Predictive Capacity ◽

Content Type ◽

Data Mining Algorithms ◽

Wide Range ◽

Very Large Databases ◽

Mining Algorithms ◽

Statistical Results

Purpose: Real estate appraisals are becoming an increasingly important means of backing up financial operations based on the values of these kinds of assets. However, in very large databases, there is a reduction in the predictive capacity when traditional methods, such as multiple linear regression (MLR), are used. This paper aims to determine whether in these cases the application of data mining algorithms can achieve superior statistical results. First, real estate appraisal databases from five towns and cities in the State of Paraná, Brazil, were obtained from Caixa Econômica Federal bank. Design/methodology/approach: After initial validations, additional databases were generated with both real, transformed and nominal values, in clean and raw data. Each was assisted by the application of a wide range of data mining algorithms (multilayer perceptron, support vector regression, K-star, M5Rules and random forest), either isolated or combined (regression by discretization – logistic, bagging and stacking), with the use of 10-fold cross-validation in Weka software. Findings: The results showed more varied incremental statistical results with the use of algorithms than those obtained by MLR, especially when combined algorithms were used. The largest increments were obtained in databases with a large amount of data and in those where minor initial data cleaning was carried out. The paper also conducts a further analysis, including an algorithmic ranking based on the number of significant results obtained. Originality/value: The authors did not find similar studies or research studies conducted in Brazil.
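
The 10-fold cross-validation mentioned in this abstract was run in Weka. Purely as an illustration of the splitting scheme, and not of Weka's own implementation, a sketch in plain Python might look like the following. The names `k_fold_indices` and `cross_validate`, and the `fit`, `predict` and `error` callables, are hypothetical.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once.

    Assumes k <= n_samples so that no fold is empty.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of near-equal size.
    folds = [indices[i::k] for i in range(k)]
    for i, test_idx in enumerate(folds):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx


def cross_validate(fit, predict, error, X, y, k=10, seed=0):
    """Average a user-supplied error measure over the k held-out folds."""
    scores = []
    for train_idx, test_idx in k_fold_indices(len(X), k, seed):
        model = fit([X[i] for i in train_idx], [y[i] for i in train_idx])
        preds = [predict(model, X[i]) for i in test_idx]
        scores.append(error([y[i] for i in test_idx], preds))
    return sum(scores) / len(scores)


# Toy usage with a mean-value baseline on invented numbers.
X = [[float(i)] for i in range(30)]
y = [2.0 * i for i in range(30)]
baseline_fit = lambda Xs, ys: sum(ys) / len(ys)
baseline_predict = lambda model, x: model
mean_abs_error = lambda t, p: sum(abs(a - b) for a, b in zip(t, p)) / len(t)
print(cross_validate(baseline_fit, baseline_predict, mean_abs_error, X, y))
```

Any regressor can be plugged in through `fit` and `predict`, so the same folds can be reused to compare a baseline such as MLR against the other algorithms.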

Acoustic Signature Based Weld Quality Monitoring for SMAW Process Using Data Mining Algorithms

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.813-814.1104 ◽

2015 ◽

Vol 813-814 ◽

pp. 1104-1113 ◽

Cited By ~ 5

Author(s):

A. Sumesh ◽

Dinu Thomas Thekkuden ◽

Binoy B. Nair ◽

K. Rameshkumar ◽

K. Mohandas

Keyword(s):

Neural Network ◽

Data Mining ◽

Welding Process ◽

Machine Learning Algorithms ◽

Steel Plates ◽

Support Vector ◽

Welding Parameters ◽

Process Data ◽

Data Mining Algorithms ◽

Mining Algorithms

The quality of a weld depends on the welding parameters and the environmental conditions to which it is exposed. Improper selection of welding process parameters is one of the main causes of weld defects. In this work, arc sound signals are captured during the welding of carbon steel plates, and statistical features of the sound signals are extracted during the welding process. Data mining algorithms such as Naive Bayes, Support Vector Machines and Neural Networks were used to classify the weld conditions according to the features of the sound signal. Two weld conditions were considered in this study: good weld, and weld with defects (lack of fusion and burn-through). The classification efficiencies of the machine learning algorithms were compared. The neural network was found to produce better classification efficiency than the other algorithms considered in this study.
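
The abstract does not list which statistical features were extracted from the arc sound signals. As a hedged sketch, assuming a commonly used set (mean, variance, standard deviation, RMS, skewness and kurtosis, all in their population form), the feature vector for one signal window could be computed as below. The function name and the sample window are invented.

```python
import math


def statistical_features(signal):
    """Compute simple time-domain statistics for one window of samples."""
    n = len(signal)
    mean = sum(signal) / n
    centered = [x - mean for x in signal]
    variance = sum(c * c for c in centered) / n  # population variance
    std = math.sqrt(variance)
    rms = math.sqrt(sum(x * x for x in signal) / n)
    # Standardized third and fourth moments; defined as 0 for a flat signal.
    skewness = (sum(c ** 3 for c in centered) / n) / std ** 3 if std else 0.0
    kurtosis = (sum(c ** 4 for c in centered) / n) / std ** 4 if std else 0.0
    return {
        "mean": mean,
        "variance": variance,
        "std": std,
        "rms": rms,
        "skewness": skewness,
        "kurtosis": kurtosis,
    }


# Invented samples standing in for a short window of a recorded signal.
window = [0.2, -0.1, 0.4, -0.3, 0.1, 0.0, -0.2, 0.3]
print(statistical_features(window))
```

Each window then yields one fixed-length feature vector that a classifier can take as input.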

Comparing Performance of Data Mining Algorithms in Prediction Heart Diseases

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v5i6.pp1569-1576 ◽

2015 ◽

Vol 5 (6) ◽

pp. 1569-1576 ◽

Cited By ~ 13

Author(s):

Moloud Abdar ◽

Sharareh R. Niakan Kalhori ◽

Tole Sutikno ◽

Imam Much Ibnu Subroto ◽

Goli Arji

Keyword(s):

Neural Network ◽

Data Mining ◽

Decision Tree ◽

Heart Diseases ◽

Support Vector ◽

Data Mining Algorithm ◽

Network Support ◽

Data Mining Algorithms ◽

Mining Algorithms ◽

Analysis Models

Heart diseases are among the nation’s leading causes of mortality and morbidity. Data mining techniques can predict the likelihood of patients developing heart disease. The purpose of this study is to compare different data mining algorithms for the prediction of heart diseases. This work applied and compared data mining techniques to predict the risk of heart diseases. After feature analysis, models based on five algorithms, namely decision tree (C5.0), neural network, support vector machine (SVM), logistic regression and k-nearest neighbor (KNN), were developed and validated. The C5.0 decision tree built the model with the greatest accuracy, 93.02%, while KNN, SVM and neural network achieved 88.37%, 86.05% and 80.23%, respectively. The results produced by the decision tree are easy to interpret and apply; its rules can be readily understood by different clinical practitioners.

Data mining, fuzzy AHP and TOPSIS for optimizing taxpayer supervision

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v18.i1.pp75-87 ◽

2020 ◽

Vol 18 (1) ◽

pp. 75-87

Author(s):

M. Jupri ◽

Riyanarto Sarno

Keyword(s):

Data Mining ◽

Nearest Neighbor ◽

Fuzzy Ahp ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Algorithms ◽

Using Data ◽

Time Required ◽

Mining Algorithms

Achieving optimal tax collection requires effective and efficient tax supervision, which can be achieved by classifying taxpayer compliance with tax regulations. To address this issue, this paper proposes classifying taxpayer compliance using data mining algorithms, i.e. C4.5, Support Vector Machine, K-Nearest Neighbor, Naive Bayes, and Multilayer Perceptron, based on taxpayer compliance data. Taxpayer compliance can be classified into four classes: (1) formal and material compliant taxpayers, (2) formal compliant taxpayers, (3) material compliant taxpayers, and (4) formal and material non-compliant taxpayers. Furthermore, the results of the data mining algorithms are compared using Fuzzy AHP and TOPSIS to determine the best-performing classifier based on the criteria of Accuracy, F-Score, and Time required. The selection of priority taxpayers for more detailed supervision at each level of taxpayer compliance is ranked using Fuzzy AHP and TOPSIS based on criteria of dataset variables. The results show that C4.5 is the best-performing classifier, achieving a preference value of 0.998, whereas the MLP algorithm yields the lowest preference value of 0.131. Alternative taxpayer A233 is the top priority taxpayer with a preference value of 0.433, whereas alternative taxpayer A051 is the lowest priority taxpayer with a preference value of 0.036.
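
The preference values reported above come from TOPSIS. A minimal sketch of the standard TOPSIS procedure is given below, assuming the criterion weights are already available (in the paper they are derived with Fuzzy AHP, which is not reproduced here). The function name, the weights and the decision matrix are invented for illustration and are not the paper's data.

```python
import math


def topsis(matrix, weights, benefit):
    """Return one closeness score in [0, 1] per alternative (row).

    matrix  : rows are alternatives, columns are criteria
    weights : one weight per criterion
    benefit : True if larger is better for that criterion, False if smaller
    Assumes no criterion column is all zeros and the rows are not all equal.
    """
    n_criteria = len(weights)
    # Vector-normalize each column, then apply the criterion weight.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_criteria)]
    weighted = [
        [weights[j] * row[j] / norms[j] for j in range(n_criteria)]
        for row in matrix
    ]
    columns = list(zip(*weighted))
    best = [max(col) if benefit[j] else min(col) for j, col in enumerate(columns)]
    worst = [min(col) if benefit[j] else max(col) for j, col in enumerate(columns)]

    scores = []
    for row in weighted:
        d_best = math.dist(row, best)    # distance to the ideal solution
        d_worst = math.dist(row, worst)  # distance to the anti-ideal solution
        scores.append(d_worst / (d_best + d_worst))
    return scores


# Invented example: criteria are accuracy, F-score (benefit) and time (cost).
alternatives = ["A", "B", "C"]
matrix = [
    [0.90, 0.88, 1.0],
    [0.85, 0.86, 0.5],
    [0.70, 0.65, 3.0],
]
scores = topsis(matrix, weights=[0.5, 0.3, 0.2], benefit=[True, True, False])
for name, score in sorted(zip(alternatives, scores), key=lambda t: -t[1]):
    print(name, round(score, 3))
```

An alternative that is best on every criterion scores 1 and one that is worst on every criterion scores 0, which is why the preference values in the abstract fall between those bounds.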

A Method for Classification Using Data Mining Technique for Diabetes

Psychology and Mental Health ◽

10.4018/978-1-5225-0159-6.ch030 ◽

2016 ◽

pp. 738-761

Author(s):

Ahmad Al-Khasawneh

Keyword(s):

Data Mining ◽

Support Vector Machine ◽

Classification Accuracy ◽

Health Information System ◽

Parameters Optimization ◽

Support Vector ◽

Data Mining Algorithms ◽

Predictive Data Mining ◽

Severity Of The Disease ◽

Using Data

Many researchers in the health information system field have been drawn to developing computer applications that help in the diagnosis process. Data mining algorithms play a vital role in all of these applications, and many contributions have been made in this area. There has always been debate about which algorithm gives the best classifier, which parameters to use, which dataset pre-processing steps to apply, etc. In this paper, the author emphasizes that the best way to build a predictive model with relatively high classification accuracy is to build several predictive models and to choose the one that gives the best results through parameter optimization. Diagnosing diabetes mellitus has gained considerable attention in the last few decades due to the increased severity of the disease. In this research, the author reviews four predictive data mining approaches that are used in diagnosing diabetes. Four models were implemented to diagnose diabetes from the PIMA dataset: k-nearest neighbour, support vector machine, multilayer perceptron neural network, and naive Bayesian network. The support vector machine technique outperformed the others, giving the highest classification accuracy of 78.83%.
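
The idea of building several candidate models and keeping the one with the best validation result can be sketched with k-nearest neighbour, one of the four approaches named above. This is an illustrative sketch only: the function names are hypothetical, the points are toy data rather than the PIMA dataset, and a single hold-out split is assumed for the parameter search.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, x, k):
    """Predict the majority label among the k nearest training points."""
    neighbours = sorted(
        zip(train_X, train_y), key=lambda pair: math.dist(pair[0], x)
    )[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


def knn_accuracy(train_X, train_y, val_X, val_y, k):
    """Fraction of validation points classified correctly for a given k."""
    hits = sum(knn_predict(train_X, train_y, x, k) == y for x, y in zip(val_X, val_y))
    return hits / len(val_y)


def select_k(train_X, train_y, val_X, val_y, candidates):
    """Try each candidate k and return the best one with all the scores."""
    scored = {k: knn_accuracy(train_X, train_y, val_X, val_y, k) for k in candidates}
    best_k = max(scored, key=scored.get)  # first candidate wins on ties
    return best_k, scored


# Toy two-class data, invented for illustration.
train_X = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5)]
train_y = [0, 0, 0, 1, 1, 1]
val_X = [(0.5, 0.5), (5.5, 5.5)]
val_y = [0, 1]
best_k, scored = select_k(train_X, train_y, val_X, val_y, candidates=[1, 3, 5])
print(best_k, scored)
```

The same loop extends to other model families by scoring each configuration on the same validation split and keeping the highest-scoring one.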

Performance Analysis of Data Mining Algorithms

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2019.8260 ◽

2019 ◽

Vol 16 (9) ◽

pp. 3849-3853

Author(s):

Dar Masroof Amin ◽

Atul Garg

Keyword(s):

Data Mining ◽

Big Data ◽

Future Trend ◽

Easy Access ◽

Support Vector ◽

Linear Discriminant ◽

Data Mining Algorithms ◽

Data Files ◽

Using Data ◽

Mining Algorithms

The globalisation of the Internet is creating an enormous amount of data on servers. The data created during the last two years alone is equivalent to the data created in all the years before. This exponential growth is due to easy access to devices based on the Internet of Things. This information has become a source of predictive analysis for future events. The versatile use of computing devices is creating data of a diverse nature, and analysts are predicting future trends using data from their respective domains. The technology used to analyse the data has become a bottleneck over time, mainly because data is being created at a much higher rate than the technology used to access it can handle. Various mining techniques are used to extract useful information. This research presents a detailed analysis of how data is used and perceived by various data mining algorithms. Mining algorithms such as Naïve Bayes, Support Vector Machines, Linear Discriminant Analysis, Artificial Neural Networks, C4.5, C5.0 and K-Nearest Neighbour are analysed. The input data used by these algorithms is big data files. This research mainly focuses on how existing data mining algorithms interact with big data files. The research was carried out on Twitter comments.