scholarly journals Performance evaluation of different classification techniques using different datasets

Author(s):  
Abdulkadir Özdemir ◽  
Uğur Yavuz ◽  
Fares Abdulhafidh Dael

<span>Nowadays data mining become one of the technologies that paly major effect on business intelligence. However, to be able to use the data mining outcome the user should go through many process such as classified data. Classification of data is processing data and organize them in specific categorize to be use in most effective and efficient use. In data mining one technique is not applicable to be applied to all the datasets. This paper showing the difference result of applying different techniques on the same data. This paper evaluates the performance of different classification techniques using different datasets. In this study four data classification techniques have chosen. They are as follow, BayesNet, NaiveBayes, Multilayer perceptron and J48. The selected data classification techniques performance tested under two parameters, the time taken to build the model of the dataset and the percentage of accuracy to classify the dataset in the correct classification. The experiments are carried out using Weka 3.8 software. The results in the paper demonstrate that the efficiency of Multilayer Perceptron classifier in overall the best accuracy performance to classify the instances, and NaiveBayes classifiers were the worst outcome of accuracy to classifying the instance for each dataset.</span>

2018 ◽  
Vol 150 ◽  
pp. 06003 ◽  
Author(s):  
Saima Anwar Lashari ◽  
Rosziati Ibrahim ◽  
Norhalina Senan ◽  
N. S. A. M. Taujuddin

This paper investigates the existing practices and prospects of medical data classification based on data mining techniques. It highlights major advanced classification approaches used to enhance classification accuracy. Past research has provided literature on medical data classification using data mining techniques. From extensive literature analysis, it is found that data mining techniques are very effective for the task of classification. This paper analysed comparatively the current advancement in the classification of medical data. The findings of the study showed that the existing classification of medical data can be improved further. Nonetheless, there should be more research to ascertain and lessen the ambiguities for classification to gain better precision.


2021 ◽  
Vol 4 (1) ◽  
pp. 14
Author(s):  
Husna Afanyn Khoirunissa ◽  
Amanda Rizky Widyaningrum ◽  
Annisa Priliya Ayu Maharani

<p>The Bank is a business entity that is dealing with money, accepting deposits from customers, providing funds for each withdrawal, billing checks on the customer's orders, giving credit and or embedding the excess deposits until required for repayment. The purpose of this research is to determine the influence of age, gender, country, customer credit score, number of bank products used by the customer, and the activation of the bank members in the decision to choose to continue using the bank account that he has retained or closed the bank account. The data in this research used 10,000 respondents originating from France, Spain, and Germany. The method used is data mining with early stage preprocessing to clean data from outlier and missing value and feature selection to select important attributes. Then perform the classification using three methods, which are Random Forest, Logistic Regression, and Multilayer Perceptron. The results of this research showed that the model with Multilayer Perceptron method with 10 folds Cross Validation is the best model with 85.5373% accuracy.</p><strong>Keywords:</strong> bank customer, random forest, logistic regression, multilayer perceptron


2019 ◽  
Vol 1 (1) ◽  
pp. 121-131
Author(s):  
Ali Fauzi

The existence of big data of Indonesian FDI (foreign direct investment)/ CDI (capital direct investment) has not been exploited somehow to give further ideas and decision making basis. Example of data exploitation by data mining techniques are for clustering/labeling using K-Mean and classification/prediction using Naïve Bayesian of such DCI categories. One of DCI form is the ‘Quick-Wins’, a.k.a. ‘Low-Hanging-Fruits’ Direct Capital Investment (DCI), or named shortly as QWDI. Despite its mentioned unfavorable factors, i.e. exploitation of natural resources, low added-value creation, low skill-low wages employment, environmental impacts, etc., QWDI , to have great contribution for quick and high job creation, export market penetration and advancement of technology potential. By using some basic data mining techniques as complements to usual statistical/query analysis, or analysis by similar studies or researches, this study has been intended to enable government planners, starting-up companies or financial institutions for further CDI development. The idea of business intelligence orientation and knowledge generation scenarios is also one of precious basis. At its turn, Information and Communication Technology (ICT)’s enablement will have strategic role for Indonesian enterprises growth and as a fundamental for ‘knowledge based economy’ in Indonesia.


1999 ◽  
Vol 15 (1) ◽  
pp. 10-17
Author(s):  
Molina Omar Franklin ◽  
Tavares Gimenes Pablo ◽  
Aquilino Raphael ◽  
Rank Rise ◽  
Coelho Santos Zeila ◽  
...  

Objective: To assess the level of depression, severity of pain and pain in single/multiple sites in patients with different severity of bruxing behavior and Temporomandibular Disorders (TMDs). Methods: We evaluated 131 patients with bruxism and TMDs: 20 patients with mild bruxism, 42 patients with moderate bruxism, 45 patients with severe bruxism and 24 patients with extreme bruxism. We used the Beck Depression Inventory (BDI), clinical examination, a questionnaire of clinical epidemiological data, criteria for TMDs and bruxism, palpation of muscles and joints, the Visual Analogue Scale for pain, classification of the occlusion and biomechanical tests to assess for internal joint derangements. Results: The level of depression increased from the mild, to the moderate, severe and extreme bruxing behavior groups, but the difference was significant only from the mild to the extreme group (p<0.001). Pain levels increased from the mild and moderate to the severe and extreme subgroups, but were not statistically significant. Mean number of pain sites increased from the mild, to the moderate, severe and extreme subgroup and the difference was extremely significant (p<0.0001). Conclusion: Levels of depression, severity of pain and pain sites increased with severity of bruxing behavior. A higher number of pain sites with more severe bruxism indicates somatization in bruxers, but a further study using the same protocol and a psychological test for somatization would be indicated to further substantiate these findings.


Diagnostics ◽  
2021 ◽  
Vol 11 (2) ◽  
pp. 233
Author(s):  
Dong-Woon Lee ◽  
Sung-Yong Kim ◽  
Seong-Nyum Jeong ◽  
Jae-Hong Lee

Fracture of a dental implant (DI) is a rare mechanical complication that is a critical cause of DI failure and explantation. The purpose of this study was to evaluate the reliability and validity of a three different deep convolutional neural network (DCNN) architectures (VGGNet-19, GoogLeNet Inception-v3, and automated DCNN) for the detection and classification of fractured DI using panoramic and periapical radiographic images. A total of 21,398 DIs were reviewed at two dental hospitals, and 251 intact and 194 fractured DI radiographic images were identified and included as the dataset in this study. All three DCNN architectures achieved a fractured DI detection and classification accuracy of over 0.80 AUC. In particular, automated DCNN architecture using periapical images showed the highest and most reliable detection (AUC = 0.984, 95% CI = 0.900–1.000) and classification (AUC = 0.869, 95% CI = 0.778–0.929) accuracy performance compared to fine-tuned and pre-trained VGGNet-19 and GoogLeNet Inception-v3 architectures. The three DCNN architectures showed acceptable accuracy in the detection and classification of fractured DIs, with the best accuracy performance achieved by the automated DCNN architecture using only periapical images.


Diagnostics ◽  
2020 ◽  
Vol 10 (3) ◽  
pp. 162 ◽  
Author(s):  
Julieta G. Rodríguez-Ruiz ◽  
Carlos E. Galván-Tejada ◽  
Laura A. Zanella-Calzada ◽  
José M. Celaya-Padilla ◽  
Jorge I. Galván-Tejada ◽  
...  

Major Depression Disease has been increasing in the last few years, affecting around 7 percent of the world population, but nowadays techniques to diagnose it are outdated and inefficient. Motor activity data in the last decade is presented as a better way to diagnose, treat and monitor patients suffering from this illness, this is achieved through the use of machine learning algorithms. Disturbances in the circadian rhythm of mental illness patients increase the effectiveness of the data mining process. In this paper, a comparison of motor activity data from the night, day and full day is carried out through a data mining process using the Random Forest classifier to identified depressive and non-depressive episodes. Data from Depressjon dataset is split into three different subsets and 24 features in time and frequency domain are extracted to select the best model to be used in the classification of depression episodes. The results showed that the best dataset and model to realize the classification of depressive episodes is the night motor activity data with 99.37% of sensitivity and 99.91% of specificity.


Sign in / Sign up

Export Citation Format

Share Document