Performance evaluation of different classification techniques using different datasets

Data mining has become one of the technologies that play a major role in business intelligence. However, to use the outcome of data mining, the user must go through several processes, such as data classification. Classification is the processing of data and its organization into specific categories so that it can be used effectively and efficiently. In data mining, no single technique is applicable to all datasets. This paper shows how results differ when different techniques are applied to the same data, and evaluates the performance of different classification techniques on different datasets. Four data classification techniques were chosen for this study: BayesNet, NaiveBayes, Multilayer Perceptron, and J48. Their performance was tested under two parameters: the time taken to build the model of the dataset and the percentage of instances classified correctly. The experiments were carried out using Weka 3.8. The results show that the Multilayer Perceptron classifier achieved the best overall accuracy in classifying the instances, while the NaiveBayes classifier gave the worst accuracy on each dataset.
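The two measures named in this abstract, model build time and percentage of correctly classified instances, can be sketched as a small evaluation harness. This is a minimal illustration in plain Python, not the Weka workflow used in the paper; the `MajorityClassifier`, the `evaluate` helper, and the toy data are all hypothetical stand-ins.

```python
import time
from collections import Counter


class MajorityClassifier:
    """Hypothetical toy model: always predicts the most common training label."""

    def fit(self, X, y):
        self.label_ = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, X):
        return [self.label_ for _ in X]


def evaluate(model, X_train, y_train, X_test, y_test):
    """Return (build_time_seconds, accuracy_percent) for one model on one split."""
    start = time.perf_counter()
    model.fit(X_train, y_train)               # only model building is timed
    build_time = time.perf_counter() - start

    predictions = model.predict(X_test)
    correct = sum(p == t for p, t in zip(predictions, y_test))
    accuracy = 100.0 * correct / len(y_test)
    return build_time, accuracy


# Toy data, invented for illustration only.
X_train = [[0.1], [0.2], [0.9], [0.8], [0.3]]
y_train = ["a", "a", "b", "b", "a"]
X_test = [[0.15], [0.85], [0.25], [0.7]]
y_test = ["a", "b", "a", "b"]

build_time, accuracy = evaluate(MajorityClassifier(), X_train, y_train, X_test, y_test)
print(f"build time: {build_time:.6f} s, accuracy: {accuracy:.1f}%")
```

Any object exposing `fit` and `predict` could be passed to `evaluate`, so several classifiers can be compared on the same split under the same two measures.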

Application of Data Mining Techniques for Medical Data Classification: A Review

MATEC Web of Conferences ◽

10.1051/matecconf/201815006003 ◽

2018 ◽

Vol 150 ◽

pp. 06003 ◽

Cited By ~ 2

Author(s):

Saima Anwar Lashari ◽

Rosziati Ibrahim ◽

Norhalina Senan ◽

N. S. A. M. Taujuddin

Keyword(s):

Data Mining ◽

Data Classification ◽

Past Research ◽

Medical Data ◽

Extensive Literature ◽

Literature Analysis ◽

Data Mining Techniques ◽

Medical Data Classification ◽

Using Data

This paper investigates the existing practices and prospects of medical data classification based on data mining techniques. It highlights major advanced classification approaches used to enhance classification accuracy. Past research has provided literature on medical data classification using data mining techniques. Extensive literature analysis finds that data mining techniques are very effective for the task of classification. The paper comparatively analyses current advances in the classification of medical data. The findings show that existing classification of medical data can be improved further. Nonetheless, more research is needed to identify and reduce the ambiguities in classification in order to gain better precision.

Comparison of Random Forest, Logistic Regression, and Multilayer Perceptron Methods on Classification of Bank Customer Account Closure

Indonesian Journal of Applied Statistics ◽

10.13057/ijas.v4i1.41461 ◽

2021 ◽

Vol 4 (1) ◽

pp. 14

Author(s):

Husna Afanyn Khoirunissa ◽

Amanda Rizky Widyaningrum ◽

Annisa Priliya Ayu Maharani

Keyword(s):

Data Mining ◽

Logistic Regression ◽

Feature Selection ◽

Random Forest ◽

Multilayer Perceptron ◽

Cross Validation ◽

Early Stage ◽

Bank Account ◽

Credit Score

A bank is a business entity that deals with money: it accepts deposits from customers, provides funds for withdrawals, bills checks on customers' orders, grants credit, and invests excess deposits until they are required for repayment. The purpose of this research is to determine the influence of age, gender, country, customer credit score, number of bank products used by the customer, and the customer's active-member status on the decision to keep or close a bank account. The data cover 10,000 respondents from France, Spain, and Germany. The method is data mining, with early-stage preprocessing to clean the data of outliers and missing values, and feature selection to select important attributes. Classification is then performed using three methods: Random Forest, Logistic Regression, and Multilayer Perceptron. The results show that the Multilayer Perceptron model with 10-fold cross-validation is the best model, with 85.5373% accuracy.

Keywords: bank customer, random forest, logistic regression, multilayer perceptron
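The 10-fold cross-validation mentioned in this abstract can be sketched as follows. This is a minimal plain-Python illustration under stated assumptions: the helper names `k_fold_indices` and `cross_val_accuracy`, the `threshold_rule` stand-in model, and the toy data are hypothetical, and the paper's own tooling and preprocessing are not reproduced here.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_indices, test_indices) for each of k shuffled folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]   # k roughly equal-sized folds
    for i in range(k):
        test_idx = folds[i]
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx


def cross_val_accuracy(fit_predict, X, y, k=10, seed=0):
    """Mean per-fold accuracy; fit_predict(X_train, y_train, X_test) returns labels."""
    scores = []
    for train_idx, test_idx in k_fold_indices(len(X), k, seed):
        predictions = fit_predict(
            [X[i] for i in train_idx],
            [y[i] for i in train_idx],
            [X[i] for i in test_idx],
        )
        correct = sum(p == y[i] for p, i in zip(predictions, test_idx))
        scores.append(correct / len(test_idx))
    return sum(scores) / len(scores)


# Toy data and a hypothetical fixed rule standing in for a trained model.
X = list(range(100))
y = [x >= 50 for x in X]


def threshold_rule(X_train, y_train, X_test):
    return [x >= 50 for x in X_test]


mean_accuracy = cross_val_accuracy(threshold_rule, X, y, k=10)
print(f"mean 10-fold accuracy: {mean_accuracy:.4f}")
```

Each sample appears in exactly one test fold, so every instance is used for testing once and for training in the remaining folds.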

On the classification techniques in data mining for microarray data classification

Journal of Physics Conference Series ◽

10.1088/1742-6596/971/1/012004 ◽

2018 ◽

Vol 971 ◽

pp. 012004 ◽

Cited By ~ 2

Author(s):

Husna Aydadenta ◽

Adiwijaya

Keyword(s):

Data Mining ◽

Microarray Data ◽

Data Classification ◽

Classification Techniques

How Sweet and Ripe are the Fruits? Data Mining Techniques for Classifying and Predicting ‘Quick-Wins’ Direct Capital Investment in Indonesia as One Approach to Business intelligence Orientation and Knowledge Management Scenarios of Indonesian Enterprises

ACMIT Proceedings ◽

10.33555/acmit.v1i1.13 ◽

2019 ◽

Vol 1 (1) ◽

pp. 121-131

Author(s):

Ali Fauzi

Keyword(s):

Data Mining ◽

Business Intelligence ◽

Direct Investment ◽

Capital Investment ◽

Export Market ◽

Added Value ◽

Knowledge Generation ◽

Management Scenarios ◽

Data Mining Techniques ◽

Knowledge Based Economy

The large body of data on Indonesian FDI (foreign direct investment)/CDI (capital direct investment) has not yet been exploited to generate further ideas or a basis for decision making. Examples of exploiting such data with data mining techniques are clustering/labeling of DCI categories using K-Means and classification/prediction using Naïve Bayes. One form of DCI is the ‘Quick-Wins’, a.k.a. ‘Low-Hanging-Fruits’, Direct Capital Investment (DCI), abbreviated QWDI. Despite its unfavorable factors (exploitation of natural resources, low added-value creation, low-skill and low-wage employment, environmental impacts, etc.), QWDI is seen as contributing greatly to quick and high job creation, export market penetration, and the advancement of technological potential. By using some basic data mining techniques as complements to the usual statistical/query analysis, or to analysis from similar studies, this study is intended to support government planners, start-up companies, and financial institutions in further CDI development. The idea of business intelligence orientation and knowledge generation scenarios is also a valuable basis. In turn, enablement through Information and Communication Technology (ICT) will play a strategic role in the growth of Indonesian enterprises and as a foundation for a ‘knowledge based economy’ in Indonesia.

Depression, pain, and site:

Revista Neurociências ◽

10.34024/rnc.2007.v15.8710 ◽

1999 ◽

Vol 15 (1) ◽

pp. 10-17

Author(s):

Molina Omar Franklin ◽

Tavares Gimenes Pablo ◽

Aquilino Raphael ◽

Rank Rise ◽

Coelho Santos Zeila ◽

...

Keyword(s):

Visual Analogue Scale ◽

Beck Depression Inventory ◽

Temporomandibular Disorders ◽

Analogue Scale ◽

Epidemiological Data ◽

Depression Severity ◽

Internal Joint ◽

The Difference ◽

Biomechanical Tests

Objective: To assess the level of depression, severity of pain and pain in single/multiple sites in patients with different severity of bruxing behavior and Temporomandibular Disorders (TMDs). Methods: We evaluated 131 patients with bruxism and TMDs: 20 patients with mild bruxism, 42 patients with moderate bruxism, 45 patients with severe bruxism and 24 patients with extreme bruxism. We used the Beck Depression Inventory (BDI), clinical examination, a questionnaire of clinical epidemiological data, criteria for TMDs and bruxism, palpation of muscles and joints, the Visual Analogue Scale for pain, classification of the occlusion and biomechanical tests to assess for internal joint derangements. Results: The level of depression increased from the mild to the moderate, severe and extreme bruxing behavior groups, but the difference was significant only between the mild and the extreme group (p<0.001). Pain levels increased from the mild and moderate to the severe and extreme subgroups, but the differences were not statistically significant. The mean number of pain sites increased from the mild to the moderate, severe and extreme subgroups, and the difference was extremely significant (p<0.0001). Conclusion: Levels of depression, severity of pain and number of pain sites increased with severity of bruxing behavior. A higher number of pain sites with more severe bruxism indicates somatization in bruxers, but a further study using the same protocol and a psychological test for somatization would be needed to substantiate these findings.

The Difference of Self-management, Athletic Ability Beliefs, and Perceived Performance in Response to the Latent Profile Classification of Resilience of Professional

Korean Journal of Sports Science ◽

10.35159/kjss.2020.06.29.3.241 ◽

2020 ◽

Vol 29 (3) ◽

pp. 241-257

Author(s):

Seong-Moo Park ◽

Jin-Young Huh

Keyword(s):

Self Management ◽

Perceived Performance ◽

Athletic Ability ◽

The Difference ◽

Ability Beliefs ◽

Latent Profile

Artificial Intelligence in Fractured Dental Implant Detection and Classification: Evaluation Using Dataset from Two Dental Hospitals

Diagnostics ◽

10.3390/diagnostics11020233 ◽

2021 ◽

Vol 11 (2) ◽

pp. 233

Author(s):

Dong-Woon Lee ◽

Sung-Yong Kim ◽

Seong-Nyum Jeong ◽

Jae-Hong Lee

Keyword(s):

Dental Implant ◽

Classification Accuracy ◽

Reliability And Validity ◽

Mechanical Complication ◽

Radiographic Images ◽

Acceptable Accuracy ◽

Reliable Detection ◽

Classification Evaluation ◽

Accuracy Performance

Fracture of a dental implant (DI) is a rare mechanical complication that is a critical cause of DI failure and explantation. The purpose of this study was to evaluate the reliability and validity of three different deep convolutional neural network (DCNN) architectures (VGGNet-19, GoogLeNet Inception-v3, and automated DCNN) for the detection and classification of fractured DIs using panoramic and periapical radiographic images. A total of 21,398 DIs were reviewed at two dental hospitals, and 251 intact and 194 fractured DI radiographic images were identified and included as the dataset in this study. All three DCNN architectures achieved a fractured DI detection and classification accuracy of over 0.80 AUC. In particular, the automated DCNN architecture using periapical images showed the highest and most reliable detection (AUC = 0.984, 95% CI = 0.900–1.000) and classification (AUC = 0.869, 95% CI = 0.778–0.929) accuracy compared to the fine-tuned and pre-trained VGGNet-19 and GoogLeNet Inception-v3 architectures. The three DCNN architectures showed acceptable accuracy in the detection and classification of fractured DIs, with the best accuracy achieved by the automated DCNN architecture using only periapical images.
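The AUC figures reported in this abstract are areas under the ROC curve. As a minimal sketch of what that number measures, the AUC can be computed as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The `roc_auc` helper and the labels and scores below are hypothetical and unrelated to the study's data; the confidence intervals in the abstract are not computed here.

```python
def roc_auc(labels, scores):
    """AUC via pairwise comparison of positive (1) and negative (0) scores."""
    positives = [s for l, s in zip(labels, scores) if l == 1]
    negatives = [s for l, s in zip(labels, scores) if l == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0      # positive ranked above negative
            elif p == n:
                wins += 0.5      # ties count as half
    return wins / (len(positives) * len(negatives))


# Invented toy labels (1 = fractured, 0 = intact) and model scores.
labels = [1, 1, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.3, 0.6, 0.2]

auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

The pairwise form is quadratic in the number of examples, which is acceptable for illustration; rank-based formulas give the same value more efficiently on large datasets.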

Comparison of Night, Day and 24 h Motor Activity Data for the Classification of Depressive Episodes

Diagnostics ◽

10.3390/diagnostics10030162 ◽

2020 ◽

Vol 10 (3) ◽

pp. 162 ◽

Cited By ~ 4

Author(s):

Julieta G. Rodríguez-Ruiz ◽

Carlos E. Galván-Tejada ◽

Laura A. Zanella-Calzada ◽

José M. Celaya-Padilla ◽

Jorge I. Galván-Tejada ◽

...

Keyword(s):

Data Mining ◽

Motor Activity ◽

Random Forest Classifier ◽

Machine Learning Algorithms ◽

World Population ◽

Activity Data ◽

The World ◽

Depressive Episodes ◽

Full Day

Major depression has been increasing in recent years and affects around 7 percent of the world population, yet current techniques to diagnose it are outdated and inefficient. Over the last decade, motor activity data, analysed with machine learning algorithms, have been presented as a better way to diagnose, treat and monitor patients suffering from this illness. Disturbances in the circadian rhythm of mental illness patients increase the effectiveness of the data mining process. In this paper, motor activity data from the night, the day and the full day are compared through a data mining process using the Random Forest classifier to identify depressive and non-depressive episodes. Data from the Depressjon dataset are split into three different subsets, and 24 features in the time and frequency domains are extracted to select the best model for the classification of depressive episodes. The results show that the best dataset and model for classifying depressive episodes is the night motor activity data, with a sensitivity of 99.37% and a specificity of 99.91%.
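Sensitivity and specificity, the two measures reported in this abstract, follow directly from the confusion-matrix counts: sensitivity is TP / (TP + FN) and specificity is TN / (TN + FP). The sketch below is a plain-Python illustration; the `sensitivity_specificity` helper and the toy labels are hypothetical and do not come from the study.

```python
def sensitivity_specificity(y_true, y_pred, positive=1):
    """Return (sensitivity, specificity) for binary labels."""
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1          # depressive episode correctly detected
            else:
                fn += 1          # depressive episode missed
        else:
            if pred == positive:
                fp += 1          # non-depressive episode flagged in error
            else:
                tn += 1          # non-depressive episode correctly cleared
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


# Invented toy labels: 1 = depressive episode, 0 = non-depressive episode.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]

sensitivity, specificity = sensitivity_specificity(y_true, y_pred)
print(f"sensitivity = {sensitivity:.2%}, specificity = {specificity:.2%}")
```

The helper assumes both classes occur in `y_true`; with no positive or no negative examples the corresponding ratio is undefined and the division would fail.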
