IMPRECISE CLASSIFICATION WITH CREDAL DECISION TREES

In this paper, we present two contributions: (i) an adaptation of a precise classifier to imprecise classification for cost-sensitive problems; and (ii) a new measure of the performance of an imprecise classifier. The imprecise classifier is based on a method for building simple decision trees, which we have modified for imprecise classification. It uses the Imprecise Dirichlet Model (IDM) to represent information and the upper entropy as the splitting criterion. Our new measure for comparing imprecise classifiers takes errors into account, which existing measures for classifiers of this type have not done. It penalizes wrong predictions using a cost matrix of errors supplied by an expert, and it quantifies the success of an imprecise classifier based on the cardinality of the set of non-dominated states returned. To evaluate our imprecise classification method and the new measure, we compare them against a second imprecise classifier, the Naive Credal Classifier (NCC), which is a variation of the classic Naive Bayes using the IDM, and against a known measure for imprecise classification.
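
As a rough illustration of how the IDM can lead to a set of non-dominated states, the sketch below computes the standard IDM probability intervals from class counts, n_i / (N + s) and (n_i + s) / (N + s), and then keeps the classes that are not interval-dominated. The function names, the choice s = 1, and the use of interval dominance are assumptions made for illustration; the abstract does not state which dominance criterion or hyperparameter value the authors use.

```python
from typing import Dict, List, Tuple

Interval = Tuple[float, float]


def idm_intervals(counts: Dict[str, int], s: float = 1.0) -> Dict[str, Interval]:
    """Lower and upper class probabilities under the IDM.

    For class i with count n_i and total N:
        lower = n_i / (N + s),  upper = (n_i + s) / (N + s)
    The value of s is an assumption here, not taken from the paper.
    """
    total = sum(counts.values())
    return {c: (n / (total + s), (n + s) / (total + s)) for c, n in counts.items()}


def non_dominated(intervals: Dict[str, Interval]) -> List[str]:
    """Classes that survive interval dominance (an assumed criterion).

    A class c is dominated when some other class has a lower
    probability strictly greater than the upper probability of c.
    """
    return sorted(
        c
        for c, (_, upper_c) in intervals.items()
        if not any(
            lower_d > upper_c
            for d, (lower_d, _) in intervals.items()
            if d != c
        )
    )


# Hypothetical class counts at a leaf of a tree.
clear_leaf = idm_intervals({"a": 6, "b": 4, "c": 0})
tied_leaf = idm_intervals({"a": 5, "b": 5, "c": 0})

print(non_dominated(clear_leaf))  # a single class: a precise prediction
print(non_dominated(tied_leaf))   # two classes: an imprecise prediction
```

The size of the returned list is the cardinality that the abstract's measure refers to: a smaller set that still contains the true class is a more informative prediction.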

Classifying Documents with Respect to “Earnings” and Then Making a Predictive Model for the Target Variable Using Decision Trees, MARSplines, Naïve Bayes Classifier, and K-Nearest Neighbors with STATISTICA Text Miner

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications ◽

10.1016/b978-0-12-386979-1.00032-3 ◽

2012 ◽

pp. 773-796

Keyword(s):

Predictive Model ◽

Decision Trees ◽

Naive Bayes ◽

Nearest Neighbors ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Target Variable ◽

K Nearest Neighbors

Prediction of Heart Disease Using Machine Learning Algorithms

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.32.15714 ◽

2018 ◽

Vol 7 (2.32) ◽

pp. 363 ◽

Cited By ~ 5

Author(s):

N Rajesh ◽

Maneesha T ◽

Shaik Hafeez ◽

Hari Krishna

Keyword(s):

Risk Factors ◽

Heart Disease ◽

Decision Trees ◽

Naive Bayes ◽

Heart Diseases ◽

Naïve Bayes ◽

Machine Learning Algorithms ◽

Common Disease ◽

Bayes Algorithm

Heart disease is one of the most common diseases. Because it is so common nowadays, we used different attributes that relate well to heart disease to find a better prediction method, and we applied several algorithms for prediction. The Naive Bayes algorithm is analyzed on a dataset based on risk factors. We also used decision trees and a combination of algorithms to predict heart disease from the above attributes. The results show that the Naive Bayes algorithm gives accurate results when the dataset is small, and decision trees give accurate results when the dataset is large.

Research on Filtration System of Network Negative Information on the Basis of Naive Bayes

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.271-273.911 ◽

2011 ◽

Vol 271-273 ◽

pp. 911-916

Author(s):

Xi Ai Yan ◽

Jin Min Yang

Keyword(s):

Pattern Classification ◽

Naive Bayes ◽

Threshold Value ◽

Negative Information ◽

Naïve Bayes ◽

Classification Method ◽

Filtration System ◽

Strong Robustness

The difficulty of filtering negative network information lies in classifying the information correctly. Naïve Bayes, a classification method with strong robustness and good understandability, has been widely used in the field of pattern classification. This article puts forward a method for filtering negative network information based on Naïve Bayes, improvement proposals aimed at the disadvantages of Naïve Bayes, and a way to reduce erroneous judgments of negative information by setting a threshold value k. The experiment shows that by adjusting the threshold value k, the integrity of the system can be optimized and favorable application effects can be achieved.

Utilization of Prediction Data for Prospective Decision Customers Insurance Using the Classification Method of C.45 and Naive Bayes Algorithms

Journal of Physics Conference Series ◽

10.1088/1742-6596/1179/1/012023 ◽

2019 ◽

Vol 1179 ◽

pp. 012023

Author(s):

Saruni Dwiasnati ◽

Yudo Devianto

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Classification Method

Detecting Malicious Defects in 3D Printing Process Using Machine Learning and Image Classification

Volume 14: Emerging Technologies; Materials: Genetics to Structures; Safety Engineering and Risk Analysis ◽

10.1115/imece2016-67641 ◽

2016 ◽

Cited By ~ 13

Author(s):

Mingtao Wu ◽

Vir V. Phoha ◽

Young B. Moon ◽

Amith K. Belman

Keyword(s):

Machine Learning ◽

3D Printing ◽

Image Classification ◽

Decision Trees ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Printing Process

3D printing, or additive manufacturing, is a key technology for future manufacturing systems. However, 3D printing systems have unique vulnerabilities because the infill can be altered without affecting the exterior. To detect malicious infill defects in the 3D printing process, this paper proposes the following: 1) investigating malicious defects in the 3D printing process, 2) extracting features from simulated 3D printing process images, and 3) running an image classification experiment with one group of non-defect infill images and another group of defect infill training images from the 3D printing process. The images are captured layer by layer from the top view of the software simulation preview. The data extracted from the images is input to two machine learning algorithms, the Naive Bayes Classifier and J48 Decision Trees. The results show that the Naive Bayes Classifier achieves an accuracy of 85.26% and J48 Decision Trees an accuracy of 95.51% for classification.

Implementation of Naïve Bayes Classification Method for Predicting Purchase

2018 6th International Conference on Cyber and IT Service Management (CITSM) ◽

10.1109/citsm.2018.8674324 ◽

2018 ◽

Cited By ~ 7

Author(s):

Fitriana Harahap ◽

Ahir Yugo Nugroho Harahap ◽

Evri Ekadiansyah ◽

Rita Novita Sari ◽

Robiatul Adawiyah ◽

...

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Classification Method ◽

Naive Bayes Classification ◽

Naïve Bayes Classification

Implementation of Decision Tree and Naïve Bayes Classification Method for Predicting Study Period

Journal of Physics Conference Series ◽

10.1088/1742-6596/1569/2/022022 ◽

2020 ◽

Vol 1569 ◽

pp. 022022

Author(s):

N Pandiangan ◽

M L C Buono ◽

S H D Loppies

Keyword(s):

Decision Tree ◽

Naive Bayes ◽

Naïve Bayes ◽

Classification Method ◽

Naive Bayes Classification ◽

Naïve Bayes Classification

The image classification method based on multivariate Bernoulli naive Bayes with Dirichlet prior and hyper parameter optimization

2015 12th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP) ◽

10.1109/iccwamtip.2015.7493988 ◽

2015 ◽

Author(s):

Lin Li

Keyword(s):

Image Classification ◽

Parameter Optimization ◽

Naive Bayes ◽

Naïve Bayes ◽

Classification Method ◽

Multivariate Bernoulli ◽

Dirichlet Prior

A Semantic Scattering model for the automatic interpretation of English genitives

Natural Language Engineering ◽

10.1017/s1351324908004798 ◽

2009 ◽

Vol 15 (2) ◽

pp. 215-239 ◽

Cited By ~ 1

Author(s):

ADRIANA BADULESCU ◽

DAN MOLDOVAN

Keyword(s):

Support Vector Machines ◽

Decision Trees ◽

Naive Bayes ◽

Word Sense Disambiguation ◽

Naïve Bayes ◽

Semantic Relations ◽

Support Vector ◽

Word Sense ◽

Vector Machines ◽

Bayes Algorithm

An important problem in knowledge discovery from text is the automatic extraction of semantic relations. This paper addresses the automatic classification of the semantic relations expressed by English genitives. A learning model is introduced based on the statistical analysis of the distribution of genitives' semantic relations in a corpus. The semantic and contextual features of the genitive's noun phrase constituents play a key role in the identification of the semantic relation. The algorithm was trained and tested on a corpus of approximately 20,000 sentences and achieved an f-measure of 79.80 per cent for of-genitives, far better than the 40.60 per cent obtained using a Decision Trees algorithm, the 50.55 per cent obtained using a Naive Bayes algorithm, or the 72.13 per cent obtained using a Support Vector Machines algorithm on the same corpus using the same features. The results were similar for s-genitives: 78.45 per cent using Semantic Scattering, 47.00 per cent using Decision Trees, 43.70 per cent using Naive Bayes, and 70.32 per cent using a Support Vector Machines algorithm. The results demonstrate the importance of word sense disambiguation and semantic generalization/specialization for this task. They also demonstrate that different patterns (in our case the two types of genitive constructions) encode different semantic information and should be treated differently in the sense that different models should be built for different patterns.

Klasifikasi Berita Olahraga Menggunakan Metode Naïve Bayes dengan Enhanced Confix Stripping Stemmer

Jurnal Teknologi Informasi dan Ilmu Komputer ◽

10.25126/jtiik.201853810 ◽

2018 ◽

Vol 5 (3) ◽

pp. 269

Author(s):

Yoga Dwitya Pramudita ◽

Sigit Susanto Putro ◽

Nurul Makhmud

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Classification Method ◽

Easy Access ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

News Stories ◽

Long Time ◽

Sports News

Web-based sports news documents now accumulate in large numbers within a short time. For easy access, news documents need to be grouped into categories so that sports news is organized according to the specified categories. News can be grouped manually by humans, but this takes a long time. This research proposes a classification method to categorize news documents automatically. The purpose of classification is to speed up and simplify the assignment of categories, thereby improving time efficiency. This research uses the Naïve Bayes Classifier. Before classification, a preprocessing step applies the Enhanced Confix Stripping Stemmer, which reduces words to their base form so that the data shrinks and computation becomes more efficient. Testing used 18 sports news items selected at random by the user or tester; 14 of the 18 were classified correctly, that is, consistent with the user's or tester's own analysis of the test items. The study concludes that the Sports News Classification Application using the Naïve Bayes method with the Enhanced Confix Stripping Stemmer can classify sports news into its respective categories, such as football, basketball, racquet sports, Formula 1, MotoGP, and other sports, with an accuracy of 77%.
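
The reported accuracy can be checked against the test counts given in the abstract (14 correct out of 18). The short sketch below is only an arithmetic check; the helper name `accuracy` is hypothetical and not part of the authors' application. The ratio 14/18 is about 0.778, which matches the stated 77% if the figure was truncated rather than rounded.

```python
def accuracy(correct: int, total: int) -> float:
    """Fraction of test items classified correctly."""
    if total <= 0:
        raise ValueError("total must be positive")
    return correct / total


# Counts taken from the abstract: 14 of 18 test news items were correct.
acc = accuracy(14, 18)
print(f"{acc:.1%}")
```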
