Bayesian Prediction Model Based on Attribute Weighting and Kernel Density Estimations

2015 ◽  
Vol 2015 ◽  
pp. 1-7
Author(s):  
Zhong-Liang Xiang ◽  
Xiang-Ru Yu ◽  
Dae-Ki Kang

Although the naïve Bayes learner has been proven to show reasonable performance in machine learning, it often suffers from a few problems when handling real-world data. The first problem is the conditional independence assumption; the second is the use of the frequency estimator. We have therefore proposed methods to solve these two problems revolving around naïve Bayes algorithms. By using an attribute weighting method, we have been able to handle the conditional independence assumption issue, whereas, for the case of the frequency estimator, we have found a way to weaken its negative effects through our proposed smooth kernel method. In this paper, we propose a compact Bayes model in which a smooth kernel augments weights on likelihood estimation. We have also chosen an attribute weighting method that employs a mutual information metric to cooperate with the framework. Experiments have been conducted on UCI benchmark datasets, and the accuracy of our proposed learner has been compared with that of standard naïve Bayes. The experimental results demonstrate the effectiveness and efficiency of our proposed learning algorithm.
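The combination described above can be sketched minimally: per-attribute weights (e.g. derived from mutual information) scale the log-likelihood terms, and each likelihood is a Gaussian-kernel density estimate instead of a frequency estimate. The class name, bandwidth, and uniform default weights below are illustrative assumptions, not the authors' implementation.

```python
import math

class WeightedKernelNB:
    """Naive Bayes with kernel-smoothed likelihoods and attribute weights."""

    def __init__(self, bandwidth=1.0):
        self.bandwidth = bandwidth

    def fit(self, X, y, weights=None):
        self.classes = sorted(set(y))
        n = len(y)
        self.priors = {c: sum(1 for yc in y if yc == c) / n
                       for c in self.classes}
        self.samples = {c: [x for x, yc in zip(X, y) if yc == c]
                        for c in self.classes}
        # Uniform weights by default; mutual-information weights would go here.
        self.weights = weights if weights is not None else [1.0] * len(X[0])
        return self

    def _kde(self, value, points, j):
        # Average of Gaussian kernels centred on the training values of attribute j.
        h = self.bandwidth
        z = len(points) * h * math.sqrt(2 * math.pi)
        return sum(math.exp(-((value - p[j]) ** 2) / (2 * h * h))
                   for p in points) / z

    def predict(self, x):
        def log_posterior(c):
            lp = math.log(self.priors[c])
            for j, v in enumerate(x):
                lik = max(self._kde(v, self.samples[c], j), 1e-300)
                lp += self.weights[j] * math.log(lik)  # weighted log-likelihood
            return lp
        return max(self.classes, key=log_posterior)
```

The weight vector lets informative attributes dominate the posterior while near-irrelevant ones are attenuated, which is the role the mutual-information metric plays in the paper's framework.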

Mathematics ◽  
2021 ◽  
Vol 9 (19) ◽  
pp. 2378
Author(s):  
Shengfeng Gan ◽  
Shiqi Shao ◽  
Long Chen ◽  
Liangjun Yu ◽  
Liangxiao Jiang

Due to its simplicity, efficiency, and effectiveness, multinomial naive Bayes (MNB) has been widely used for text classification. As in naive Bayes (NB), its assumption of the conditional independence of features is often violated and therefore reduces its classification performance. Of the numerous approaches to alleviating this assumption, structure extension has attracted less attention from researchers. To the best of our knowledge, only structure-extended MNB (SEMNB) has been proposed so far. SEMNB averages all weighted super-parent one-dependence multinomial estimators; it is therefore an ensemble learning model. In this paper, we propose a single model called hidden MNB (HMNB) by adapting the well-known hidden NB (HNB). HMNB creates a hidden parent for each feature, which synthesizes the influences of all the other qualified features. For HMNB to learn, we propose a simple but effective learning algorithm that avoids a high-computational-complexity structure-learning process. Our idea can also be used to improve complement NB (CNB) and the one-versus-all-but-one model (OVA); the resulting models are denoted HCNB and HOVA, respectively. Extensive experiments on eleven benchmark text classification datasets validate the effectiveness of HMNB, HCNB, and HOVA.
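For reference, the baseline model that HMNB extends, plain multinomial naive Bayes over bag-of-words counts with Laplace smoothing, can be sketched as follows. This is a generic textbook MNB, not the authors' HMNB code, and the toy documents below are illustrative.

```python
import math
from collections import Counter

class MultinomialNB:
    """Plain multinomial naive Bayes for bag-of-words text classification."""

    def fit(self, docs, labels, alpha=1.0):
        self.classes = sorted(set(labels))
        self.vocab = sorted({w for d in docs for w in d})
        self.log_prior = {}
        self.log_lik = {}
        for c in self.classes:
            class_docs = [d for d, l in zip(docs, labels) if l == c]
            self.log_prior[c] = math.log(len(class_docs) / len(docs))
            counts = Counter(w for d in class_docs for w in d)
            # Laplace smoothing avoids zero likelihoods for unseen words.
            total = sum(counts.values()) + alpha * len(self.vocab)
            self.log_lik[c] = {w: math.log((counts[w] + alpha) / total)
                               for w in self.vocab}
        return self

    def predict(self, doc):
        def score(c):
            return self.log_prior[c] + sum(
                self.log_lik[c][w] for w in doc if w in self.log_lik[c])
        return max(self.classes, key=score)
```

HMNB's departure from this model is in the likelihood term: instead of each word depending only on the class, each feature also conditions on a hidden parent that aggregates the weighted influence of the other features.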


2019 ◽  
Vol 8 (4) ◽  
pp. 2240-2242

Phishing email has become a more dangerous problem in online bank transaction processing, as well as on social networking sites such as Facebook, Twitter, and Instagram. Phishing is normally carried out by spoofing an email, or via text embedded in the email body, which provokes users into entering their credentials. Training users against phishing is not very effective because users do not permanently remember their training or the warning messages; it depends entirely on the action the user takes, at a given moment, on the warning messages shown by software while opening any URL. In this paper, phishing email classification is enhanced using J48, Naïve Bayes, and decision tree classifiers on the Spambase dataset. J48 performs the best classification on Spambase, with 97% for true positives and 0.025% for false negatives. Random forest works best on small datasets, up to 5000 instances with 34 features, but as the dataset size increases and the number of features is reduced, Naïve Bayes works faster.
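The figures reported above, true-positive and false-negative rates, come from a confusion matrix over the classifier's predictions. A minimal sketch of that computation follows; the labels and predictions are illustrative, not the Spambase results.

```python
def confusion_counts(y_true, y_pred, positive="spam"):
    """Count TP, FN, FP, TN with respect to the chosen positive class."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    tn = sum(t != positive and p != positive for t, p in zip(y_true, y_pred))
    return tp, fn, fp, tn

def tpr_fnr(y_true, y_pred, positive="spam"):
    """True-positive and false-negative rates; both share the denominator TP+FN."""
    tp, fn, _, _ = confusion_counts(y_true, y_pred, positive)
    total_pos = tp + fn
    return (tp / total_pos, fn / total_pos) if total_pos else (0.0, 0.0)
```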


Author(s):  
Sheela Rani P ◽  
Dhivya S ◽  
Dharshini Priya M ◽  
Dharmila Chowdary A

Machine learning is an analysis discipline that uses data to improve learning, optimizing the training process and the environment in which learning happens. There are two types of machine learning approaches, supervised and unsupervised, which are used to extract the knowledge that helps decision-makers take correct interventions in the future. This paper introduces a model for predicting the factors that influence students' academic performance, using supervised machine learning algorithms such as support vector machine, KNN (k-nearest neighbors), Naïve Bayes, and logistic regression. The results of the various algorithms are compared, and it is shown that the support vector machine and Naïve Bayes perform well, achieving improved accuracy compared to the other algorithms. The final prediction model in this paper has fairly high prediction accuracy. The objective is not just to predict the future performance of students but also to provide the best technique for finding the most impactful features that influence students while studying.
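One of the compared algorithms, k-nearest neighbors, is simple enough to sketch in full: classify a point by a majority vote among its k closest training examples. The feature values and labels below are illustrative, not the paper's student data.

```python
import math
from collections import Counter

def knn_predict(X_train, y_train, x, k=3):
    """Majority vote among the k Euclidean-nearest training points."""
    neighbours = sorted(
        (math.dist(xt, x), yt) for xt, yt in zip(X_train, y_train))
    votes = Counter(label for _, label in neighbours[:k])
    return votes.most_common(1)[0][0]
```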


Smart cities, which are becoming overcrowded today, are making human life miserable and prone to more challenges on a daily basis. Overcrowding leads to the vast generation of waste, contributing to air pollution, which in turn affects health and causes various diseases. Even though various measures are taken to recycle waste, the rate at which it is produced keeps growing. This paper deals with the prediction of waste generation using the Naïve Bayes machine learning algorithm (classifier), based on the statistics of previous waste datasets. The datasets used for the prediction are obtained from reliable sources. The algorithm is implemented in PySpark using Anaconda Jupyter. The performance of the classifier on the datasets is analyzed with a confusion matrix, and the accuracy metric is used to rate the efficiency of the classifier. The accuracy obtained indicates that the algorithm can be used effectively for real-time prediction and that, under the independence assumption, it gives more accurate results for large input datasets.
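The evaluation step described above, a confusion matrix plus an accuracy metric, can be sketched minimally; the label values are illustrative, not drawn from the waste datasets.

```python
def confusion_matrix(y_true, y_pred, labels):
    """Counts indexed by (true label, predicted label)."""
    m = {(t, p): 0 for t in labels for p in labels}
    for t, p in zip(y_true, y_pred):
        m[(t, p)] += 1
    return m

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
```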


Author(s):  
Liwei Fan ◽  
Kim Leng Poh

A Bayesian Network (BN) encodes a relationship between graphs and probability distributions. In the past, BN was mainly used for knowledge representation and reasoning. Recent years have seen numerous successful applications of BN in classification, among which the Naïve Bayes classifier was found to be surprisingly effective in spite of its simple mechanism (Langley, Iba & Thompson, 1992). It is built upon the strong assumption that different attributes are independent of each other. Despite its many advantages, a major limitation of using the Naïve Bayes classifier is that real-world data may not always satisfy the independence assumption among attributes. This strong assumption can make the prediction accuracy of the Naïve Bayes classifier highly sensitive to correlated attributes. To overcome the limitation, many approaches have been developed to improve the performance of the Naïve Bayes classifier. This article gives a brief introduction to the approaches which attempt to relax the independence assumption among attributes or use certain pre-processing procedures to make the attributes as independent of each other as possible. Previous theoretical and empirical results have shown that the performance of the Naïve Bayes classifier can be improved significantly by using these approaches, while the computational complexity also increases to a certain extent.
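The independence assumption the article discusses can be stated compactly: the Naïve Bayes classifier factorizes the class-conditional likelihood over the attributes, so the posterior over class $c$ given attributes $x_1, \dots, x_n$ is

```latex
P(c \mid x_1, \dots, x_n) \;\propto\; P(c) \prod_{j=1}^{n} P(x_j \mid c)
```

a factorization that holds exactly only when the attributes are conditionally independent given the class, which is why correlated attributes degrade its accuracy.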

