Naïve Bayes Filter for Communication & Enhancing Semantic in Email

Due tothe current pandemic of COVID-19, the world has turned into ONLINE modeand an increase in online communication thereby information exchange, sharing useful data through emails and other social Medias.So addressing the security issues places a vital role in computer security and shouldhave thepriorities. We need a security check to enhance the inbox so that the important information or emails should not reach to the spam box. In this paper to improve the filtering techniques, wehave adopted the Naïve Bayes approach in implementation and enhancing the spam filter in the email. Bayes's approach is efficient, accurate, and simple in implementing the proposed algorithm. Bayes algorithm is used to verify correct semantic information of the email and avoidsthe pass to pass approach if the incoming mail is important. The Python language is used to develop the proposed algorithm.

Download Full-text

Detecting spam e-mails using stop word TF-IDF and stemming algorithm with Naïve Bayes classifier on the multicore GPU

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v11i4.pp3168-3175 ◽

2021 ◽

Vol 11 (4) ◽

pp. 3168

Author(s):

Manjit Jaiswal ◽

Sukriti Das ◽

Khushboo Khushboo

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Testing Time ◽

Training Time ◽

Spam Filter ◽

Time Period ◽

Stop Word ◽

Working Principle ◽

Testing Accuracy ◽

Bayes Algorithm

<span>A spam filter is a program which is used to identify unwanted emails and prevents those messages from getting into a user's mail. The study was focused on how the algorithms can be applied on a number of e-mails consisting of both ham and spam e-mails. First, the working principle and steps which are followed for implementation of stop words, TF-IDF and stemming algorithm on NVIDIA’s Tesla P100 GPU are discussed and to verify the findings by executing of Naïve Bayes algorithm. After complete training and testing of the spam e-mails dataset taken from Kaggle by using the proposed method, we got a high training accuracy of 99.67% and got a testing accuracy of about 99.03% on the multicore GPU that boosted the speed of execution of training time period and testing time period which is improved of training and testing accuracy around 0.22% and 0.18% respectively when compared to that after applying only Naïve Bayes i.e. conventional method to the same dataset where we found training and testing accuracy to be 99.45% and 98.85% respectively. Also, we found that training time taken on GPU is 1.361 seconds which was about 1.49X faster than that taken on CPU which is 2.029 seconds. And the testing time taken on GPU is 1.978 seconds which was about 1.15X faster than that taken on CPU which is 2.280 seconds.</span>

Download Full-text

Indonesian language email spam detection using N-gram and Naïve Bayes algorithm

Bulletin of Electrical Engineering and Informatics ◽

10.11591/eei.v9i5.2444 ◽

2020 ◽

Vol 9 (5) ◽

pp. 2012-2019

Author(s):

Yustinus Vernanda ◽

Seng Hansun ◽

Marcel Bonar Kristanda

Keyword(s):

Data Exchange ◽

Naive Bayes ◽

Naïve Bayes ◽

Bayesian Filtering ◽

Spam Filter ◽

N Gram ◽

Bayes Algorithm ◽

Rest Api ◽

Email Spam ◽

F Measure

Indonesia is ranked the top 8th out of the total country population in the world for the global spammers. Web-based spam filter service with the REST API type can be used to detect email spam in the Indonesian language on the email server or various types of email server applications. With REST API, then there will be data exchange between the applications with JSON data type using existing HTTP commands. One type of spam filter commonly used is Bayesian Filtering, where the Naïve Bayes algorithm is used as a classification algorithm. Meanwhile, the N-gram method is used to increase the accuracy of the implementation of the Naïve Bayes algorithm in this study. N-gram and Naïve Bayes algorithms to detect spam email in the Indonesian language have successfully been implemented with accuracy around 0.615 until 0.94, precision at 0.566 until 0.924, recall at 0.96 until 1.00, and F-measure at 0.721 until 0.942. The best solution is found by using the 5-gram method with the highest score of accuracy at 0.94, precision at 0.924, recall at 0.96, and F-measure value at 0.942.

Download Full-text

Identifying the User As Genuine/Malign Based on Search Logs and Search History

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.a2752.059120 ◽

2020 ◽

Vol 9 (1) ◽

pp. 2046-2048

Keyword(s):

Machine Learning ◽

Naive Bayes ◽

Naïve Bayes ◽

Machine Learning Algorithms ◽

Main Research ◽

Security Issues ◽

Specific Category ◽

System Logs ◽

Bayes Algorithm ◽

Search Logs

-One of the major challenges a developer may face is security issues/threats on the labelled data. The labelled data comprises of system logs, network traffic or any other enriched data with threat/not threat classification. . There were few studies which categorized the URLs to a specific category like Arts, Technology, etc. In this paper the main research is on the classification of users based on the search logs(URLs). Manually it is difficult to differentiate the user based on search logs. So, we train a machine learning model that takes raw data as input and classifies the user to genuine or malign. This model helps in intrusion detection/suspicious activity detection. For this first we gather data of past malicious URLS as training set for Naïve Bayes algorithm to detect the malicious users. By implementing KNN algorithm effectively we can detect the malign users up to an accuracy of 94.28%. With the help of Machine Learning algorithms like Naïve Bayes, KNN, Random Forest classifiers we can classify the malign and genuine users.

Download Full-text

Emotion Identification between POMS and Multinomial Naive Bayes Algorithm Using Twitter API

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v7i7.1419 ◽

2019 ◽

Vol 7 (7) ◽

pp. 14-19 ◽

Cited By ~ 1

Author(s):

Asharani S Dandoti ◽

Sunil M Sangve

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Emotion Identification ◽

Bayes Algorithm

Download Full-text

Algorithm Comparation of Naive Bayes and Support Vector Machine based on Particle Swarm Optimization in Sentiment Analysis of Freight Forwarding Services

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v4i2.1840 ◽

2020 ◽

Vol 4 (2) ◽

pp. 362-369

Author(s):

Sharazita Dyah Anggita ◽

Ikmah

Keyword(s):

Support Vector Machine ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

The Public ◽

Svm Algorithm ◽

Bayes Algorithm ◽

Freight Forwarding ◽

Improved Accuracy

The needs of the community for freight forwarding are now starting to increase with the marketplace. User opinion about freight forwarding services is currently carried out by the public through many things one of them is social media Twitter. By sentiment analysis, the tendency of an opinion will be able to be seen whether it has a positive or negative tendency. The methods that can be applied to sentiment analysis are the Naive Bayes Algorithm and Support Vector Machine (SVM). This research will implement the two algorithms that are optimized using the PSO algorithms in sentiment analysis. Testing will be done by setting parameters on the PSO in each classifier algorithm. The results of the research that have been done can produce an increase in the accreditation of 15.11% on the optimization of the PSO-based Naive Bayes algorithm. Improved accuracy on the PSO-based SVM algorithm worth 1.74% in the sigmoid kernel.

Download Full-text

Analysis of Sentiment of Moving a National Capital with Feature Selection Naive Bayes Algorithm and Support Vector Machine

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v4i3.1942 ◽

2020 ◽

Vol 4 (3) ◽

pp. 504-512

Author(s):

Faried Zamachsari ◽

Gabriel Vangeran Saragih ◽

Susafa'ati ◽

Windu Gata

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Feature Selection ◽

Public Opinion ◽

Naive Bayes ◽

Naïve Bayes ◽

Capital City ◽

Support Vector ◽

National Capital ◽

Bayes Algorithm

The decision to move Indonesia's capital city to East Kalimantan received mixed responses on social media. When the poverty rate is still high and the country's finances are difficult to be a factor in disapproval of the relocation of the national capital. Twitter as one of the popular social media, is used by the public to express these opinions. How is the tendency of community responses related to the move of the National Capital and how to do public opinion sentiment analysis related to the move of the National Capital with Feature Selection Naive Bayes Algorithm and Support Vector Machine to get the highest accuracy value is the goal in this study. Sentiment analysis data will take from public opinion using Indonesian from Twitter social media tweets in a crawling manner. Search words used are #IbuKotaBaru and #PindahIbuKota. The stages of the research consisted of collecting data through social media Twitter, polarity, preprocessing consisting of the process of transform case, cleansing, tokenizing, filtering and stemming. The use of feature selection to increase the accuracy value will then enter the ratio that has been determined to be used by data testing and training. The next step is the comparison between the Support Vector Machine and Naive Bayes methods to determine which method is more accurate. In the data period above it was found 24.26% positive sentiment 75.74% negative sentiment related to the move of a new capital city. Accuracy results using Rapid Miner software, the best accuracy value of Naive Bayes with Feature Selection is at a ratio of 9:1 with an accuracy of 88.24% while the best accuracy results Support Vector Machine with Feature Selection is at a ratio of 5:5 with an accuracy of 78.77%.

Download Full-text

A Study on Online Detection of micro-blog Rumors Based on Naive Bayes Algorithm

2020 Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC) ◽

10.1109/ipec49694.2020.9115171 ◽

2020 ◽

Author(s):

Gan Wenfeng ◽

Zhang Hong ◽

Cheng Ruoyi

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Online Detection ◽

Bayes Algorithm

Download Full-text

A novel way to classify passenger data using Naïve Bayes algorithm (A real time anti-terrorism approach)

2016 2nd International Conference on Next Generation Computing Technologies (NGCT) ◽

10.1109/ngct.2016.7877433 ◽

2016 ◽

Author(s):

Saurabh Singh ◽

Shashikant Verma ◽

Akhilesh Tiwari ◽

Aditya Tiwari

Keyword(s):

Real Time ◽

Naive Bayes ◽

Naïve Bayes ◽

Bayes Algorithm

Download Full-text

The Sentiment Analysis Reviewing Indosat Services from Twitter Using the Naive Bayes Classifier

Journal of Applied Computer Science and Technology ◽

10.52158/jacost.v1i2.79 ◽

2020 ◽

Vol 1 (2) ◽

pp. 61-66

Author(s):

Febri Astiko ◽

Achmad Khodar

Keyword(s):

Machine Learning ◽

Social Media ◽

Sentiment Analysis ◽

Naive Bayes ◽

Learning Model ◽

Naïve Bayes ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Machine Learning Model ◽

Bayes Algorithm

This study aims to design a machine learning model of sentiment analysis on Indosat Ooredoo service reviews on social media twitter using the Naive Bayes algorithm as a classifier of positive and negative labels. This sentiment analysis uses machine learning to get patterns an model that can be used again to predict new data.

Download Full-text

Research and Application of Artificial Intelligence Based Integrated Teaching-Learning Modular Approach in Colleges and Universities

Journal of Interconnection Networks ◽

10.1142/s0219265921430064 ◽

2021 ◽

Author(s):

Lingchong Jia ◽

B. Santhosh Kumar ◽

R. Parthasarathy

Keyword(s):

Artificial Intelligence ◽

Student Development ◽

Naive Bayes ◽

Naïve Bayes ◽

Learning Performance ◽

Barriers To Entry ◽

Final Exam ◽

Artificial Intelligence Technology ◽

Bayes Algorithm ◽

Teaching Learning

Nowadays, in various educational institutions, artificial intelligence technology is applied effectively and successfully. This artificial intelligence improves learning and student development in academic performance. Challenges of the conventional education approach, students’ dependence on teachers in all resources for study, unavailability of professional instructors, and a greater focus on conditioning learning than practical usefulness lead to lower learning performance. In this paper integrated teaching-learning model approach has been proposed using artificial intelligence in student education. It involves speeding up fulfilling education targets by reducing barriers to entry, automating management processes, and maximizing learning performance. The proposed ITLMA method used the naive Bayes algorithm to evaluate the student ranking using a class score, task, project score, and final exam. The result of artificial intelligence-based ITLMA and naive Bayes algorithm hasa high accuracy ratio of 80.1% with less error ratio of 15.7%, high prediction 88.2%, precision 98.2%, and improves student and teacher interaction compared to other existing methods.

Download Full-text