Perbandingan Kinerja Metode Naive Bayes dan K-Nearest Neighbor untuk Klasifikasi Artikel Berbahasa indonesia

Kecocokan isi artikel dengan sebuah tema jurnal menjadi faktor utama diterima tidaknya sebuah artikel. Tetapi masih banyak mahasiswa yang bingung untuk menentukan jurnal yang sesuai dengan artikel yang dimilikinya. Untuk itu diperlukannya sebuah metode klasifikasi dokumen yang dapat mengelompokkan artikel secara otomatis dan akurat. Terdapat banyak metode klasifikasi yang dapat digunakan. Metode yang digunakan dalam penelitian ini adalah Naive Bayes dan sebagai baseline digunakan metode K-Nearest Neighbor. Metode Naive Bayes dipilih karena dapat menghasilkan akurasi yang maksimal dengan data latih yang sedikit. Sedangkan metode K-Nearest Neighbor dipilih karena metode tersebut tangguh terhadap data noise. Kinerja dari kedua metode tersebut akan dibandingkan, sehingga dapat diketahui metode mana yang lebih baik dalam melakukan klasifikasi dokumen. Hasil yang didapatkan menunjukkan metode Naive Bayes memiliki kinerja yang lebih baik dengan tingkat akurasi 70%, sedangkan metode K-Nearest Neighbor memiliki tingkat akurasi yang cukup rendah yaitu 40%. AbstractOne way to be accepted in a journal conference and get the publication is to create an article with perfect suitability content of the journal. Matching the content of the article with a journal theme is the main factor for acceptability an article. But there are still many students who are confused to choose the journal in accordance with the articles it has. So we need a method to classification article documents category automatically and accurately group articles. There are many classification methods that can be used. The method used in this study is Naive Bayes and as a baseline the K-Nearest Neighbor method. Naive Bayes method is chosen because it can produce maximum accuracy with little training data. While K-Nearest Neighbor method was chosen because the method is robust to data noise. The performance of the two methods will be compared, so we can be known which method is better in classifying the document. The results show that the Naive Bayes method performs is more accurate with 70% accuracy and K-Nearest Neighbors method has a fairly low accuracy of 40% on classification test.

Download Full-text

Sentiment Analysis of the Enforcement of PSBB Part II in Jakarta

JELIKU (Jurnal Elektronik Ilmu Komputer Udayana) ◽

10.24843/jlk.2020.v09.i02.p13 ◽

2020 ◽

Vol 9 (2) ◽

pp. 259

Author(s):

Gede Putra Aditya Brahmantha ◽

I Wayan Santiyasa

Keyword(s):

Social Media ◽

Sentiment Analysis ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Web Crawling ◽

K Nearest Neighbor ◽

Bayes Method ◽

The Public ◽

Naive Bayes Method

In addition to communicating, Social Media is a place to issue opinions by the public on many things that are currently taking place, Twitter is one of these social medias that is widely used in conveying opinions regardless of whether these opinions are negative, positive, or even neutral. Tweets data about the Enforcement of PSBB Part II in Jakarta were obtained as many as 200 opinions using web crawling then advanced to the preprocessing stage before being classified using the K-Nearest Neighbor and Multinomial Naive Bayes algorithms. In 3 tests, the highest accuracy was 65.00% for K-Nearest Neighbor and the highest accuracy was 85.00% for Multinomial Naive Bayes method.

Download Full-text

Application Design of Dengue Hemorrhagic Fever Patients Screening Using Naive Bayes Method

Journal of Biomimetics Biomaterials and Biomedical Engineering ◽

10.4028/www.scientific.net/jbbbe.34.20 ◽

2017 ◽

Vol 34 ◽

pp. 20-28

Author(s):

Cindy Astelia Ramadhan Suparman ◽

Endah Purwanti ◽

Prihartini Widiyanti

Keyword(s):

Dengue Hemorrhagic Fever ◽

Naive Bayes ◽

Public Health Problem ◽

Hemorrhagic Fever ◽

Naïve Bayes ◽

Training Data ◽

Design System ◽

Design Calculation ◽

Bayes Method ◽

Naive Bayes Method

Dengue Hemorrhagic Fever is a disease which is endemic in most districts / cities still becomes a public health problem in Indonesia. The awareness of people to the dengue viral infection and its symptoms are needed to decrease the fatality of this disease. The community need to be known the symptoms thereby they could intervened and prevent from falling in to worse condition. This study was conducted to design system which could diagnose the onset of the disease with 3 levels of possibilities namely Grade 1 Dengue Hemorrhagic Fever, Grade 2 Dengue Hemorrhagic Fever, and Non Dengue Hemorrhagic Fever. The system is build based on patient medical records of Dr. Wahidin Sudiro Husodo General Hospital, Mojokerto, East Java using the Naive Bayes method. The method of this study including several steps such as collecting data, preprocessing data, designing database, interface design, calculation and processing data, classification and analyzing data and evaluating application. Determining the results of the application diagnose requires posterior calculation which searches the highest values in three degrees as the results of the initial diagnose. The application as a device for an early diagnosis of dengue hemorrhagic fever has a high accuracy value of 97% out of the 30 tested data. The homogenization of the training data and the test data by sex and age can be considered in future research.

Download Full-text

Application Of Naïve Bayes to Predict the Potential of Rain in Ternate City

E3S Web of Conferences ◽

10.1051/e3sconf/202132804011 ◽

2021 ◽

Vol 328 ◽

pp. 04011

Author(s):

Alwin Ali ◽

Amal Khairan ◽

Firman Tempola ◽

Achmad Fuad

Keyword(s):

Test Data ◽

Naive Bayes ◽

Naïve Bayes ◽

Training Data ◽

Bayes Method ◽

Calculation Results ◽

Bayes Algorithm ◽

Correct Data ◽

Rain Data ◽

Naive Bayes Method

The amount of rainfall that occurs cannot be determined with certainty, but it can be predicted or estimated. In predicting the potential for rain, data mining techniques can be used by classifying data using the naive Bayes method. Naïve Bayes algorithm is a classification method using probability and statistical methods. The purpose of this study is how to implement the naive Bayes method to predict the potential for rain in Ternate City, and be able to calculate the accuracy of the Naive Bayes method from system created. The highest calculation results with new data with a total of 400 training data and 30 test data, obtained 30 correct data with 100% precision, 100% recall and 100% accuracy and the lowest calculation results with new data with a total of 500 training data and 50 test data, obtained 38 correct data and 12 incorrect data with a percentage of precision 61.29%, recall 100% and accuracy 76%.

Download Full-text

Estimation System For Late Payment Of School Tuition Fees

International Journal of Computer and Information System (IJCIS) ◽

10.29040/ijcis.v1i1.5 ◽

2020 ◽

Vol 1 (1) ◽

pp. 1-6

Author(s):

Muqorobin Muqorobin ◽

Kusrini Kusrini ◽

Siti Rokhmah ◽

Isnawati Muslihah

Keyword(s):

Naive Bayes ◽

Information Gain ◽

Educational Institution ◽

Confusion Matrix ◽

Naïve Bayes ◽

K Nearest Neighbor ◽

Tuition Fees ◽

Bayes Method ◽

Estimation System ◽

Naive Bayes Method

The Surakarta Al-Islam Vocational School is a private educational institution that requires all students to pay school tuition fees. Education is an obligation for all Indonesian citizens. The cost of education is one of the most important input components in implementing education. Because cost is the main requirement in achieving educational goals. SPP School is a routine school fee that is carried out every month. Based on last year's School Admin report, many students were late in paying school tuition fees, around 60%. This is a very big problem because the income of school funds comes from school tuition. The purpose of this research is that the researcher will build a prediction system using the best classification method, which is to compare the accuracy level of the Naïve Bayes method with the K-K-Nearest Neighbor method. Because both methods can make class classifications right or late, in paying school fees. processing using dapodic data for 2017/2018 as many as 236 data. In improving accuracy, the researcher also applies feature selection with Information Gain, which is useful for selecting optimal parameters. System testing is carried out using the Confusion Matrix method. The final results of this study indicate that the Naïve Bayes Method + Information Gain Method produces the highest accuracy, namely 95% compared to the Naïve Bayes method alone, namely 85% and the K-NN method, namely 81%.

Download Full-text

COMPARING NAIVE BAYES, K-NEAREST NEIGHBOR, AND NEURAL NETWORK CLASSIFICATION METHODS OF SEAT LOAD FACTOR IN LOMBOK OUTBOUND FLIGHTS

Jurnal Matematika Statistika dan Komputasi ◽

10.20956/jmsk.v16i2.7864 ◽

2019 ◽

Vol 16 (2) ◽

pp. 187

Author(s):

Mega Luna Suliztia ◽

Achmad Fauzan

Keyword(s):

Neural Network ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Load Factor ◽

Classification Methods ◽

Affecting Factors ◽

K Nearest Neighbor ◽

Accuracy Rate ◽

Ticket Prices

Classification is the process of grouping data based on observed variables to predict new data whose class is unknown. There are some classification methods, such as Naïve Bayes, K-Nearest Neighbor and Neural Network. Naïve Bayes classifies based on the probability value of the existing properties. K-Nearest Neighbor classifies based on the character of its nearest neighbor, where the number of neighbors=k, while Neural Network classifies based on human neural networks. This study will compare three classification methods for Seat Load Factor, which is the percentage of aircraft load, and also a measure in determining the profit of airline.. Affecting factors are the number of passengers, ticket prices, flight routes, and flight times. Based on the analysis with 47 data, it is known that the system of Naïve Bayes method has misclassifies in 14 data, so the accuracy rate is 70%. The system of K-Nearest Neighbor method with k=5 has misclassifies in 5 data, so the accuracy rate is 89%, and the Neural Network system has misclassifies in 10 data with accuracy rate 78%. The method with highest accuracy rate is the best method that will be used, which in this case is K-Nearest Neighbor method with success of classification system is 42 data, including 14 low, 10 medium, and 18 high value. Based on the best method, predictions can be made using new data, for example the new data consists of Bali flight routes (2), flight times in afternoon (2), estimate of passenger numbers is 140 people, and ticket prices is Rp.700,000. By using the K-Nearest Neighbor method, Seat Load Factor prediction is high or at intervals of 80% -100%.

Download Full-text

COMPATIBILITY OF SELECTION OF STUDENT DEPARTMENTS USING k-NEAREST NEIGHBOR AND NAÏVE BAYES CLASSIFIER IN INFORMATICS PRIVATE VOCATIONAL SCHOOL, SERANG CITY

JISA(Jurnal Informatika dan Sains) ◽

10.31326/jisa.v4i1.893 ◽

2021 ◽

Vol 4 (1) ◽

pp. 33-39

Author(s):

Budi Pangestu ◽

Keyword(s):

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

K Nearest Neighbor ◽

Prospective Students ◽

Naïve Bayes Classifier ◽

K Nearest Neighbors ◽

Selection Of

Selection of majors by prospective students when registering at a school, especially a Vocational High School, is very vulnerable because prospective students usually choose a major not because of their individual wishes. And because of the increasing emergence of new schools in cities and districts in each province in Indonesia, especially in the province of Banten. Problems experienced by prospective students when choosing the wrong department or not because of their desire, so that it has an unsatisfactory value or value in each semester fluctuates, especially in their Productive Lessons or Competencies. To provide a solution, a departmental suitability system is needed that can provide recommendations for specialization or major suitability based on students' abilities through attributes that can later assist students in the suitability of majors. The process of classifying the suitability of majors in data mining uses the k-Nearest Neighbor and Naive Bayes Classifier methods by entering 16 (sixteen) criteria or attributes which can later provide an assessment of students through this test when determining the majors for themselves, and there is no interference from people. another when choosing a major later. Research that has been carried out successfully using the k-Nearest Neighbors method has a higher recall of 99%, 81% accuracy and 82% precision compared to the Naïve Bayes Classifier whose recall only yields 98% while the accuracy and precision is the same as the k- Nearest Neighbors.

Download Full-text

Perbandingan Naïve bayes dan Certainty factor pada Sistem Pakar Untuk Mendiagnosa Dini Penyakit Glaukoma

Jurnal JTIK (Jurnal Teknologi Informasi dan Komunikasi) ◽

10.35870/jtik.v5i3.183 ◽

2020 ◽

Vol 5 (3) ◽

pp. 291

Author(s):

Hanif Rahman Burhani ◽

Iskandar Fitri ◽

Andrianingsih Andrianingsih

Keyword(s):

Expert System ◽

Naive Bayes ◽

Comparison Method ◽

Naïve Bayes ◽

Training Data ◽

Certainty Factor ◽

Bayes Method ◽

Lack Of Information ◽

Statistics And Probability ◽

Naive Bayes Method

Glaucoma is an eye disease that causes the second largest blindness after cataracts, this disease can cause decreased vision and can even be fatal, namely permanent blindness if it is not realized and treated immediately. Lack of information and education to the public to always maintain eye health is the basis for the purpose of making this expert system which aims to provide early diagnosis to people who are indicated to have glaucoma based on the symptoms or characteristics previously felt. The Naïve bayes method is a method that uses statistics and probability in predicting a person's chance of suffering from glaucoma based on the symptoms previously felt. It is made based on a website with PHP as the programming language and uses MySQL for the database. As for the comparison method used is the Certainty factor, which is a method that functions to determine a certainty value based on the calculation of the predetermined CF value by applying manual calculations. In the Naïve bayes method, the application can group symptom data and types of disease and can diagnose based on previous training data, while for the Certainty factor method based on the calculation of the value of the expert and the CF value that has been inputted by the user, it can produce a percentage of the diagnosis of the disease glaucoma in 96%.Keywords:Certainty factor, Expert System, Glaucoma, MySQL, Naïve bayes, PHP.

Download Full-text

Comparative Analysis of K-NN and Naïve Bayes Methods to Predict Stock Prices

International Journal of Computer and Information System (IJCIS) ◽

10.29040/ijcis.v2i2.32 ◽

2021 ◽

Vol 2 (2) ◽

pp. 49-53

Author(s):

Budi Soepriyanto

Keyword(s):

Stock Prices ◽

Stock Price ◽

Nearest Neighbor ◽

Naive Bayes ◽

High Accuracy ◽

Naïve Bayes ◽

Share Price ◽

Classification Methods ◽

K Nearest Neighbor ◽

Bayes Methods

Abstract— Buying and selling shares is a transaction that is widely carried out at this time, especially buying and selling stocks online which are widely available in the market, to make buying and selling shares require ability or knowledge so that the buying and selling of shares are profitable, to be able to help economic players predict prices. Profit shares or not purchased in the future, this research will conduct stock price predictions using classification methods, namely K-Nearest Neighbor and Naïve Bayes, to predict the stock price data used for one month in minute levels totalling 39065 data, based on prediction results. The highest results obtained were using Naïve Bayes with an accuracy value of 69.38 then the K-Nearest Neighbor method with a K = 5 value of 67.25%, based on these results it can be concluded that the use of the K-Nearest Neighbor and Naïve Bayes methods for prediction share price not yet owned I high accuracy, so it can be combined with other methods or by using other variable predictors.

Download Full-text

Penerapan Algoritma Naïve Bayes Dalam Penentuan Kelayakan Penerima Kartu Jakarta Pintar Plus

Jurnal Teknik Komputer ◽

10.31294/jtk.v6i1.6680 ◽

2020 ◽

Vol 6 (1) ◽

pp. 15-20

Author(s):

Endah Widya Ningsih ◽

Hardiyan Hardiyan

Keyword(s):

Smart Card ◽

Naive Bayes ◽

Naïve Bayes ◽

Training Data ◽

Bayes Method ◽

Using Data ◽

Naive Bayes Method

The eligibility of Jakarta Smart Card Plus recipients is still not on target due to subjective receipts. Schools has an important role in making decisions about the eligibility of Jakarta Smart Plus Card recipients. Therefore, the authors make this research using data that already exists or is called training data. The author uses the Naïve Bayes method with 7 independent attributes to knowing eligibility. The author also uses the another application Rapidmined 5.3 to test the accuracy of the Naïve Bayes method. The result of this research the accuracy of determining the eligibility of Jakarta Smart Plus Card recipients are good with 98.88% with an error of presentation 2.22%. So it can be concluded that the naive bayes method can help detrermine the eligibility of jakarta smart plus card recipients. Keywords: Jakarta Smart Card, Naïve Bayes, eligibility

Download Full-text

ANALISIS MODEL NAIVE BAYES UNTUK IDENTIFIKASI PENGGOLONGAN DAYA LISTRIK DI KOTA LHOKSUMAWE

KOMIK (Konferensi Nasional Teknologi Informasi dan Komputer) ◽

10.30865/komik.v2i1.971 ◽

2018 ◽

Vol 2 (1) ◽

Author(s):

Muhammad Saidi ◽

Fajriana Fajriana ◽

Wahyu Fuadi ◽

Ermatita Ermatita ◽

Iwan Pahendra

Keyword(s):

Naive Bayes ◽

Electrical Power ◽

Naïve Bayes ◽

Training Data ◽

Bayes Method ◽

Customer Data ◽

Poor Households ◽

House Area ◽

Naive Bayes Method

Electricity subsidy is provided for all 450 VA power household customers and 900 VA power household customers who are poor and disadvantaged. However, there are many facts that household customers with 450 VA power are capable and 900 VA power household customers consist of capable households, boarding houses or luxury rented. Households are able to use more electricity than poor households. This paper describe to the identification of household customers' electrical power in the Lhokseumawe city to facilitate PLN in classifying customer power by using the Naive Bayes method. Naive bayes value variables used in this study are: monthly income, highest diploma, last job, house area, subscription fee and government registered household. The classification of household customer power is grouped into three categories, namely low (450 VA down), medium (900 VA) and high (above 1300 VA).. Based on household customer data that is used as training data, the Naive Bayes method is able to classify the customer data tested. So the Naive Bayes method successfully predicts the magnitude of the probability of household electrical power with an accuracy percentage of 80%.Keywords: Electricity, Naive Bayes, CBS, low birth weight, subsidy

Download Full-text