scholarly journals Perbandingan Kinerja Metode Naive Bayes dan K-Nearest Neighbor untuk Klasifikasi Artikel Berbahasa indonesia

2018 ◽  
Vol 5 (4) ◽  
pp. 427 ◽  
Author(s):  
Riri Nada Devita ◽  
Heru Wahyu Herwanto ◽  
Aji Prasetya Wibawa

<p class="Abstrak">Kecocokan isi artikel dengan sebuah tema jurnal menjadi faktor utama diterima tidaknya sebuah artikel. Tetapi masih banyak mahasiswa yang bingung untuk menentukan jurnal yang sesuai dengan artikel yang dimilikinya. Untuk itu diperlukannya sebuah metode klasifikasi dokumen yang dapat mengelompokkan artikel secara otomatis dan akurat. Terdapat banyak metode klasifikasi yang dapat digunakan. Metode yang digunakan dalam penelitian ini adalah <em>Naive Bayes</em> dan sebagai <em>baseline </em>digunakan metode <em>K-Nearest Neighbor</em>. Metode <em>Naive Bayes </em>dipilih karena dapat menghasilkan akurasi yang maksimal dengan data latih yang sedikit. Sedangkan metode <em>K-Nearest Neighbor</em> dipilih karena metode tersebut tangguh terhadap data <em>noise</em>. Kinerja dari kedua metode tersebut akan dibandingkan, sehingga dapat diketahui metode mana yang lebih baik dalam melakukan klasifikasi dokumen. Hasil yang didapatkan menunjukkan metode <em>Naive Bayes </em>memiliki kinerja yang lebih baik dengan tingkat akurasi 70%, sedangkan metode <em>K-Nearest Neighbor </em>memiliki tingkat akurasi yang cukup rendah yaitu 40%.</p><p class="Abstrak"> </p><p class="Abstrak"><em><strong>Abstract</strong></em></p><p class="Abstrak"><em>One way to be accepted in a journal conference and get the publication is to create an article with perfect suitability content of the journal. Matching the content of the article with a journal theme is the main factor for acceptability an article. But there are still many students who are confused to choose the journal in accordance with the articles it has. So we need a method to classification article documents category automatically and accurately group articles. There are many classification methods that can be used. The method used in this study is Naive Bayes and as a baseline the K-Nearest Neighbor method. Naive Bayes method is chosen because it can produce maximum accuracy with little training data. While K-Nearest Neighbor method was chosen because the method is robust to data noise. The performance of the two methods will be compared, so we can be known which method is better in classifying the document. The results show that the Naive Bayes method performs is more accurate with 70% accuracy and K-Nearest Neighbors method has a fairly low accuracy of 40% on classification test.</em></p>

2020 ◽  
Vol 9 (2) ◽  
pp. 259
Author(s):  
Gede Putra Aditya Brahmantha ◽  
I Wayan Santiyasa

In addition to communicating, Social Media is a place to issue opinions by the public on many things that are currently taking place, Twitter is one of these social medias that is widely used in conveying opinions regardless of whether these opinions are negative, positive, or even neutral. Tweets data about the Enforcement of PSBB Part II in Jakarta were obtained as many as 200 opinions using web crawling then advanced to the preprocessing stage before being classified using the K-Nearest Neighbor and Multinomial Naive Bayes algorithms. In 3 tests, the highest accuracy was 65.00% for K-Nearest Neighbor and the highest accuracy was 85.00% for Multinomial Naive Bayes method.


Author(s):  
Cindy Astelia Ramadhan Suparman ◽  
Endah Purwanti ◽  
Prihartini Widiyanti

Dengue Hemorrhagic Fever is a disease which is endemic in most districts / cities still becomes a public health problem in Indonesia. The awareness of people to the dengue viral infection and its symptoms are needed to decrease the fatality of this disease. The community need to be known the symptoms thereby they could intervened and prevent from falling in to worse condition. This study was conducted to design system which could diagnose the onset of the disease with 3 levels of possibilities namely Grade 1 Dengue Hemorrhagic Fever, Grade 2 Dengue Hemorrhagic Fever, and Non Dengue Hemorrhagic Fever. The system is build based on patient medical records of Dr. Wahidin Sudiro Husodo General Hospital, Mojokerto, East Java using the Naive Bayes method. The method of this study including several steps such as collecting data, preprocessing data, designing database, interface design, calculation and processing data, classification and analyzing data and evaluating application. Determining the results of the application diagnose requires posterior calculation which searches the highest values in three degrees as the results of the initial diagnose. The application as a device for an early diagnosis of dengue hemorrhagic fever has a high accuracy value of 97% out of the 30 tested data. The homogenization of the training data and the test data by sex and age can be considered in future research.


2021 ◽  
Vol 328 ◽  
pp. 04011
Author(s):  
Alwin Ali ◽  
Amal Khairan ◽  
Firman Tempola ◽  
Achmad Fuad

The amount of rainfall that occurs cannot be determined with certainty, but it can be predicted or estimated. In predicting the potential for rain, data mining techniques can be used by classifying data using the naive Bayes method. Naïve Bayes algorithm is a classification method using probability and statistical methods. The purpose of this study is how to implement the naive Bayes method to predict the potential for rain in Ternate City, and be able to calculate the accuracy of the Naive Bayes method from system created. The highest calculation results with new data with a total of 400 training data and 30 test data, obtained 30 correct data with 100% precision, 100% recall and 100% accuracy and the lowest calculation results with new data with a total of 500 training data and 50 test data, obtained 38 correct data and 12 incorrect data with a percentage of precision 61.29%, recall 100% and accuracy 76%.


Author(s):  
Muqorobin Muqorobin ◽  
Kusrini Kusrini ◽  
Siti Rokhmah ◽  
Isnawati Muslihah

The Surakarta Al-Islam Vocational School is a private educational institution that requires all students to pay school tuition fees. Education is an obligation for all Indonesian citizens. The cost of education is one of the most important input components in implementing education. Because cost is the main requirement in achieving educational goals. SPP School is a routine school fee that is carried out every month. Based on last year's School Admin report, many students were late in paying school tuition fees, around 60%. This is a very big problem because the income of school funds comes from school tuition. The purpose of this research is that the researcher will build a prediction system using the best classification method, which is to compare the accuracy level of the Naïve Bayes method with the K-K-Nearest Neighbor method. Because both methods can make class classifications right or late, in paying school fees. processing using dapodic data for 2017/2018 as many as 236 data. In improving accuracy, the researcher also applies feature selection with Information Gain, which is useful for selecting optimal parameters. System testing is carried out using the Confusion Matrix method. The final results of this study indicate that the Naïve Bayes Method + Information Gain Method produces the highest accuracy, namely 95% compared to the Naïve Bayes method alone, namely 85% and the K-NN method, namely 81%.


2019 ◽  
Vol 16 (2) ◽  
pp. 187
Author(s):  
Mega Luna Suliztia ◽  
Achmad Fauzan

Classification is the process of grouping data based on observed variables to predict new data whose class is unknown. There are some classification methods, such as Naïve Bayes, K-Nearest Neighbor and Neural Network. Naïve Bayes classifies based on the probability value of the existing properties. K-Nearest Neighbor classifies based on the character of its nearest neighbor, where the number of neighbors=k, while Neural Network classifies based on human neural networks. This study will compare three classification methods for Seat Load Factor, which is the percentage of aircraft load, and also a measure in determining the profit of airline.. Affecting factors are the number of passengers, ticket prices, flight routes, and flight times. Based on the analysis with 47 data, it is known that the system of Naïve Bayes method has misclassifies in 14 data, so the accuracy rate is 70%. The system of K-Nearest Neighbor method with k=5 has misclassifies in 5 data, so the accuracy rate is 89%, and the Neural Network system has misclassifies in 10 data with accuracy rate 78%. The method with highest accuracy rate is the best method that will be used, which in this case is K-Nearest Neighbor method with success of classification system is 42 data, including 14 low, 10 medium, and 18 high value. Based on the best method, predictions can be made using new data, for example the new data consists of Bali flight routes (2), flight times in afternoon (2), estimate of passenger numbers is 140 people, and ticket prices is Rp.700,000. By using the K-Nearest Neighbor method, Seat Load Factor prediction is high or at intervals of 80% -100%.


2021 ◽  
Vol 4 (1) ◽  
pp. 33-39
Author(s):  
Budi Pangestu ◽  

Selection of majors by prospective students when registering at a school, especially a Vocational High School, is very vulnerable because prospective students usually choose a major not because of their individual wishes. And because of the increasing emergence of new schools in cities and districts in each province in Indonesia, especially in the province of Banten. Problems experienced by prospective students when choosing the wrong department or not because of their desire, so that it has an unsatisfactory value or value in each semester fluctuates, especially in their Productive Lessons or Competencies. To provide a solution, a departmental suitability system is needed that can provide recommendations for specialization or major suitability based on students' abilities through attributes that can later assist students in the suitability of majors. The process of classifying the suitability of majors in data mining uses the k-Nearest Neighbor and Naive Bayes Classifier methods by entering 16 (sixteen) criteria or attributes which can later provide an assessment of students through this test when determining the majors for themselves, and there is no interference from people. another when choosing a major later. Research that has been carried out successfully using the k-Nearest Neighbors method has a higher recall of 99%, 81% accuracy and 82% precision compared to the Naïve Bayes Classifier whose recall only yields 98% while the accuracy and precision is the same as the k- Nearest Neighbors.


2020 ◽  
Vol 5 (3) ◽  
pp. 291
Author(s):  
Hanif Rahman Burhani ◽  
Iskandar Fitri ◽  
Andrianingsih Andrianingsih

Glaucoma is an eye disease that causes the second largest blindness after cataracts, this disease can cause decreased vision and can even be fatal, namely permanent blindness if it is not realized and treated immediately. Lack of information and education to the public to always maintain eye health is the basis for the purpose of making this expert system which aims to provide early diagnosis to people who are indicated to have glaucoma based on the symptoms or characteristics previously felt. The Naïve bayes method is a method that uses statistics and probability in predicting a person's chance of suffering from glaucoma based on the symptoms previously felt. It is made based on a website with PHP as the programming language and uses MySQL for the database. As for the comparison method used is the Certainty factor, which is a method that functions to determine a certainty value based on the calculation of the predetermined CF value by applying manual calculations. In the Naïve bayes method, the application can group symptom data and types of disease and can diagnose based on previous training data, while for the Certainty factor method based on the calculation of the value of the expert and the CF value that has been inputted by the user, it can produce a percentage of the diagnosis of the disease glaucoma in 96%.Keywords:Certainty factor, Expert System, Glaucoma, MySQL, Naïve bayes, PHP.


Author(s):  
Budi Soepriyanto

Abstract— Buying and selling shares is a transaction that is widely carried out at this time, especially buying and selling stocks online which are widely available in the market, to make buying and selling shares require ability or knowledge so that the buying and selling of shares are profitable, to be able to help economic players predict prices. Profit shares or not purchased in the future, this research will conduct stock price predictions using classification methods, namely K-Nearest Neighbor and Naïve Bayes, to predict the stock price data used for one month in minute levels totalling 39065 data, based on prediction results. The highest results obtained were using Naïve Bayes with an accuracy value of 69.38 then the K-Nearest Neighbor method with a K = 5 value of 67.25%, based on these results it can be concluded that the use of the K-Nearest Neighbor and Naïve Bayes methods for prediction share price not yet owned I high accuracy, so it can be combined with other methods or by using other variable predictors.


2020 ◽  
Vol 6 (1) ◽  
pp. 15-20
Author(s):  
Endah Widya Ningsih ◽  
Hardiyan Hardiyan

The eligibility of Jakarta Smart Card Plus recipients is still not on target due to subjective receipts. Schools has an important role in making decisions about the eligibility of Jakarta Smart Plus Card recipients. Therefore, the authors make this research using data that already exists or is called training data. The author uses the Naïve Bayes method with 7 independent attributes to knowing eligibility. The author also uses the another application  Rapidmined 5.3 to test the accuracy of the Naïve Bayes method. The result of this research the accuracy of determining the eligibility of Jakarta Smart Plus Card recipients are good with 98.88% with an error of presentation 2.22%.  So it can be concluded that the naive bayes method can help detrermine the eligibility of jakarta smart plus card recipients. Keywords: Jakarta Smart Card, Naïve Bayes, eligibility


Author(s):  
Muhammad Saidi ◽  
Fajriana Fajriana ◽  
Wahyu Fuadi ◽  
Ermatita Ermatita ◽  
Iwan Pahendra

Electricity subsidy is provided for all 450 VA power household customers and 900 VA power household customers who are poor and disadvantaged. However, there are many facts that household customers with 450 VA power are capable and 900 VA power household customers consist of capable households, boarding houses or luxury rented. Households are able to use more electricity than poor households. This paper describe to the identification of household customers' electrical power in the Lhokseumawe city to facilitate PLN in classifying customer power by using the Naive Bayes method. Naive bayes value variables used in this study are: monthly income, highest diploma, last job, house area, subscription fee and government registered household. The classification of household customer power is grouped into three categories, namely low (450 VA down), medium (900 VA) and high (above 1300 VA).. Based on household customer data that is used as training data, the Naive Bayes method is able to classify the customer data tested. So the Naive Bayes method successfully predicts the magnitude of the probability of household electrical power with an accuracy percentage of 80%.Keywords: Electricity, Naive Bayes,  CBS, low birth weight, subsidy


Sign in / Sign up

Export Citation Format

Share Document