scholarly journals Sentiment analysis on film review in Gujarati language using machine learning

Author(s):  
Parita Shah ◽  
Priya Swaminarayan ◽  
Maitri Patel

<span>Opinion analysis is by a long shot most basic zone of characteristic language handling. It manages the portrayal of information to choose the motivation behind the wellspring of the content. The reason might be of a type of gratefulness (positive) or study (negative). This paper offers a correlation between the outcomes accomplished by applying the calculation arrangement using various classifiers for instance K-nearest neighbor and multinomial naive Bayes. These techniques are utilized to assess a significant assessment with either a positive remark or negative remark. The gathered information considered on the grounds of the extremity film datasets and an association with the results accessible proof has been created for a careful assessment. This paper investigates the word level count vectorizer and term frequency inverse document frequency (TF-IDF) influence on film sentiment analysis. We concluded that multinomial Naive Bayes (MNB) classier generate more accurate result using TF-IDF vectorizer compared to CountVectorizer, K-nearest-neighbors (KNN) classifier has the same accuracy result in case of TF-IDF and CountVectorizer.</span>

2021 ◽  
Vol 4 (1) ◽  
pp. 33-39
Author(s):  
Budi Pangestu ◽  

Selection of majors by prospective students when registering at a school, especially a Vocational High School, is very vulnerable because prospective students usually choose a major not because of their individual wishes. And because of the increasing emergence of new schools in cities and districts in each province in Indonesia, especially in the province of Banten. Problems experienced by prospective students when choosing the wrong department or not because of their desire, so that it has an unsatisfactory value or value in each semester fluctuates, especially in their Productive Lessons or Competencies. To provide a solution, a departmental suitability system is needed that can provide recommendations for specialization or major suitability based on students' abilities through attributes that can later assist students in the suitability of majors. The process of classifying the suitability of majors in data mining uses the k-Nearest Neighbor and Naive Bayes Classifier methods by entering 16 (sixteen) criteria or attributes which can later provide an assessment of students through this test when determining the majors for themselves, and there is no interference from people. another when choosing a major later. Research that has been carried out successfully using the k-Nearest Neighbors method has a higher recall of 99%, 81% accuracy and 82% precision compared to the Naïve Bayes Classifier whose recall only yields 98% while the accuracy and precision is the same as the k- Nearest Neighbors.


Author(s):  
Ardianne Luthfika Fairuz ◽  
Rima Dias Ramadhani ◽  
Nia Annisa Ferani Tanjung

Akhir tahun 2019 lalu dunia digemparkan oleh munculnya suatu penyakit yang disebabkan oleh virus SARS-CoV-2 yang merupakan jenis virus terbaru dari coronavirus. Penyakit ini dikenal dengan nama COVID-19. Penyebaran penyakit ini terbilang cukup luas dan cepat. Dalam waktu singkat penyakit ini mulai menyebar ke segala penjuru dunia tak terkecuali Indonesia. Dengan tingkat penyebaran yang begitu tinggi dan belum ditemukannya vaksin untuk COVID-19, menyebabkan kekacauan di tengah masyarakat. Hal ini mempengaruhi banyak sektor kehidupan masyarakat. Tak sedikit masyarakat yang aktif bersosial media dan menuliskan pendapat, opini serta pemikirannya di platform media sosial seperti Twitter. Terjadinya pandemi ini mendorong masyarakat untuk menuliskan opini, pemikiran serta pendapatnya terhadap COVID-19 pada media sosial Twitter. Dibutuhkan suatu model sentiment analysis untuk mengklasifikasi tweet masyarakat di Twitter menjadi positif dan negatif. Sentiment analysis merupakan bagian dari Natural Language Processing yang membuat sebuah sistem guna mengenali serta mengekstraksi opini dalam  bentuk teks. Pada penelitian ini digunakan algoritma Naive Bayes dan K-Nearest Neighbor untuk digunakan dalam membangun model sentiment analysis terhadap tweet pengguna Twitter terhadap COVID-19. Didapatkan akurasi sebesar 85% untuk algoritma Naïve Bayes dan 82% untuk algoritma K-Nearest Neighbor pada nilai k=6, 8, dan 14.


2020 ◽  
Vol 9 (2) ◽  
pp. 259
Author(s):  
Gede Putra Aditya Brahmantha ◽  
I Wayan Santiyasa

In addition to communicating, Social Media is a place to issue opinions by the public on many things that are currently taking place, Twitter is one of these social medias that is widely used in conveying opinions regardless of whether these opinions are negative, positive, or even neutral. Tweets data about the Enforcement of PSBB Part II in Jakarta were obtained as many as 200 opinions using web crawling then advanced to the preprocessing stage before being classified using the K-Nearest Neighbor and Multinomial Naive Bayes algorithms. In 3 tests, the highest accuracy was 65.00% for K-Nearest Neighbor and the highest accuracy was 85.00% for Multinomial Naive Bayes method.


Author(s):  
Kadda Zerrouki ◽  
Reda Mohamed Hamou ◽  
Abdellatif Rahmoun

Making use of social media for analyzing the perceptions of the masses over a product, event, or a person has gained momentum in recent times. Out of a wide array of social networks, the authors chose Twitter for their analysis as the opinions expressed there are concise and bear a distinctive polarity. Sentiment analysis is an approach to analyze data and retrieve sentiment that it embodies. The paper elaborately discusses three supervised machine learning algorithms—naïve bayes, k-nearest neighbor (KNN), and decision tree—and compares their overall accuracy, precision, as well as recall values, f-measure, number of tweets correctly classified, number of tweets incorrectly classified, and execution time.


2021 ◽  
Vol 8 (1) ◽  
pp. 50-56
Author(s):  
Nico Nathanael Wilim ◽  
Raymond Sunardi Oetama

Indonesia Lawyers Club (ILC) is a talk show on TVOne that discusses topics around public phenomena, legal issues, crime, and other similar topics. In 2018, ILC won the Panasonic Gobel Awards as the best news talk show program. But in 2019, ILC failed to win the award which was won by Mata Najwa which featured a talk show event that appeared on Trans7. As one of the television shows that has won awards, ILC has pros and cons for its shows from the public. This study applies a sentiment analysis approach to examine public opinion on Twitter about Mata Najwa and ILC in 2018 and 2019. This study applies K-Nearest Neighbor, Naïve Bayes Classifier, and Decision Tree classification algorithm to validate the result. The contribution of this study is to show that public opinion on Twitter can be examined to figure out community sentiment on a tv talk show as well as to confirm the Award winner of tv Talkshow.   Index Terms—datamining; Decision Tree; K-NN; Naïve Bayes Classifier; sentiment analysis


2018 ◽  
Vol 5 (4) ◽  
pp. 427 ◽  
Author(s):  
Riri Nada Devita ◽  
Heru Wahyu Herwanto ◽  
Aji Prasetya Wibawa

<p class="Abstrak">Kecocokan isi artikel dengan sebuah tema jurnal menjadi faktor utama diterima tidaknya sebuah artikel. Tetapi masih banyak mahasiswa yang bingung untuk menentukan jurnal yang sesuai dengan artikel yang dimilikinya. Untuk itu diperlukannya sebuah metode klasifikasi dokumen yang dapat mengelompokkan artikel secara otomatis dan akurat. Terdapat banyak metode klasifikasi yang dapat digunakan. Metode yang digunakan dalam penelitian ini adalah <em>Naive Bayes</em> dan sebagai <em>baseline </em>digunakan metode <em>K-Nearest Neighbor</em>. Metode <em>Naive Bayes </em>dipilih karena dapat menghasilkan akurasi yang maksimal dengan data latih yang sedikit. Sedangkan metode <em>K-Nearest Neighbor</em> dipilih karena metode tersebut tangguh terhadap data <em>noise</em>. Kinerja dari kedua metode tersebut akan dibandingkan, sehingga dapat diketahui metode mana yang lebih baik dalam melakukan klasifikasi dokumen. Hasil yang didapatkan menunjukkan metode <em>Naive Bayes </em>memiliki kinerja yang lebih baik dengan tingkat akurasi 70%, sedangkan metode <em>K-Nearest Neighbor </em>memiliki tingkat akurasi yang cukup rendah yaitu 40%.</p><p class="Abstrak"> </p><p class="Abstrak"><em><strong>Abstract</strong></em></p><p class="Abstrak"><em>One way to be accepted in a journal conference and get the publication is to create an article with perfect suitability content of the journal. Matching the content of the article with a journal theme is the main factor for acceptability an article. But there are still many students who are confused to choose the journal in accordance with the articles it has. So we need a method to classification article documents category automatically and accurately group articles. There are many classification methods that can be used. The method used in this study is Naive Bayes and as a baseline the K-Nearest Neighbor method. Naive Bayes method is chosen because it can produce maximum accuracy with little training data. While K-Nearest Neighbor method was chosen because the method is robust to data noise. The performance of the two methods will be compared, so we can be known which method is better in classifying the document. The results show that the Naive Bayes method performs is more accurate with 70% accuracy and K-Nearest Neighbors method has a fairly low accuracy of 40% on classification test.</em></p>


Sign in / Sign up

Export Citation Format

Share Document