Implementasi Text Mining Untuk Analisis Opini Publik Terhadap Calon Presiden

Sistem presidensial merupakan salah satu bentuk demokrasi di Indonesia. Sistem tersebut menitikberatkan pada penyelenggaraan pemilihan umum presiden dan wakilnya yang dilakukan secara langsung oleh rakyat. Tingkat terpilihnya seorang presiden dapat dilihat dari opini publik yang beredar, salah satunya pada media sosial yang juga merupakan bagian dari kampanye. Dalam penelitian ini akan dianalisa opini yang berkaitan dengan elektabilitas calon presiden dari media sosial Twitter dari media sosial Twitter menggunakan metode Naïve Bayes Classifier (NBC) dan menentukan faktor-faktor yang terbentuk dari opini menggunakan Principal Component Analysis (PCA). Data opini dari media sosial Twitter didapatkan menggunakan kata kunci “Jokowi” dan “Prabowo”. Sebagian opini tersebut dipilih sebagai data latih untuk didapatkan kelas bersentimen negatif dan positif. Setelah proses pelatihan, dilakukan proses terhadap data uji dan data validasi. Hasil akurasi untuk data uji topik Jokowi pada tweet bersentimen positif mendapatkan akurasi sebesar 88.63% dan negatif sebesar 91.06%. Sementara untuk Prabowo bersentimen positif mendapatkan akurasi sebesar 88.58% dan negatif sebesar 80.37%. Rerata akurasi untuk keseluruhan topik adalah adalah 86.89%. Untuk mendapatkan faktor pada setiap sentimen, dilakukan proses perhitungan nilai PCA. Setiap sentimen tersebut kemudian dilakukan analisis faktor oleh pakar, yakni didapatkan 20 faktor yang sudah berhasil diinterpretasikan oleh pakar.

Download Full-text

Feature Scaled Element Balancing with Random Boosting for Heart Disease Prediction using Machine Learning

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.e5069.018520 ◽

2020 ◽

Vol 8 (5) ◽

pp. 4105-4110

Keyword(s):

Principal Component Analysis ◽

Heart Disease ◽

Naive Bayes ◽

Principal Component ◽

Component Analysis ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Data Set

In the current scenario, the researchers are focusing towards health care project for the prediction of the disease and its type. In addition to the prediction, there exists a need to find the influencing parameter that directly related to the disease prediction. The analysis of the parameters needed to the prediction of the disease still remains a challenging issue. With this view, we focus on predicting the heart disease by applying the dataset with boosting the parameters of the dataset. The heart disease data set extracted from UCI Machine Learning Repository is used for implementation. The anaconda Navigator IDE along with Spyder is used for implementing the Python code. Our contribution is folded is folded in three ways. First, the data preprocessing is done and the attribute relationship is identified by the correlation values. Second, the data set is fitted to random boost regressor and the important features are identified. Third, the dataset is feature scaled reduced and then fitted to random forest classifier, decision tree classifier, Naïve bayes classifier, logistic regression classifier, kernel support vector machine and KNN classifier. Fourth, the dataset is reduced with principal component analysis with five components and then fitted to the above mentioned classifiers. Fifth, the performance of the classifiers is analyzed with the metrics like accuracy, recall, fscore and precision. Experimental results shows that, the Naïve bayes classifier is more effective with the precision, Recall and Fscore of 0.89 without random boost, 0.88 with random boosting and 0.90 with principal component analysis. Experimental results show, the Naïve bayes classifier is more effective with the accuracy of 89% without random boost, 90% with random boosting and 91% with principal component analysis.

Download Full-text

Diagnosa Kerusakan Bearing Menggunakan Principal Component Analysis (PCA) dan Naïve Bayes Classifier

JURNAL SISTEM INFORMASI BISNIS ◽

10.21456/vol6iss2pp114-123 ◽

2016 ◽

Vol 6 (2) ◽

pp. 114

Author(s):

Dwi Pudyastuti ◽

Toni Prahasto ◽

Achmad Widodo

Keyword(s):

Principal Component Analysis ◽

Fault Diagnosis ◽

Naive Bayes ◽

Principal Component ◽

Component Analysis ◽

Naïve Bayes ◽

Feature Reduction ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

This research is discussing about the usage of data mining which addressed for bearing fault diagnosis. Bearing was one of the essential parts in industry machinery. Bearing was used to reduce machines frictions or could be a moving component which oppressed each other. This fault diagnosis can avoid loss and damage of other machines components. This research was started with data preprocessing using wavelet discrete transformation, feature extraction, feature reduction using Principal Component Analysis (PCA), and classification process using Naïve Bayes classifier methods. Naïve Bayes Classifier is a classification method which based on probability and Bayesian theorem. Output of these method shows that Naïve Bayes classification have a good performance which shown by a good accuracy in each data test.

Download Full-text

Naive Bayes Classifier Based on Principal Component Analysis Applied to Clinical Diagnosis Decision of ECG Data

Software Engineering and Applications ◽

10.12677/sea.2021.105067 ◽

2021 ◽

Vol 10 (05) ◽

pp. 622-633

Author(s):

杰青闵

Keyword(s):

Principal Component Analysis ◽

Clinical Diagnosis ◽

Naive Bayes ◽

Principal Component ◽

Component Analysis ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Ecg Data

Download Full-text

Klasifikasi Berita Menggunakan Algoritma Naive Bayes Classifer Dengan Seleksi Fitur Dan Boosting

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v3i2.1042 ◽

2019 ◽

Vol 3 (2) ◽

pp. 227-232

Author(s):

Bobby Suryo Prakoso ◽

Didi Rosiyadi ◽

Heru Sukma Utama ◽

Dedi Aridarma

Keyword(s):

Text Mining ◽

Naive Bayes ◽

Information Gain ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Penelitian yang dilakukan ini merupakan bagian dari text mining untuk klasifikasi konten berita yang telah memiliki label berdasarkan katagori berita pada situs detik.com . Proses yang dilakukan adalah melakukan permodelan dan pengolahan data, mulai proses pre-processing, proses seleksi fitur information gain, dan penerapan model algoritma Naive Bayes Classifier dengan Bayesian Boosting. Hasil yang diperoleh atas model tersebut mendapatkan nilai evaluasi terhadap akurasi, recall, dan presisi sebesar 73.2%. Sedangkan dengan model yang lebih ringkas yaitu model algoritma Naive Bayes Classifier, dengan Bayesian Boosting mendapatkan nilai evaluasi yang sama besar yaitu 73.2%. Penilaian atas hasil evaluasi model yang telah terlaksankan berkesimpulan bahwa penerapan seleksi fitur Information Gain tidak berpengaruh besar atas kenaikan hasil performa terhadap kondisi label Polynomial.

Download Full-text

Analisis Sentimen Physical Distancing pada Twitter Menggunakan Text Mining dengan Algoritma Naive Bayes Classifier

SISTEMASI ◽

10.32520/stmsi.v10i1.1118 ◽

2021 ◽

Vol 10 (1) ◽

pp. 131

Author(s):

Nila Hardi ◽

Yuris Alkahfi ◽

Popon Handayani ◽

Windu Gata ◽

Muhammad Rifqi Firdaus

Keyword(s):

Text Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Download Full-text

Naïve Bayes Classifier with Parallel Abduction Reasoning Ensemble Principal Component Analysis for Prediction Modeling

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2016.5617 ◽

2016 ◽

Vol 13 (10) ◽

pp. 6707-6710

Author(s):

J Suganthi ◽

V Malathi

Keyword(s):

Latent Variable ◽

Naive Bayes ◽

Principal Component ◽

Naïve Bayes ◽

Abductive Reasoning ◽

Experimental Result ◽

Cancer Information ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

The classification could be a latent variable that is probabilistically relating to the discovered variables. In Bayesian algorithmic ways, logical thinking works in probabilistic mode. However PCM based parallel abductive reasoning with Naïve Bayes (NB) on cancer information could be a powerful technique to perform effective prediction in classification. Whereas whilst classifying the cancer information the strategy reads the parallel changes and predicts the severity level for supplementary treatments. Since the Bayesian classifier gives many premises for several supervised learning algorithms thereby the proposed Parallel abductive Naïve Bayes Classifier algorithm based on factor analysis of PCA enhances the granularity of prediction. The Principal components are chosen on multi-perspective domain of curator analysis dataset. Experimental result shows that it is potential to get parallel abductive classifiers that have comparatively high impact on prediction.

Download Full-text

NAIVE BAYES CLASSIFIER DAN SUPPORT VECTOR MACHINE SEBAGAI ALTERNATIF SOLUSI UNTUK TEXT MINING

Jurnal Teknologi Informasi dan Pendidikan ◽

10.24036/tip.v12i2.219 ◽

2019 ◽

Vol 12 (2) ◽

pp. 32-38

Author(s):

Iin Ernawati

Keyword(s):

Support Vector Machine ◽

Text Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Classification Algorithms ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

The Relationship

This study was conducted to text-based data mining or often called text mining, classification methods commonly used method Naïve bayes classifier (NBC) and support vector machine (SVM). This classification is emphasized for Indonesian language documents, while the relationship between documents is measured by the probability that can be proven with other classification algorithms. This evident from the conclusion that the probability result Naïve Bayes Classifier (NBC) word “party” at least in the economic document and political. Then the result of the algorithm support vector machine (svm) with the word “price” and “kpk” contains in both economic and politic document.

Download Full-text

Analisa Pola Penyalahgunaan Facebook Sebagai Alat Kejahatan Trafficking Menggunakan Data Mining

Jurnal Teknik Informatika ◽

10.35793/jti.8.1.2016.12231 ◽

2016 ◽

Vol 8 (1) ◽

Author(s):

Linda Jayanti ◽

Steven R. Sentinuwo ◽

Oktavian A. Lantang ◽

Agustinus Jacobus

Keyword(s):

Data Mining ◽

Text Mining ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Abstrak - Facebook memungkinkan penggunanya berinteraksi dengan orang yang kita kenal maupun orang yang tidak kita kenal, dimana hal tersebut dapat membuka peluang bagi kejahatan dunia maya seperti, penculikan, perdagangan manusia (trafficking), hingga pembunuhan. IOM mecatat bahwa korban perdagangan orang atau trafficking di Indonesia mencapai 74.616 hingga I juta per tahun, dimana tindak kejahatan teersebut banyak dilakukan melalui facebook sebagai medianya. Data teks (status) yang berada di halaman facebook sangat besar. Dengan menggunakan Teknik pengolahan data dari ilmu Data Mining, terutama di bidangtext mining, penulis memanfaatkannya untuk mengidentifikasi data teks (status facebook) yang terindikasi sebagai proses kejahatan trafficking dengan memakai salah satu teknik klasifikasi dengan teorema naïve bayes classifier (NBC). Kata kunci : facebook, trafficking, data mining, text mining, klasifikasi, naïve bayes classifier.

Download Full-text

The comparation of text mining with Naive Bayes classifier, nearest neighbor, and decision tree to detect Indonesian swear words on Twitter

2017 5th International Conference on Cyber and IT Service Management (CITSM) ◽

10.1109/citsm.2017.8089231 ◽

2017 ◽

Cited By ~ 7

Author(s):

Wildan Budiawan Zulfikar ◽

Mohamad Irfan ◽

Cecep Nurul Alam ◽

Muhammad Indra

Keyword(s):

Text Mining ◽

Decision Tree ◽

Nearest Neighbor ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Download Full-text

ANALISA PERBANDINGAN METODE NAÏVE BAYES CLASSIFIER DAN K-NEAREST NEIGHBOR TERHADAP KLASIFIKASI DATA

Sebatik ◽

10.46984/sebatik.v24i1.909 ◽

2020 ◽

Vol 24 (1) ◽

pp. 1-7

Author(s):

Aida Indriani

Keyword(s):

Text Mining ◽

Nearest Neighbor ◽

Naive Bayes ◽

Confusion Matrix ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

K Nearest Neighbor ◽

Naïve Bayes Classifier

Penggunaan forum sebagai sarana pembelajaran telah banyak digunakan pada kalangan Mahasiswa. Forum digunakan sebagai sarana berdiskusi antar sesama anggota forum untuk membahas materi sesuai dengan judul topik. Judul topik biasanya ditentukan sesuai dengan isi materi yang akan dibahas. Judul topik yang sudah terlalu banyak di dalam sebuah forum dapat berakibat salah dalam pemilihan judul. Salah satu cara untuk mengatasinya yaitu dengan melakukan klasifikasi judul topik secara otomatis sesuai dengan isi materi. Klasifikasi teks dapat diselesaikan dengan menggunakan teknik text mining. Pada proses klasifikasi yang dilakukan yaitu dengan membagi dataset menjadi 2 (dua) bagian menjadi data latih (training) dan data uji (testing). Pada tahapan awal klasifikasi dilakukan proses pre-processing yang diawali dengan tahapan tokenisasi, kemudian dilanjutkan dengan filtering dan diakhiri dengan stemming. Ada beberapa metode yang dapat digunakan dalam klasifikasi teks antara lain naïve bayes classifier (nbc), k-nearest neighbor (k-nn), rocchio, weight adjusted k-nearest neighbor (wa k-nn) dan lain-lain. Pada penelitian ini, penulis membandingkan 2 (dua) metode yaitu nbc dan k-nn. Dari hasil perbandingan kedua metode dapat disimpulkan bahwa metode k-nn lebih baik tingkat akurasinya daripada metode nbc. Hal ini dibuktikan dengan tingkat akurasi sebesar 80% untuk metode k-nn dan sebesar 73% untuk nbc yang dihitung dengan menggunakan metode confusion matrix.

Download Full-text