Model Prediksi Prestasi Mahasiswa Berdasarkan Evaluasi Pembelajaran Menggunakan Pendekatan Data Science

Tommy Tommy; Amir Mahmud Husein

doi:10.47709/dsi.v1i1.1168

Data Sciences Indonesia (DSI) ◽

10.47709/dsi.v1i1.1168 ◽

2021 ◽

Vol 1 (1) ◽

pp. 14-20

Author(s):

Tommy Tommy ◽

Amir Mahmud Husein

Keyword(s):

Support Vector Machine ◽

Logistic Regression ◽

Data Science ◽

Naive Bayes ◽

Nearest Neighbors ◽

Naïve Bayes ◽

Support Vector ◽

K Nearest Neighbors

Higher education institutions are the units that deliver higher education as the continuation of secondary education within the formal education track. Learning achievement is one of the aspects used to assess how successful an institution is in the learning process. This paper presents an analysis of the relationship between learning and student achievement, carried out using a data science approach. The data analysis identifies three important indicators for assessing learning achievement: pedagogy, professionalism, and personality. These three features are used as predictor variables for learning achievement. The Decision Tree algorithm achieved the best accuracy at 68%, ahead of k-nearest neighbors (KNN) at 66%, while Logistic Regression, Support Vector Machine, and Naive Bayes each reached 55%.
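
As a rough illustration of how one classifier in such a comparison can be scored by accuracy, the sketch below implements a small k-nearest-neighbors vote in plain Python. The three-feature rows (pedagogy, professional, personality), the 1-5 scale, the labels, and the function names `knn_predict` and `accuracy` are all assumptions made for this example; none of them come from the paper's data or code.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, x, k=3):
    """Return the majority label among the k training rows closest to x."""
    # Euclidean distance from x to every training row, nearest first.
    neighbors = sorted(
        (math.dist(row, x), label) for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in neighbors[:k])
    return votes.most_common(1)[0][0]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical evaluation scores: (pedagogy, professional, personality).
train_X = [(4.5, 4.2, 4.8), (4.1, 4.4, 4.0), (4.7, 4.6, 4.5),
           (2.1, 2.5, 2.0), (2.8, 2.2, 2.6), (1.9, 2.0, 2.4)]
train_y = ["high", "high", "high", "low", "low", "low"]

# Hypothetical held-out rows used only for scoring.
test_X = [(4.3, 4.0, 4.4), (2.3, 2.4, 2.1), (4.6, 4.8, 4.1), (2.0, 2.9, 2.2)]
test_y = ["high", "low", "high", "low"]

preds = [knn_predict(train_X, train_y, x) for x in test_X]
print(f"KNN accuracy: {accuracy(test_y, preds):.2f}")
```

The same `accuracy` function could score any other classifier's predictions on the same held-out rows, which is what makes a side-by-side comparison meaningful.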

Técnicas de aprendizaje de máquina utilizadas para la minería de texto

Investigación Bibliotecológica Archivonomía Bibliotecología e Información ◽

10.22201/iibi.0187358xp.2017.71.57812 ◽

2017 ◽

Vol 31 (71) ◽

pp. 103

Author(s):

Ángel Freddy Godoy Viera

Keyword(s):

Support Vector Machine ◽

Naive Bayes ◽

Nearest Neighbors ◽

Naïve Bayes ◽

Support Vector ◽

K Nearest Neighbors ◽

Self Organizing Maps ◽

Self Organizing

Machine learning techniques remain widely used for text mining. For this article, a literature review was conducted of scientific journals published in 2010 and 2011, with the aim of identifying the main machine learning techniques used for text mining. Descriptive statistics were used to organize, summarize, and analyze the data found, and a brief description of the main techniques is presented. The analyzed articles applied 13 techniques to text mining, and 83% of the articles mentioned 1 to 3 machine learning techniques. The main techniques used by the authors of the studied articles were support vector machine (SVM), k-means (k-m), k-nearest neighbors (k-NN), naive Bayes (NB), and self-organizing maps (SOM). The pairs that appear most frequently are SVM/NB, SVM/k-NN, and SVM/decision tree.
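
The descriptive statistics mentioned in this abstract (how many techniques each article uses, and which pairs co-occur most often) can be sketched with a simple tally. The article list below is invented for illustration and does not reproduce the review's data; `count_pairs` and `share_with_few_techniques` are hypothetical helper names.

```python
from collections import Counter
from itertools import combinations


def count_pairs(articles):
    """Count how often each unordered pair of techniques shares an article."""
    pair_counts = Counter()
    for techniques in articles:
        # Sorting gives each pair one canonical ordering, e.g. ("nb", "svm").
        for pair in combinations(sorted(techniques), 2):
            pair_counts[pair] += 1
    return pair_counts


def share_with_few_techniques(articles, low=1, high=3):
    """Fraction of articles that mention between low and high techniques."""
    hits = sum(low <= len(t) <= high for t in articles)
    return hits / len(articles)


# Hypothetical sample: each set lists the techniques one article mentions.
articles = [
    {"svm", "nb"},
    {"svm", "k-nn"},
    {"svm", "nb", "k-means"},
    {"som"},
    {"svm", "decision tree", "k-nn", "nb"},
]

pair_counts = count_pairs(articles)
print(pair_counts.most_common(3))
print(share_with_few_techniques(articles))
```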

Prediction of Hepatitis Disease Using K-Nearest Neighbors, Naive Bayes, Support Vector Machine, Multi-Layer Perceptron and Random Forest

2021 International Conference on Information and Communication Technology for Sustainable Development (ICICT4SD) ◽

10.1109/icict4sd50815.2021.9397013 ◽

2021 ◽

Author(s):

Md. Julker Nayeem ◽

Sohel Rana ◽

Farjana Alam ◽

Md. Ataur Rahman

Keyword(s):

Support Vector Machine ◽

Random Forest ◽

Naive Bayes ◽

Nearest Neighbors ◽

Naïve Bayes ◽

Support Vector ◽

Multi Layer Perceptron ◽

K Nearest Neighbors

Comparison of Multinomial Naïve Bayes with K-Nearest Neighbors, Support Vector Machine and Random Forest for Classification of “Network Attacks” Document

2019 Fourth International Conference on Informatics and Computing (ICIC) ◽

10.1109/icic47613.2019.8985919 ◽

2019 ◽

Author(s):

Bambang Harjito ◽

Ardhi Wijayanto ◽

Kuni Nur Aini ◽

Budi Murtiyas

Keyword(s):

Support Vector Machine ◽

Random Forest ◽

Naive Bayes ◽

Nearest Neighbors ◽

Naïve Bayes ◽

Support Vector ◽

K Nearest Neighbors ◽

Network Attacks

Implementasi Algoritma Naive Bayes, Support Vector Machine, dan K-Nearest Neighbors untuk Analisa Sentimen Aplikasi Halodoc

Faktor Exacta ◽

10.30998/faktorexacta.v14i2.9697 ◽

2021 ◽

Vol 14 (2) ◽

pp. 64

Author(s):

Elly Indrayuni ◽

Acmad Nurhadi ◽

Dinar Ajeng Kristiyanti

Keyword(s):

Support Vector Machine ◽

Naive Bayes ◽

Nearest Neighbors ◽

Naïve Bayes ◽

Support Vector ◽

K Nearest Neighbors

Predicting Student’s Performance Using Machine Learning Algorithm

International Journal of Advanced Research in Science, Communication and Technology ◽

10.48175/ijarsct-1209 ◽

2021 ◽

pp. 53-58

Author(s):

Sheela Rani P ◽

Dhivya S ◽

Dharshini Priya M ◽

Dharmila Chowdary A

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Prediction Model ◽

Naive Bayes ◽

Learning Algorithm ◽

Naïve Bayes ◽

Machine Learning Algorithms ◽

Support Vector ◽

Learning Approaches ◽

K Nearest Neighbors

Machine learning is an emerging research discipline that uses data to improve learning, optimizing the training process and the environment in which learning takes place. There are two types of machine learning approaches, supervised and unsupervised, which are used to extract knowledge that helps decision-makers take the correct intervention in the future. This paper presents a prediction model for the factors that influence students' academic performance, using supervised machine learning algorithms such as support vector machine, k-nearest neighbors (KNN), Naïve Bayes, and logistic regression. The results of the various algorithms are compared, and the support vector machine and Naïve Bayes are shown to perform well, achieving improved accuracy compared to the other algorithms. The final prediction model in this paper may have fairly high prediction accuracy. The objective is not just to predict the future performance of students but also to provide the best technique for finding the most impactful features that influence students while studying.

A study and identification of COVID-19 viruses using N-grams with Naïve Bayes, K-Nearest Neighbors, Artificial Neural Networks, Decision tree and Support Vector Machine

10.21203/rs.3.rs-40344/v2 ◽

2020 ◽

Author(s):

Mohamed El Boujnouni

Keyword(s):

Neural Networks ◽

Support Vector Machine ◽

Artificial Neural Networks ◽

Decision Tree ◽

Naive Bayes ◽

Nearest Neighbors ◽

Support Vector ◽

K Nearest Neighbors ◽

Artificial Neural

Coronavirus disease 2019 (COVID-19) is a global health crisis caused by a virus officially named severe acute respiratory syndrome coronavirus 2, known by the acronym SARS-CoV-2. This highly contagious illness has severely affected people and businesses all over the world, and scientists are still trying to discover all useful information about it, including its potential origin(s) and inter-host(s). This study is part of that scientific inquiry and aims to identify precisely the origin(s) of a large set of SARS-CoV-2 genomes collected from different geographic locations around the world. The research combines five machine learning techniques (Naïve Bayes, K-Nearest Neighbors, Artificial Neural Networks, Decision Tree, and Support Vector Machine) with a widely known language modeling tool (N-grams). The experimental results show that the majority of techniques gave the same overall results concerning the origin(s) and inter-host(s) of SARS-CoV-2. According to these results, the virus has a single zoonotic source, the pangolin.
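
To make the N-gram step concrete, the sketch below turns a nucleotide string into overlapping N-gram counts and relative frequencies, which is one common way to build fixed-length features for a classifier. The abstract does not state the value of N or the exact feature encoding, so `n=3`, the function names, and the toy sequence are assumptions for illustration only.

```python
from collections import Counter


def ngram_counts(sequence, n=3):
    """Count overlapping N-grams (substrings of length n) in a sequence."""
    seq = sequence.upper()
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def ngram_frequencies(sequence, n=3):
    """Relative frequency of each N-gram, so sequences of different
    lengths can be compared on the same scale."""
    counts = ngram_counts(sequence, n)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gram: c / total for gram, c in counts.items()}


# Toy sequence, not real genome data.
toy = "ATGATGCA"
print(ngram_counts(toy))
print(ngram_frequencies(toy))
```

Frequency dictionaries like these can be mapped onto a fixed vocabulary of N-grams to give every genome a vector of the same length before classification.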

A Rough Set and Cellular Genetic Fusion Algorithm for Acute Critical Disease Prediction

International Journal of Computers Communications & Control ◽

10.15837/ijccc.2020.6.3894 ◽

2020 ◽

Vol 15 (6) ◽

Author(s):

Hongxin Wang ◽

Lijing Jia ◽

Heng Zhuang ◽

Xueyan Li ◽

Yuzhuo Zhao ◽

...

Keyword(s):

Genetic Algorithm ◽

Support Vector Machine ◽

Logistic Regression ◽

Rough Set ◽

Rough Set Theory ◽

Naive Bayes ◽

Naïve Bayes ◽

Support Vector ◽

Disease Prediction ◽

Fusion Algorithm

This study addresses the problems of an overly broad set of medical indicators, a lack of retrospective research samples, insufficient depth of data mining, and low disease prediction accuracy. We propose an intelligent screening algorithm that combines a genetic algorithm, cellular automata, and rough set theory. The algorithm can achieve high accuracy in predicting patient outcomes with a small number of indicators, and we compare it with the traditional genetic algorithm. We built the prediction model with 64 indicators based on logistic regression (AUC 0.8628), support vector machine (AUC 0.5319), Naïve Bayes (AUC 0.7102), and AdaBoost (AUC 0.9095). Using the cellular genetic algorithm for attribute screening not only reduces the number of indicators effectively but also achieves almost the same prediction accuracy with 8 indicators based on logistic regression (AUC 0.8782), support vector machine (AUC 0.8525), Naïve Bayes (AUC 0.8408), and AdaBoost (AUC 0.8770). Compared with the traditional scoring system, the predictive model established in this paper can more accurately predict rebleeding events based on physiological test indicators and continuous patient indicators.
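
The AUC values quoted in this abstract measure how well a model's scores rank positive cases above negative ones. A minimal sketch of that computation follows, using the pairwise (Mann-Whitney) formulation: AUC is the fraction of positive/negative pairs in which the positive case gets the higher score, with ties counted as half. The labels and scores are made up, and `roc_auc` is a hypothetical helper, not the authors' evaluation code.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a random positive outranks a random negative."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Made-up example: 1 = outcome occurred, 0 = it did not.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(labels, scores))
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect ranking. The quadratic loop is fine for small samples; a rank-based version would scale better on large ones.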

Movie Success Prediction Using Naïve Bayes, Logistic Regression and Support Vector Machine

10.1109/icrito51393.2021.9596138 ◽

2021 ◽

Author(s):

Rachaell Nihalaani ◽

Apoorva Shete ◽

Darakshan Khan

Keyword(s):

Support Vector Machine ◽

Logistic Regression ◽

Naive Bayes ◽

Naïve Bayes ◽

Success Prediction ◽

Support Vector ◽

Movie Success

Comparison of Tree Method, Support Vector Machine, Naïve Bayes, and Logistic Regression on Coffee Bean Image

EMITTER International Journal of Engineering Technology ◽

10.24003/emitter.v9i1.536 ◽

2021 ◽

Vol 9 (1) ◽

pp. 126-136

Author(s):

Rahmat Robi Waliyansyah ◽

Umar Hafidz Asy'ari Hasbullah

Keyword(s):

Support Vector Machine ◽

Logistic Regression ◽

Naive Bayes ◽

Confusion Matrix ◽

Naïve Bayes ◽

Classification Model ◽

Support Vector ◽

Coffee Bean ◽

Coffee Beans ◽

The Many

Coffee is one of the favorite drinks of Indonesians. In Indonesia there are two types of coffee, Arabica and Robusta. Coffee beans are usually classified in a traditional way that depends on the human senses. However, the human senses are often inconsistent, because they depend on the person's mental or physical condition at the time, and they allow only qualitative measures. In this study, coffee beans are classified by digital image processing. The features come from texture analysis using the Gray Level Co-occurrence Matrix (GLCM) method, with four features: Energy, Correlation, Homogeneity, and Contrast. The classification algorithms applied to the extracted features are Naïve Bayes, Tree, Support Vector Machine (SVM), and Logistic Regression. The coffee bean classification model is evaluated with the following metrics: AUC, F1, CA, precision, and recall. The dataset consists of 29 images of Arabica coffee beans and 29 images of Robusta beans. Model accuracy is tested using cross-validation, and the results are evaluated with a confusion matrix. Based on the testing and evaluation of the models, the SVM method is the best, with AUC = 1, CA = 0.983, F1 = 0.983, Precision = 0.983, and Recall = 0.983.
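
The four GLCM texture features named in this abstract can be sketched in plain Python as follows. The abstract does not give the pixel offset, the number of gray levels, or the exact feature formulas, so this example assumes a horizontal offset of one pixel, a non-symmetric matrix, energy as the sum of squared entries, and homogeneity with a `1 + |i - j|` denominator. Other tools define some of these differently, so treat the definitions and the function names as assumptions.

```python
import math


def glcm(image, levels, dr=0, dc=1):
    """Normalized gray-level co-occurrence matrix for the offset (dr, dc)."""
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    total = 0
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dr, c + dc
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
                total += 1
    if total == 0:
        raise ValueError("image too small for this offset")
    return [[v / total for v in row] for row in counts]


def glcm_features(p):
    """Contrast, energy, homogeneity, and correlation of a normalized GLCM."""
    n = len(p)
    cells = [(i, j, p[i][j]) for i in range(n) for j in range(n)]
    contrast = sum((i - j) ** 2 * v for i, j, v in cells)
    energy = sum(v * v for _, _, v in cells)
    homogeneity = sum(v / (1 + abs(i - j)) for i, j, v in cells)
    mu_i = sum(i * v for i, _, v in cells)
    mu_j = sum(j * v for _, j, v in cells)
    sd_i = math.sqrt(sum((i - mu_i) ** 2 * v for i, _, v in cells))
    sd_j = math.sqrt(sum((j - mu_j) ** 2 * v for _, j, v in cells))
    if sd_i == 0 or sd_j == 0:
        correlation = float("nan")  # undefined for a constant image
    else:
        cov = sum((i - mu_i) * (j - mu_j) * v for i, j, v in cells)
        correlation = cov / (sd_i * sd_j)
    return {"contrast": contrast, "energy": energy,
            "homogeneity": homogeneity, "correlation": correlation}


# Tiny made-up image with two gray levels (0 and 1).
image = [[0, 0],
         [1, 1]]
features = glcm_features(glcm(image, levels=2))
print(features)
```

Each image would yield one such four-value feature vector, which is then passed to the classifiers being compared.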

A Recommendation System & Their Performance Metrics using several ML Algorithms

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.c5791.029320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 2445-2451

Keyword(s):

Support Vector Machine ◽

Logistic Regression ◽

Search Engine ◽

Recommendation System ◽

Performance Metrics ◽

Nearest Neighbors ◽

Support Vector ◽

K Nearest Neighbors ◽

Data Set ◽

Google Search

Recommendation systems are a subclass of information filtering that seeks to predict the rating or preference a user would give to an item. They produce customized recommendations of products or services for each user. Recommendation systems are used in services such as Google Search, YouTube, and Gmail, and in product recommendation on e-commerce websites. These systems usually rely on a content-based approach. In this paper, we develop this type of recommendation system using several algorithms: K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Logistic Regression (LR), Multinomial Naive Bayes (MNB), and Multi-Layer Perceptron (MLP). These models predict the nearest categories in the News Category data; from these categories, the most common sentence is recommended to the user, and the performance metrics are analyzed. The approach is tested on the News Category data set, which contains roughly 200k news headlines in 41 classes, collected from HuffPost between 2012 and 2018.
