A Recommendation System & Their Performance Metrics using several ML Algorithms

Recommendation systems are subdivision of Refine Data that request to anticipate ranking or liking a user would give to an item. Recommended systems produce user customized exhortations for product or service. Recommended systems are used in different services like Google Search Engine, YouTube, Gmail and also Product recommendation service on any E-Commerce website. These systems usually depends on content based approach. in this paper, we develop these type recommended systems by using several algorithms like K-Nearest neighbors(KNN), Support-Vector Machine(SVM), Logistic Regression(LR), MultinomialNB(MNB),and Multi-layer Perception(MLP). These will predict nearest categories from the News Category Data, among these categories we will recommend the most common sentence to a user and we analyze the performance metrics. This approach is tested on News Category Data set. This data set having more or less 200k Headlines of News and 41 classes, collected from the Huff post from the year of 2012-2018.

Download Full-text

Model Prediksi Prestasi Mahasiswa Berdasarkan Evaluasi Pembelajaran Menggunakan Pendekatan Data Science

Data Sciences Indonesia (DSI) ◽

10.47709/dsi.v1i1.1168 ◽

2021 ◽

Vol 1 (1) ◽

pp. 14-20

Author(s):

Tommy Tommy ◽

Amir Mahmud Husein

Keyword(s):

Support Vector Machine ◽

Logistic Regression ◽

Data Science ◽

Naive Bayes ◽

Nearest Neighbors ◽

Naïve Bayes ◽

Support Vector ◽

K Nearest Neighbors

Perguruan tinggi merupakan satuan penyelenggara pendidikan tinggi sebagai tingkat lanjut jenjang pendidikan menengah di jalur pendidikan formal. Aspek prestasi belajar merupakan salah satu aspek penilaian keberhasilan perguruan tinggi dalam proses belajar. Dalam makalah ini menyajikan hasil analisis hubungan antara pembelajaran dengan prestasi mahasiswa dimana tahapan yang dilakukan menggunakan pendetakan data science. Berdasarkan Analisis data terdapat tiga indikator penting dalam penilaian prestasi belajar yaitu pedagogi, profesional dan kepribadian. Ketiga fitur digunakan sebagai variabel dependen untuk memprediksi prestasi belajar dimana algoritma DecisionTree menghasilkan akurasi lebih baik dari pada model k-nearest neighbors (KNN), Logistic Regression, Support Vector Machine, Naive Bayes dan dengan tingkat akurasi 68%, kemudian KNN dengan akurasi 66% dan lainnya sebesar 55% pada masing-masing algoritma yang diusulkan.

Download Full-text

Preprocessing Unbalanced Data using Support Vector Machine with Method K-Nearest Neighbors for Cerebral Infarction Classification

Journal of Physics Conference Series ◽

10.1088/1742-6596/1752/1/012037 ◽

2021 ◽

Vol 1752 (1) ◽

pp. 012037

Author(s):

A G M Sari ◽

A M Putri ◽

Z Rustam ◽

J Pandelaki

Keyword(s):

Support Vector Machine ◽

Cerebral Infarction ◽

Nearest Neighbors ◽

Support Vector ◽

Unbalanced Data ◽

K Nearest Neighbors

Download Full-text

Prediction of breast cancer using support vector machine and K-Nearest neighbors

2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC) ◽

10.1109/r10-htc.2017.8288944 ◽

2017 ◽

Cited By ~ 25

Author(s):

Md. Milon Islam ◽

Hasib Iqbal ◽

Md. Rezwanul Haque ◽

Md. Kamrul Hasan

Keyword(s):

Breast Cancer ◽

Support Vector Machine ◽

Nearest Neighbors ◽

Support Vector ◽

K Nearest Neighbors

Download Full-text

Intelligent System to Classify Peanuts Varieties Using K-Nearest Neighbors (K-NN) and Support Vector Machine (SVM)

Communications in Computer and Information Science - Advanced Informatics for Computing Research ◽

10.1007/978-981-15-0108-1_33 ◽

2019 ◽

pp. 359-368

Author(s):

V. G. Narendra ◽

K. Govardhan Hegde

Keyword(s):

Support Vector Machine ◽

Intelligent System ◽

Nearest Neighbors ◽

Support Vector ◽

K Nearest Neighbors

Download Full-text

Técnicas de aprendizaje de máquina utilizadas para la minería de texto

Investigación Bibliotecológica Archivonomía Bibliotecología e Información ◽

10.22201/iibi.0187358xp.2017.71.57812 ◽

2017 ◽

Vol 31 (71) ◽

pp. 103

Author(s):

Ángel Freddy Godoy Viera

Keyword(s):

Support Vector Machine ◽

Naive Bayes ◽

Nearest Neighbors ◽

Naïve Bayes ◽

Support Vector ◽

K Nearest Neighbors ◽

Self Organizing Maps ◽

Self Organizing

Las técnicas de aprendizaje de máquina continúan siendo muy utilizadas para la minería de texto. Para este artículo se realizó una revisión de literatura en periódicos científicos publicados en los años de 2010 y 2011, con el objetivo de identificar las principales formas de aprendizaje de máquina empleadas para la minería de texto. Se utilizó estadística descriptiva para organizar, resumir y analizar los datos encontrados, y se presentó una descripción resumida de las principales encontradas. En los artículos analizados se hallaron 13 aplicadas para la minería de texto, el 83% de los artículos mencionaban de 1 a 3 técnicas de aprendizaje de máquina, las principales usadas por los autores en los artículos estudiados fueron support vector machine (svm), k-means (k-m),k-nearest neighbors (k-nn), naive bayes (nb), self-organizing maps (som). Los pares que aparecen con mayor frecuencia son svm/nb, svm/k-nn, svm/decission tree.

Download Full-text

Discrimination of soft tissues using laser-induced breakdown spectroscopy in combination with k nearest neighbors (kNN) and support vector machine (SVM) classifiers

Optics & Laser Technology ◽

10.1016/j.optlastec.2018.01.028 ◽

2018 ◽

Vol 102 ◽

pp. 233-239 ◽

Cited By ~ 17

Author(s):

Xiaohui Li ◽

Sibo Yang ◽

Rongwei Fan ◽

Xin Yu ◽

Deying Chen

Keyword(s):

Support Vector Machine ◽

Soft Tissues ◽

Nearest Neighbors ◽

Laser Induced Breakdown Spectroscopy ◽

Support Vector ◽

K Nearest Neighbors ◽

Breakdown Spectroscopy ◽

Laser Induced Breakdown

Download Full-text

Application of support vector machine combined with K-nearest neighbors in solar flare and solar proton events forecasting

Advances in Space Research ◽

10.1016/j.asr.2007.12.015 ◽

2008 ◽

Vol 42 (9) ◽

pp. 1469-1474 ◽

Cited By ~ 29

Author(s):

Rong Li ◽

Yanmei Cui ◽

Han He ◽

Huaning Wang

Keyword(s):

Support Vector Machine ◽

Solar Flare ◽

Nearest Neighbors ◽

Solar Proton ◽

Support Vector ◽

K Nearest Neighbors ◽

Solar Proton Events

Download Full-text

A COMBINATION OF SUPPORT VECTOR MACHINE AND k-NEAREST NEIGHBORS FOR MACHINE FAULT DETECTION

Applied Artificial Intelligence ◽

10.1080/08839514.2013.747370 ◽

2013 ◽

Vol 27 (1) ◽

pp. 36-49 ◽

Cited By ~ 16

Author(s):

Amaury B. Andre ◽

Eduardo Beltrame ◽

Jacques Wainer

Keyword(s):

Support Vector Machine ◽

Fault Detection ◽

Nearest Neighbors ◽

Support Vector ◽

K Nearest Neighbors ◽

Machine Fault

Download Full-text

Analisis Perbandingan Algoritma SVM, KNN, dan CNN untuk Klasifikasi Citra Cuaca

Jurnal Teknologi Informasi dan Ilmu Komputer ◽

10.25126/jtiik.2021824553 ◽

2021 ◽

Vol 8 (2) ◽

pp. 311

Author(s):

Mohammad Farid Naufal

Keyword(s):

Neural Network ◽

Machine Learning ◽

Computer Vision ◽

Support Vector Machine ◽

Convolutional Neural Network ◽

Cross Validation ◽

Nearest Neighbors ◽

Support Vector ◽

Classification Algorithms ◽

K Nearest Neighbors

Cuaca merupakan faktor penting yang dipertimbangkan untuk berbagai pengambilan keputusan. Klasifikasi cuaca manual oleh manusia membutuhkan waktu yang lama dan inkonsistensi. Computer vision adalah cabang ilmu yang digunakan komputer untuk mengenali atau melakukan klasifikasi citra. Hal ini dapat membantu pengembangan self autonomous machine agar tidak bergantung pada koneksi internet dan dapat melakukan kalkulasi sendiri secara real time. Terdapat beberapa algoritma klasifikasi citra populer yaitu K-Nearest Neighbors (KNN), Support Vector Machine (SVM), dan Convolutional Neural Network (CNN). KNN dan SVM merupakan algoritma klasifikasi dari Machine Learning sedangkan CNN merupakan algoritma klasifikasi dari Deep Neural Network. Penelitian ini bertujuan untuk membandingkan performa dari tiga algoritma tersebut sehingga diketahui berapa gap performa diantara ketiganya. Arsitektur uji coba yang dilakukan adalah menggunakan 5 cross validation. Beberapa parameter digunakan untuk mengkonfigurasikan algoritma KNN, SVM, dan CNN. Dari hasil uji coba yang dilakukan CNN memiliki performa terbaik dengan akurasi 0.942, precision 0.943, recall 0.942, dan F1 Score 0.942. AbstractWeather is an important factor that is considered for various decision making. Manual weather classification by humans is time consuming and inconsistent. Computer vision is a branch of science that computers use to recognize or classify images. This can help develop self-autonomous machines so that they are not dependent on an internet connection and can perform their own calculations in real time. There are several popular image classification algorithms, namely K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and Convolutional Neural Network (CNN). KNN and SVM are Machine Learning classification algorithms, while CNN is a Deep Neural Networks classification algorithm. This study aims to compare the performance of that three algorithms so that the performance gap between the three is known. The test architecture is using 5 cross validation. Several parameters are used to configure the KNN, SVM, and CNN algorithms. From the test results conducted by CNN, it has the best performance with 0.942 accuracy, 0.943 precision, 0.942 recall, and F1 Score 0.942.

Download Full-text

Perbandingan Algoritma Klasifikasi Sentimen Twitter Terhadap Insiden Kebocoran Data Tokopedia

JISKA (Jurnal Informatika Sunan Kalijaga) ◽

10.14421/jiska.2021.6.2.120-129 ◽

2021 ◽

Vol 6 (2) ◽

pp. 120-129

Author(s):

Nadhif Ikbar Wibowo ◽

Tri Andika Maulana ◽

Hamzah Muhammad ◽

Nur Aini Rakhmawati

Keyword(s):

Support Vector Machine ◽

Logistic Regression ◽

Random Forest ◽

Supervised Learning ◽

Support Vector ◽

Data Set ◽

Logistic Regression Classifier

Public responses, posted on Twitter reacting to the Tokopedia data leak incident, were used as a data set to compare the performance of three different classifiers, trained using supervised learning modeling, to classify sentiment on the text. All tweets were classified into either positive, negative, or neutral classes. This study compares the performance of Random Forest, Support-Vector Machine, and Logistic Regression classifier. Data was scraped automatically and used to evaluate several models; the SVM-based model has the highest f1-score 0.503583. SVM is the best performing classifier.

Download Full-text