Perancangan Aplikasi Prediksi Kelulusan Tepat Waktu Bagi Mahasiswa Baru Dengan Teknik Data Mining (Studi Kasus: Data Akademik Mahasiswa STMIK Dipanegara Makassar)

Penelitian ini dimaksudkan untuk melakukan prediksi terhadap kemungkian mahasiswa baru dapat menyelesaikan studi tepat waktu dengan menggunakan analisis data mining untuk menggali tumpukan histori data dengan menggunakan algoritma K-Nearest Neighbor (KNN). Aplikasi yang dihasilkan pada penelitian ini akan menggunakan berbagai atribut yang klasifikasikan dalam suatu data mining antara lain nilai ujian nasional (UN), asal sekolah/ daerah, jenis kelamin, pekerjaan dan penghasilan orang tua, jumlah bersaudara, dan lain-lain sehingga dengan menerapkan analysis KNN dapat dilakukan suatu prediksi berdasarkan kedekatan histori data yang ada dengan data yang baru, apakah mahasiswa tersebut berpeluang untuk menyelesaikan studi tepat waktu atau tidak. Dari hasil pengujian dengan menerapkan algoritma KNN dan menggunakan data sampel alumni tahun wisuda 2004 s.d. 2010 untuk kasus lama dan data alumni tahun wisuda 2011 untuk kasus baru diperoleh tingkat akurasi sebesar 83,36%.This research is intended to predict the possibility of new students time to complete studies using data mining analysis to explore the history stack data using K-Nearest Neighbor algorithm (KNN). Applications generated in this study will use a variety of attributes in a data mining classified among other Ujian Nasional scores (UN), the origin of the school / area, gender, occupation and income of parents, number of siblings, and others that by applying the analysis KNN can do a prediction based on historical proximity of existing data with new data, whether the student is likely to complete the study on time or not. From the test results by applying the KNN algorithm and uses sample data alumnus graduation year 2004 s.d 2010 for the case of a long and alumni data graduation year 2011 for new cases obtained accuracy rate of 83.36%.

Download Full-text

Data Mining Classification Of Filing Credit Customers Without Collateral With K-Nearest Neighbor Algorithm (Case study: PT. BPR Diori Double)

Journal Of Computer Networks, Architecture and High Performance Computing ◽

10.47709/cnapc.v2i2.401 ◽

2020 ◽

Vol 2 (2) ◽

pp. 204-210

Author(s):

Jeprianto Sinaga ◽

Bosker Sinaga

Keyword(s):

Data Mining ◽

Nearest Neighbor ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Savings And Loans ◽

Long Time ◽

K Nearest Neighbor Algorithm ◽

Using Data ◽

The Many

Unsecured loans are the community's choice for lending to banks that provide Reviews These services. PT. RB Diori Ganda is a regional private banking company that serves savings and loans and loans without collateral for the community. Submission of unsecured loans must go through an assessor team to process the analysis of the attributes that Affect the customer's classification so that credit can be approved, the which is then submitted to the commissioner for credit approval. But what if Reviews those who apply for credit on the same day in large amounts, of course this will the make the process of credit analysis and approval will take a long time. If it is seen from the many needs of the community to apply for loans without collateral, a classification application is needed, in order to Facilitate the work of the assessor team in the process of analyzing the attributes that Affect customer classification. To find out the classification of customers who apply for unsecured loans for using data mining with the K-Nearest Neighbor algorithm. The result of this research is the classification of problematic or non-performing customers for credit applications without collateral.

Download Full-text

Analysis and Prediction of CET4 Scores Based on Data Mining Algorithm

Complexity ◽

10.1155/2021/5577868 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Hongyan Wang

Keyword(s):

Data Mining ◽

Linear Regression ◽

Test Score ◽

Nearest Neighbor ◽

Classification Model ◽

Data Mining Algorithm ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

K Nearest Neighbor Algorithm ◽

Classification Efficiency

This paper presents the concept and algorithm of data mining and focuses on the linear regression algorithm. Based on the multiple linear regression algorithm, many factors affecting CET4 are analyzed. Ideas based on data mining, collecting history data and appropriate to transform, using statistical analysis techniques to the many factors influencing the CET-4 test were analyzed, and we have obtained the CET-4 test result and its influencing factors. It was found that the linear regression relationship between the degrees of fit was relatively high. We further improve the algorithm and establish a partition-weighted K-nearest neighbor algorithm. The K-weighted K nearest neighbor algorithm and the partition algorithm are used in the CET-4 test score classification prediction, and the statistical method is used to study the relevant factors that affect the CET-4 test score, and screen classification is performed to predict when the comparison verification will pass. The weight K of the input feature and the adjacent feature are weighted, although the allocation algorithm of the adjacent classification effect has not been significantly improved, but the stability classification is better than K-nearest neighbor algorithm, its classification efficiency is greatly improved, classification time is greatly reduced, and classification efficiency is increased by 119%. In order to detect potential risk graduating students earlier, this paper proposes an appropriate and timely early warning and preschool K-nearest neighbor algorithm classification model. Taking test scores or make-up exams and re-learning as input features, the classification model can effectively predict ordinary students who have not graduated.

Download Full-text

Detect Frauds in Credit Card using Data Mining Techniques

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.a5041.129219 ◽

2019 ◽

Vol 9 (2) ◽

pp. 4891-4895

Keyword(s):

Data Mining ◽

Credit Card ◽

Nearest Neighbor ◽

Hidden Markov ◽

K Nearest Neighbor ◽

Detection Systems ◽

Important Concern ◽

On Line ◽

Using Data ◽

The Individual

In today era credit card are extensively used for day to day business as well as other transactions. Ascent within the variety of transactions through master card has junction rectifier to rise in the dishonest activities. In trendy day's fraud is one in every of the most important concern within the monetary loses not solely to the merchants however additionally to the individual purchasers. Data processing had competed a commanding role within the detection of credit card in on-line group action. Our aim is to first of all establish the categories of the fraud secondly, the techniques like K-nearest neighbor, Hidden Markov model, SVM, logistic regression, decision tree and neural network. So fraud detection systems became essential for the banks to attenuate their loses. In this paper we have research about the various detecting techniques to identify and detect the fraud through varied techniques of data mining

Download Full-text

Application of K-Nearest Neighbor Algorithm on Classification of Disk Hernia and Spondylolisthesis in Vertebral Column

Indonesian Journal of Information Systems ◽

10.24002/ijis.v2i1.2352 ◽

2019 ◽

Vol 2 (1) ◽

pp. 57 ◽

Cited By ~ 1

Author(s):

Irma Handayani

Keyword(s):

Vertebral Column ◽

Nearest Neighbor ◽

Average Length ◽

Data Classification ◽

The Body ◽

Training Data ◽

K Nearest Neighbor ◽

Sample Data ◽

K Nearest Neighbor Algorithm

Vertebral column as a part of backbone has important role in human body. Trauma in vertebral column can affect spinal cord capability to send and receive messages from brain to the body system that controls sensory and motoric movement. Disk hernia and spondylolisthesis are examples of pathologies on the vertebral column. Research about pathology or damage bones and joints of skeletal system classification is rare whereas the classification system can be used by radiologists as a second opinion so that can improve productivity and diagnosis consistency of the radiologists. This research used dataset Vertebral Column that has three classes (Disk Hernia, Spondylolisthesis and Normal) and instances in UCI Machine Learning. This research applied the K-NN algorithm for classification of disk hernia and spondylolisthesis in vertebral column. The data were then classified into two different but related classification tasks: “normal” and “abnormal”. K-NN algorithm adopts the approach of data classification by optimizing sample data that can be used as a reference for training data to produce vertebral column data classification based on the learning process. The results showed that the accuracy of K-NN classifier was 83%. The average length of time needed to classify the K-NN classifier was 0.000212303 seconds.

Download Full-text

Analysis K-Nearest Neighbor Algorithm for Improving Prediction Student Graduation Time

SinkrOn ◽

10.33395/sinkron.v4i2.10480 ◽

2020 ◽

Vol 4 (2) ◽

pp. 42

Author(s):

Rizki Muliono ◽

Juanda Hakim Lubis ◽

Nurul Khairina

Keyword(s):

Higher Education ◽

Nearest Neighbor ◽

Training Data ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Study Program ◽

Sample Data ◽

Student Graduation ◽

K Nearest Neighbor Algorithm

Higher education plays a major role in improving the quality of education in Indonesia. The BAN-PT institution established by the government has a standard of higher education accreditation and study program accreditation. With the 4.0-based accreditation instrument, it encourages university leaders to improve the quality and quality of their education. One indicator that determines the accreditation of study programs is the timely graduation of students. This study uses the K-Nearest Neighbor algorithm to predict student graduation times. Students' GPA at the time of the seventh semester will be used as training data, and data of students who graduate are used as sample data. K-Nearest Neighbor works in accordance with the given sample data. The results of prediction testing on 60 data for students of 2015-2016, obtained the highest level of accuracy of 98.5% can be achieved when k = 3. Prediction results depend on the pattern of data entered, the more samples and training data used, the calculation of the K-Nearest Neighbor algorithm is also more accurate.

Download Full-text

NEWS TOPIC CLASSIFICATION ON TRIBUNNEWS ONLINE MEDIA USING K-NEAREST NEIGHBOR ALGORITHM

Journal of Information Technology and Its Utilization ◽

10.30818/jitu.1.2.1879 ◽

2018 ◽

Vol 1 (2) ◽

pp. 38

Author(s):

Nfn Herman

Keyword(s):

Nearest Neighbor ◽

Online Media ◽

K Nearest Neighbor ◽

Accuracy Rate ◽

Nearest Neighbor Algorithm ◽

K Nearest Neighbor Algorithm ◽

Manual Classification

Online media journalists like tribunnews journalists usually determine the news category when make news input. Unfortunately, often the topic submitted is not in accordance with what is expected by the editor. These errors will make it difficult for news searches by customers. To eliminate these errors, editors can be assisted by an application that able to classify topics. Thus, editors is no longer too dependent on journalist input. This study aims to design applications that able to classify topics based on the texts contained in the news. The method used is the K-Nearest Neighboor algorithm. This design has produced a system that able to classify news topics automatically. To measure the accuracy of the application, several test were carried out by comparing between its results and the results of manual classification by the editor. The tests those carried out with several scenarios produce an accuracy rate of 82%

Download Full-text

Data mining, fuzzy AHP and TOPSIS for optimizing taxpayer supervision

Indonesian Journal of Electrical Engineering and Computer Science ◽

10.11591/ijeecs.v18.i1.pp75-87 ◽

2020 ◽

Vol 18 (1) ◽

pp. 75

Author(s):

M. Jupri ◽

Riyanarto Sarno

Keyword(s):

Data Mining ◽

Nearest Neighbor ◽

Fuzzy Ahp ◽

Support Vector ◽

K Nearest Neighbor ◽

Data Mining Algorithms ◽

Using Data ◽

Time Required ◽

Mining Algorithms

The achievement of accepting optimal tax need effective and efficient tax supervision can be achieved by classifying taxpayer compliance to tax regulations. Considering this issue, this paper proposes the classification of taxpayer compliance using data mining algorithms; i.e. C4.5, Support Vector Machine, K-Nearest Neighbor, Naive Bayes, and Multilayer Perceptron based on the compliance of taxpayer data. The taxpayer compliance can be classified into four classes, which are (1) formal and material compliant taxpayers, (2) formal compliant taxpayers, (3) material compliant taxpayers, and (4) formal and material non-compliant taxpayers. Furthermore, the results of data mining algorithms are compared by using Fuzzy AHP and TOPSIS to determine the best performance classification based on the criteria of Accuracy, F-Score, and Time required. Selection of the taxpayer's priority for more detailed supervision at each level of taxpayer compliance is ranked using Fuzzy AHP and TOPSIS based on criteria of dataset variables. The results show that C4.5 is the best performance classification and achieves preference value of 0.998; whereas the MLP algorithm results from the lowest preference value of 0.131. Alternative taxpayer A233 is the top priority taxpayer with a preference value of 0.433; whereas alternative taxpayer A051 is the lowest priority taxpayer with a preference value of 0.036.

Download Full-text

Penyelesaian Masalah Pengelolaan Lumbung Pangan Desa Menggunakan Case-Based Reasoning dengan Algoritma K-Nearest Neighbor

JSI: Jurnal Sistem Informasi (E-Journal) ◽

10.36706/jsi.v11i1.7699 ◽

2019 ◽

Vol 11 (1) ◽

Author(s):

Mgs. Afriyan Firdaus ◽

Dwi Rosa Indah ◽

Putri Eka Sevtiyuni ◽

Choirunnisa Qonitah

Keyword(s):

Problem Solving ◽

Nearest Neighbor ◽

Technical Problem ◽

Case Based Reasoning ◽

Test Results ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Existing Problems ◽

K Nearest Neighbor Algorithm ◽

Case Based

In this paper, we discuss the problem solving of village food barn management using Case-Based Reasoning (CBR) with the K-Nearest Neighbor algorithm. This research was carried out by adopting the stages of the CBR cycle and the nearest neighbor algorithm. The results of the study show that the application of CBR and K-nearest neighbor algorithms can support the resolution of knowledge problems in village food barn management using technical problem solving based on the symptoms and solutions to existing problems. Based on the test results, the problem-solving accuracy was 92%.Keywords - case-based reasoning, K-nearest neighbor, food barn, problem-solving

Download Full-text

PREDIKSI HASIL PEMILU LEGISLATIF MENGGUNAKAN ALGORITMA K-NEAREST NEIGHBOR BERBASIS BACKWARD ELIMINATION

Jurnal RESISTOR (Rekayasa Sistem Komputer) ◽

10.31598/jurnalresistor.v3i1.517 ◽

2020 ◽

Vol 3 (1) ◽

pp. 27-41

Author(s):

Achmad Saiful Rizal ◽

Moch. Lutfi

Keyword(s):

Data Mining ◽

Nearest Neighbor ◽

Political Elite ◽

Data Mining Algorithm ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Backward Elimination ◽

K Nearest Neighbor Algorithm ◽

Fold Cross Validation ◽

Selection Of

Elections in Indonesia from period to period have undergone some changes. Elections legislative candidates not determined voters, but instead became a political elite authority in accordance with the order of the list of legislative candidates and their number sequence. To perform a prediction one of them with data mining. Data mining can be applied in the political sphere for example to predict the results of the legislative election and others. K-nearest neighbor algorithm is one of the data mining algorithm that performs classification based on learning object against which are closest to the object. Election-related research has been done with the k-nearest neighbor algorithm, but accuracy is obtained that method is still too low, so it takes an additional algorithm to improve accuracy. In this study, the proposed method, namely the method of k-nearest neighbor method combined with backward elimination as a selection of features. The dataset that will be used in the study comes from the KPU Sidoarjo that has special attributes 1 and 13 regular attributes. From the results of the analysis and computation of some methods, it can be concluded that the method of k-nearest neighbor method combined with backward elimination produced some conclusions. First, of the 14 attributes in the dataset, retrieved 8 most influential attribute. Second, the best accuracy are of 96.03% when k = 2 and tested by 10 fold cross validation.

Download Full-text

Diagnosis Of Heart Disease Using K-Nearest Neighbor Method Based On Forward Selection

Journal of Applied Intelligent System ◽

10.33633/jais.v4i2.2749 ◽

2020 ◽

Vol 4 (2) ◽

pp. 39-47

Author(s):

Junta Zeniarja ◽

Anisatawalanita Ukhifahdhina ◽

Abu Salam

Keyword(s):

Data Mining ◽

Feature Selection ◽

Heart Disease ◽

Early Diagnosis ◽

Nearest Neighbor ◽

Selection Method ◽

K Nearest Neighbor ◽

Forward Selection ◽

Nearest Neighbor Algorithm ◽

K Nearest Neighbor Algorithm

Heart is one of the essential organs that assume a significant part in the human body. However, heart can also cause diseases that affect the death. World Health Organization (WHO) data from 2012 showed that all deaths from cardiovascular disease (vascular) 7.4 million (42.3%) were caused by heart disease. Increased cases of heart disease require a step as an early prevention and prevention efforts by making early diagnosis of heart disease. In this research will be done early diagnosis of heart disease by using data mining process in the form of classification. The algorithm used is K-Nearest Neighbor algorithm with Forward Selection method. The K-Nearest Neighbor algorithm is used for classification in order to obtain a decision result from the diagnosis of heart disease, while the forward selection is used as a feature selection whose purpose is to increase the accuracy value. Forward selection works by removing some attributes that are irrelevant to the classification process. In this research the result of accuracy of heart disease diagnosis with K-Nearest Neighbor algorithm is 73,44%, while result of K-Nearest Neighbor algorithm accuracy with feature selection method 78,66%. It is clear that the incorporation of the K-Nearest Neighbor algorithm with the forward selection method has improved the accuracy result. Keywords - K-Nearest Neighbor, Classification, Heart Disease, Forward Selection, Data Mining

Download Full-text