Educational Data Mining and Analysis of Students’ Academic Performance Using WEKA

In this competitive scenario of the educational system, the higher education institutes use data mining tools and techniques for academic improvement of the student performance and to prevent drop out. The authors collected data from three colleges of Assam, India. The data consists of socio-economic, demographic as well as academic information of three hundred students with twenty-four attributes. Four classification methods, the J48, PART, Random Forest and Bayes Network Classifiers were used. The data mining tool used was WEKA. The high influential attributes were selected using the tool. The internal assessment attribute in the continuous evaluation process makes the highest impact in the final semester results of the students in our dataset. The results showed that random forest outperforms the other classifiers based on accuracy and classifier errors. Apriori algorithm was also used to find the association rule mining among all the attributes and the best rules were also displayed.

Download Full-text

Comparison of Some Classification Algorithms for the Analysis of Students Academic Performance in Educational Data Mining Using Orange

International Journal of Advanced Research in Science, Communication and Technology ◽

10.48175/ijarsct-1394 ◽

2021 ◽

pp. 318-324

Author(s):

Vanthana V

Keyword(s):

Data Mining ◽

Academic Performance ◽

Random Forest ◽

Educational Data Mining ◽

Evaluation Process ◽

Support Vector ◽

Classification Algorithms ◽

Academic Improvement ◽

Academic Information ◽

Tools And Techniques

In the modern education system, many higher education institutions prefer data mining tools and techniques to analyze the academic improvement of their students. To support that many data mining techniques and tools are available. This paper uses the classification concept to analyze the student’s academic performance. This paper presents the comparison result of five classification algorithms – Decision Tree, Naïve Bayesian, K-Nearest Neighbour, Support Vector Machine and Random Forest which is applied to the data collected from three colleges of Assam, India. The data consists of socio-economic, demographic as well as academic information of three hundred students with twenty-four attributes. The data mining tool used was ORANGE. The internal assessment attribute in the continuous evaluation process makes the highest impact in the final semester results of the students in the dataset. The results showed that Random Forest out performs the other classifiers based on accuracy.

Download Full-text

A Systematic Literature Review of Student’ Performance Prediction Using Machine Learning Techniques

Education Sciences ◽

10.3390/educsci11090552 ◽

2021 ◽

Vol 11 (9) ◽

pp. 552

Author(s):

Balqis Albreiki ◽

Nazar Zaki ◽

Hany Alashwal

Keyword(s):

Machine Learning ◽

Data Mining ◽

At Risk ◽

Learning Environment ◽

Student Performance ◽

Critical Role ◽

Educational Data Mining ◽

Drop Out ◽

Machine Learning Techniques ◽

Students At Risk

Educational Data Mining plays a critical role in advancing the learning environment by contributing state-of-the-art methods, techniques, and applications. The recent development provides valuable tools for understanding the student learning environment by exploring and utilizing educational data using machine learning and data mining techniques. Modern academic institutions operate in a highly competitive and complex environment. Analyzing performance, providing high-quality education, strategies for evaluating the students’ performance, and future actions are among the prevailing challenges universities face. Student intervention plans must be implemented in these universities to overcome problems experienced by the students during their studies. In this systematic review, the relevant EDM literature related to identifying student dropouts and students at risk from 2009 to 2021 is reviewed. The review results indicated that various Machine Learning (ML) techniques are used to understand and overcome the underlying challenges; predicting students at risk and students drop out prediction. Moreover, most studies use two types of datasets: data from student colleges/university databases and online learning platforms. ML methods were confirmed to play essential roles in predicting students at risk and dropout rates, thus improving the students’ performance.

Download Full-text

Enhancement of Predicting Students Performance Model Using Ensemble Approaches and Educational Data Mining Techniques

Wireless Communications and Mobile Computing ◽

10.1155/2021/6241676 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Mahmoud Ragab ◽

Ahmed M. K. Abdel Aal ◽

Ali O. Jifri ◽

Nahla F. Omran

Keyword(s):

Data Mining ◽

Random Forest ◽

Student Performance ◽

Evaluation Method ◽

Educational Data Mining ◽

Performance Model ◽

Vector System ◽

Vote Procedure ◽

Support Vector ◽

Data Mining Techniques

Student performance prediction is extremely important in today’s educational system. Predicting student achievement in advance can assist students and teachers in keeping track of the student’s progress. Today, several institutes have implemented a manual ongoing evaluation method. Students benefit from such methods since they help them improve their performance. In this study, we can use educational data mining (EDM), which we recommend as an ensemble classifier to anticipate the understudy accomplishment forecast model based on data mining techniques as classification techniques. This model uses distinct datasets which represent the student’s intercommunication with the instructive model. The exhibition of an understudy’s prescient model is evaluated by a kind of classifiers, for instance, logistic regression, naïve Bayes tree, artificial neural network, support vector system, decision tree, random forest, and k -nearest neighbor. Additionally, we used set processes to evolve the presentation of these classifiers. We utilized Boosting, Random Forest, Bagging, and Voting Algorithms, which are the normal group of techniques used in studies. By using ensemble methods, we will have a good result that demonstrates the dependability of the proposed model. For better productivity, the various classifiers are gathered and, afterward, added to the ensemble method using the Vote procedure. The implementation results demonstrate that the bagging method accomplished a cleared enhancement with the DT model, where the DT algorithm accuracy with bagging increased from 90.4% to 91.4%. Recall results improved from 0.904 to 0.914. Precision results also increased from 0.905 to 0.915.

Download Full-text

Predicting Student Failure in University Examination using Machine Learning Algorithms

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.e2643.039520 ◽

2020 ◽

Vol 9 (5) ◽

pp. 956-959

Keyword(s):

Machine Learning ◽

Data Mining ◽

Performance Management ◽

Student Performance ◽

Learning Algorithms ◽

Educational Data Mining ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Social Characteristics ◽

Student Failure

Student Performance Management is one of the key pillars of the higher education institutions since it directly impacts the student’s career prospects and college rankings. This paper follows the path of learning analytics and educational data mining by applying machine learning techniques in student data for identifying students who are at the more likely to fail in the university examinations and thus providing needed interventions for improved student performance. The Paper uses data mining approach with 10 fold cross validation to classify students based on predictors which are demographic and social characteristics of the students. This paper compares five popular machine learning algorithms Rep Tree, Jrip, Random Forest, Random Tree, Naive Bayes algorithms based on overall classifier accuracy as well as other class specific indicators i.e. precision, recall, f-measure. Results proved that Rep tree algorithm outperformed other machine learning algorithms in classifying students who are at more likely to fail in the examinations.

Download Full-text

Educational Data Mining: A review of evaluation process in the e-learning

Telematics and Informatics ◽

10.1016/j.tele.2018.04.015 ◽

2018 ◽

Vol 35 (6) ◽

pp. 1701-1717 ◽

Cited By ~ 34

Author(s):

Marcos Wander Rodrigues ◽

Seiji Isotani ◽

Luiz Enrique Zárate

Keyword(s):

Data Mining ◽

Educational Data Mining ◽

Evaluation Process ◽

E Learning

Download Full-text

Student Performance Predictions Using Knowledge Discovery Database and Data Mining, DPU Students Records as Sample

Academic Journal of Nawroz University ◽

10.25007/ajnu.v10n3a875 ◽

2021 ◽

Vol 10 (3) ◽

pp. 121-127

Author(s):

Bareen Haval ◽

Karwan Jameel Abdulrahman ◽

Araz Rajab

Keyword(s):

Data Mining ◽

Decision Tree ◽

Student Performance ◽

Educational Data Mining ◽

Data Sets ◽

Decision Tree Classifier ◽

Data Mining Techniques ◽

Academic History ◽

Tree Classifier ◽

Using Data

This article presents the results of connecting an educational data mining techniques to the academic performance of students. Three classification models (Decision Tree, Random Forest and Deep Learning) have been developed to analyze data sets and predict the performance of students. The projected submission of the three classificatory was calculated and matched. The academic history and data of the students from the Office of the Registrar were used to train the models. Our analysis aims to evaluate the results of students using various variables such as the student's grade. Data from (221) students with (9) different attributes were used. The results of this study are very important, provide a better understanding of student success assessments and stress the importance of data mining in education. The main purpose of this study is to show the student successful forecast using data mining techniques to improve academic programs. The results of this research indicate that the Decision Tree classifier overtakes two other classifiers by achieving a total prediction accuracy of 97%.

Download Full-text

Educational Data Mining for Student Learning Pattern Analysis using Clustering Algorithms

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f1528.089620 ◽

2020 ◽

Vol 9 (6) ◽

pp. 481-488

Keyword(s):

Data Mining ◽

Student Performance ◽

Clustering Algorithms ◽

Educational Data Mining ◽

Data Mining Algorithm ◽

Learning Behavior ◽

Study Program ◽

Data Mining Algorithms ◽

Students First ◽

And Performance

The exponential increase in universities’ electronic data creates the need to derive some useful information from these massive amounts of data. The progression in the data mining field causes it conceivable to educational data to improve the nature of educational processes. This study, thus, uses data mining methods to study the learning behavior and performance of university students. It focused on two aspects of the performance of the students. First, predicting students' learning behavior at the end of a complete year of the study program. Second, predict student performance with the help of the data model proposed by this study. Finally, provide course material recommendations using the data mining algorithm. Three data mining algorithms were considered which are K-Means, FCM, and KFCM., and maximum accuracy of 90.22% was achieved by KFCM. The study indicates that in terms of time and memory usages K-means algorithm give better results. This creates an opportunity for identifying students that may graduate with poor results or may not graduate at all, so early intercession might be possible.

Download Full-text

Educational Data Mining for Predicting Student Graduation Using the Naïve Bayes Classifier Algorithm

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v4i1.1502 ◽

2020 ◽

Vol 4 (1) ◽

pp. 95-101 ◽

Cited By ~ 2

Author(s):

Edi Sutoyo ◽

Ahmad Almaarif

Keyword(s):

Data Mining ◽

Naive Bayes ◽

Educational Data Mining ◽

Naïve Bayes ◽

Drop Out ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Student Graduation

The quality of students can be seen from the academic achievements, which are evidence of the efforts made by students. Student academic achievement is evaluated at the end of each semester to determine the learning outcomes that have been achieved. If a student cannot meet certain academic criteria that are stated by fulfilling the requirements to continue his studies, the student may have the potential to not graduate on time or even Drop Out (DO). The high number of students who do not graduate on time or DO in higher education institutions can be minimized by detecting students who are at risk in the early stages of education and is supported by making policies that can direct students to complete their education. Also, if the time for completion of student studies can be predicted then the handling of students will be more effective. One technique for making predictions that can be used is data mining techniques. Therefore, in this study, the Naive Bayes Classifier (NBC) algorithm will be used to predict student graduation at Telkom University. The dataset was obtained from the Information Systems Directorate (SISFO), Telkom University which contained 4000 instance data. The results of this study prove that NBC was successfully implemented to predict student graduation. Prediction of the graduation of these students is able to produce an accuracy of 73,725%, precision 0.742, recall 0.736 and F-measure of 0.735.

Download Full-text

A Review on Prediction of Academic Performance of Students at-Risk Using Data Mining Techniques

Journal on Today s Ideas-Tomorrow s Technologies ◽

10.15415/jotitt.2017.51002 ◽

2017 ◽

Vol 5 (1) ◽

pp. 30-39 ◽

Cited By ~ 1

Author(s):

Preet Kamal ◽

Sachin Ahuja

Keyword(s):

Data Mining ◽

Learning Process ◽

High Probability ◽

Research Work ◽

Educational Data Mining ◽

Dropout Rate ◽

Educational Background ◽

Drop Out ◽

Teaching Learning ◽

The Right

Educational data mining is the procedure of converting raw data collected from educational databases into some useful information. It can be helpful in designing and answering research questions like performance prediction of students in academics, factors that affect the students’ performance, help the teachers in understanding the problems faced by the students to understand the course content and complexity of the subject taken so that the teachers can take timely action to control the dropout rate. This also includes improving the teaching learning process so that the interventions can be taken at the right time to improve the performance of the student. This paper is the review of the research work done in the field of educational data mining for the prediction of students’ performance. The factors that influence the performance of the students i.e. the type of classrooms they attend such as traditional or on-line, socio-economic, educational background of the family, attitude toward studies and challenges faced by the students during course progress. These factors leads to the categorization of the students into three groups “Low-Risk”: who have High probability of succeeding, “Medium-Risk”: who may succeed in their examination, “High-Risk”: who have High probability of failing or drop-out. It elaborates the different ways to improve the teaching learning process by providing the students personal assistance, notes, class-assignments and special class tests. The most efficient techniques that are used in educational data mining are also reviewed such as; classification, regression, clustering and and prediction.

Download Full-text

Prediction and Analysis of Student Performance in Secondary Education Based on Data Mining and Machine Learning Techniques

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit20653 ◽

2020 ◽

pp. 294-301

Author(s):

Meenal Joshi ◽

Shiv Kumar

Keyword(s):

Machine Learning ◽

Data Mining ◽

Secondary School ◽

Student Performance ◽

Learning Algorithms ◽

Research Work ◽

Educational Data Mining ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Student’S Performance

According to modern era education is the key to achieve success in the future; it develops a human personality, thoughts, and social skills. The purpose of this research work is to focus on educational data mining (EDM) through machine learning algorithms. EDM means to discover hidden knowledge and pattern about student's performance. Machine learning can be useful to predict the learning outcomes of students. From last few years, several tools have been used to judge the student's performance from different points of view like the student's level, objectives, techniques, algorithms, and different methods. In this paper, predicting and analyzing student performance in secondary school is conducted using data mining techniques and machine learning algorithms such as Naive Bayes, Decision Tree algorithm J48, and Logistic Regression. For this the collection of dataset from "Secondary School" and then filtration is applying on desired values using WEKA, tool.

Download Full-text