Comparison of Different Classification Techniques for Educational Data

Author(s):  
Kavita Pabreja

Data mining has been used extensively in various domains of application for prediction or classification. Data mining improves the productivity of its analysts tremendously by transforming their voluminous, unmanageable and prone to ignorable information into usable pieces of knowledge and has witnessed a great acceptance in scientific, bioinformatics and business domains. However, for education field there is still a lot to be done, especially there is plentiful research to be done as far as Indian Universities are concerned. Educational Data Mining is a promising discipline, concerned with developing techniques for exploring the unique types of educational data and using those techniques to better understand students' strengths and weaknesses. In this paper, the educational database of students undergoing higher education has been mined and various classification techniques have been compared so as to investigate the students' placement in software organizations, using real data from the students of a Delhi state university's affiliates.

Author(s):  
Chaka Chaka

This overview study set out to compare and synthesise the findings of review studies conducted on predicting student academic performance (SAP) in higher education using educational data mining (EDM) methods, EDM algorithms and EDM tools from 2013 to June 2020. It conducted multiple searches for suitable and relevant peer-reviewed articles on two online search engines, on nine online databases, and on two online academic social networks. It, then, selected 26 eligible articles from 2,050 articles. Some of the findings of this overview study are worth mentioning. First, only 2 studies explicitly stated their precise sample sizes with maths and science as the two most mentioned subject areas. Second, 16 review studies had purposes related to either EDM techniques, EDM methods, EDM models, or EDM algorithms employed to predict SAP and student success in the higher education sector. Third, there are six commonly used typologies of input variables reported by 26 review studies, of which student demographics was the most commonly utilised variable for predicting SAP. Fourth and last, seven common EDM algorithms employed for predicting SAP were identified, of which Decision Tree emerged both as the most used algorithm and as the algorithm with the highest prediction accuracy rate for predicting SAP.


Author(s):  
Garima Jaiswal ◽  
Arun Sharma ◽  
Reeti Sarup

Machine learning aims to give computers the ability to automatically learn from data. It can enable computers to make intelligent decisions by recognizing complex patterns from data. Through data mining, humongous amounts of data can be explored and analyzed to extract useful information and find interesting patterns. Classification, a supervised learning technique, can be beneficial in predicting class labels for test data by referring the already labeled classes from available training data set. In this chapter, educational data mining techniques are applied over a student dataset to analyze the multifarious factors causing alarmingly high number of dropouts. This work focuses on predicting students at risk of dropping out using five classification algorithms, namely, K-NN, naive Bayes, decision tree, random forest, and support vector machine. This can assist in improving pedagogical practices in order to enhance the performance of students predicted at risk of dropping out, thus reducing the dropout rates in higher education.


2018 ◽  
Vol 27 (3S) ◽  
pp. 513-525 ◽  
Author(s):  
Bruno Elias Penteado ◽  
Paula Maria Pereira Paiva ◽  
Marina Morettin-Zupelari ◽  
Seiji Isotani ◽  
Deborah Viviane Ferrari

Purpose This article introduces concepts and a general taxonomy used by the educational data mining (EDM) community, as well as examples of their applications, with the aims of providing audiology educators with a referential basis for developing this area. Method A narrative review was carried out to present an overview of EDM and its main methods. Some of these methods were exemplified with analysis of real data from an Internet-based specialization course on pediatric auditory rehabilitation. Results The review introduced EDM main concepts and applications and described methods from its area. Real data examples illustrated EDM use to predict interpersonal help-seeking, model interpersonal interaction, analyze students' trajectories within a course's module, and understand how students approached group assignments. Some of the insights provided by EDM to support teaching and learning processes were also described. Conclusions EDM methods offer new tools to discover knowledge from digital traces (i.e., logs) and support key stakeholders (students, instructors, or course administrators) to raise awareness about course dynamics. This approach has the potential to foster a better understanding and management of educational processes in audiology distance education.


2017 ◽  
Vol 7 (1.2) ◽  
pp. 43 ◽  
Author(s):  
K. Sreenivasa Rao ◽  
N. Swapna ◽  
P. Praveen Kumar

Data Mining is the process of extracting useful information from large sets of data. Data mining enablesthe users to have insights into the data and make useful decisions out of the knowledge mined from databases. The purpose of higher education organizations is to offer superior opportunities to its students. As with data mining, now-a-days Education Data Mining (EDM) also is considered as a powerful tool in the field of education. It portrays an effective method for mining the student’s performance based on various parameters to predict and analyze whether a student (he/she) will be recruited or not in the campus placement. Predictions are made using the machine learning algorithms J48, Naïve Bayes, Random Forest, and Random Tree in weka tool and Multiple Linear Regression, binomial logistic regression, Recursive Partitioning and Regression Tree (rpart), conditional inference tree (ctree) and Neural Network (nnet) algorithms in R studio. The results obtained from each approaches are then compared with respect to their performance and accuracy levels by graphical analysis. Based on the result, higher education organizations can offer superior training to its students.


2019 ◽  
Vol 9 (3) ◽  
pp. 4287-4291 ◽  
Author(s):  
M. Alsuwaiket ◽  
A. H. Blasi ◽  
R. A. Al-Msie'deen

The choice of an effective student assessment method is an issue of interest in Higher Education. Various studies [1] have shown that students tend to get higher marks when assessed through coursework-based assessment methods which include either modules that are fully assessed through coursework or a mixture of coursework and examinations than assessed by examination alone. There are a large number of educational data mining (EDM) studies that pre-process data through conventional data mining processes including data preparation process, but they are using transcript data as they stand without looking at examination and coursework results weighting which could affect prediction accuracy. This paper proposes a different data preparation process through investigating more than 230,000 student records in order to prepare students’ marks based on the assessment methods of enrolled modules. The data have been processed through different stages in order to extract a categorical factor through which students’ module marks are refined during the data preparation process. The results of this work show that students’ final marks should not be isolated from the nature of the enrolled module’s assessment methods. They must rather be investigated thoroughly and considered during EDM’s data pre-processing phases. More generally, it is concluded that educational data should not be prepared in the same way as other data types due to differences as data sources, applications, and types of errors in them. Therefore, an attribute, coursework assessment ratio (CAR), is proposed to be used in order to take the different modules’ assessment methods into account while preparing student transcript data. The effect of CAR on prediction process using the random forest classification technique has been investigated. It is shown that considering CAR as an attribute increases the accuracy of predicting students’ second-year averages based on their first-year results.


2019 ◽  
Vol 2 (2) ◽  
pp. 77-84
Author(s):  
Frank Rogers

Educational data mining is the process of converting raw data from educational systems to useful information that can be used by educational software developers, students, teachers, parents, and other educational researchers. Fuzzy educational datasets are datasets consisting of uncertain values. The purpose of this study is to develop and test a classification model under uncertainty unique to the modern student. This is done by developing a model of the uncertain data that come from an educational setting with Linear Fuzzy Real data. Machine learning was then used to understand students and their optimal learning environment. The ability to predict student performance is important in a web or online environment. This is true in the brick and mortar classroom as well and is especially important in rural areas where academic achievement is lower than ideal.


2021 ◽  
Vol 11 (1) ◽  
pp. 26-35
Author(s):  
Yulison Herry Chrisnanto ◽  
◽  
Gunawan Abdullah ◽  

Education is an important thing in a person's life, because by having adequate education, one's life will be better. Education can be obtained formally through formal institutions that constructively provide a person's abilities academically. This study aims to determine student performance in terms of academic and non-academic domains at a certain time during their education using techniques in data mining (DM) which are directed towards academic data analysis. Academic performance is delivered through the Educational Data Mining (EDM) integrated data mining model, in which the techniques used include classification (ID3, SVM), clustering (k-Means, k-Medoids), association rules (Apriori) and anomaly detection (DBSCAN). The data set used is academic data in the form of study results over a certain period of time. The results of EDM can be used for analysis related to academic performance which can be used for strategic decision making in aca-demic management at higher education institutions. The results of this study indicate that the use of several techniques in data mining together can maximize the ability to analyze academic performance with the same data source and produce different analysis patterns.


Sign in / Sign up

Export Citation Format

Share Document