scholarly journals The Role of Machine Learning and Data Mining Techniques in Predicting Students’ Academic Performance

Author(s):  
Dr. Aliyu Y. Rufai ◽  
Dr.Hassan U. Suru ◽  
James Afrifa

The advancement in Information Technology makes it easier and cheaper to collect large amounts of data, but if this data is not further analyzed, it remains only huge amounts of data. These large amounts of data set have motivated research and development in various fields to extract meaningful information with a view of analyzing it to solve complex problem. With new methods and techniques, data can be analyze and be of great advantage. Data mining and machine learning are two computing disciplines that enable analysis of large data sets using different techniques. This paper gave an overview of several applications using these disciplines in education, with focus on student’s academic performance prediction. Early prediction of students’ performance is useful in taking early action of improving learning outcome. The perfect methods for this are machine learning and data mining. This paper also discusses special use of data mining in education, called educational data mining. Educational Data Mining (EDM) uses different methods and techniques from machine learning, statistics, data mining and data analysis, to analyze data collected during teaching and learning. The goal of this paper is to introduce the role of machine learning and data mining in predicting student’s academic performance and to present its applications and benefits

2021 ◽  
Vol 11 (1) ◽  
pp. 26-35
Author(s):  
Yulison Herry Chrisnanto ◽  
◽  
Gunawan Abdullah ◽  

Education is an important thing in a person's life, because by having adequate education, one's life will be better. Education can be obtained formally through formal institutions that constructively provide a person's abilities academically. This study aims to determine student performance in terms of academic and non-academic domains at a certain time during their education using techniques in data mining (DM) which are directed towards academic data analysis. Academic performance is delivered through the Educational Data Mining (EDM) integrated data mining model, in which the techniques used include classification (ID3, SVM), clustering (k-Means, k-Medoids), association rules (Apriori) and anomaly detection (DBSCAN). The data set used is academic data in the form of study results over a certain period of time. The results of EDM can be used for analysis related to academic performance which can be used for strategic decision making in aca-demic management at higher education institutions. The results of this study indicate that the use of several techniques in data mining together can maximize the ability to analyze academic performance with the same data source and produce different analysis patterns.


Author(s):  
Abdulazeez Yusuf ◽  
Ayuba John

The increasing need for data driven decision making recently has resulted in the application of data mining in various fields including the educational sector which is referred to as educational data mining. The need for improving the performance of data mining models has also been identified as a gap for future researcher. In Nigeria, higher educational institutions collect various students’ data, but these data are rarely used in any decision or policy making to improve the academic performance of students. This research work, attempts to improve the performance of data mining models for predicting students’ academic performance using stacking classifiers ensemble and synthetic minority over-sampling techniques. The research was conducted by adopting and evaluating the performance of J48, IBK and SMO classifiers. The individual classifiers models, standard stacking classifier ensemble model and stacking classifiers ensemble model were trained and tested on 206 students’ data set from the faculty of science federal university Dutse. Students’ specific previous academic performance records at Unified Tertiary Matriculation Examination, Senior Secondary Certificate Examination and first year Cumulative Grade Point Average of students are used as data inputs in WEKA 3.9.1 data mining tool to predict students’ graduation classes of degrees at undergraduate level. The result shows that application of synthetic minority over-sampling technique for class balancing improves all the various models performance with the proposed modified stacking classifiers ensemble model outperforming the various classifiers models in both performance accuracy and RSME values making it the best model.<strong></strong>


Author(s):  
Adeel Ahmed ◽  
Kamlesh Kumar ◽  
Mansoor A. Khuhro ◽  
Asif A. Wagan ◽  
Imtiaz A. Halepoto ◽  
...  

Nowadays, educational data mining is being employed as assessing tool for study and analysis of hidden patterns in academic databases which can be used to predict student’s academic performance. This paper implements various machine learning classification techniques on students’ academic records for results predication. For this purpose, data of MS(CS) students were collected from a public university of Pakistan through their assignments, quizzes, and sessional marks. The WEKA data mining tool has been used for performing all experiments namely, data pre-processing, classification, and visualization. For performance measure, classifier models were trained with 3- and 10-fold cross validation methods to evaluate classifiers' accuracy. The results show that bagging classifier combined with support vector machines outperform other classifiers in terms of accuracy, precision, recall, and F-measure score. The obtained outcomes confirm that our research provides significant contribution in prediction of students’ academic performance which can ultimately be used to assists faculty members to focus low grades students in improving their academic records.


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Adel Bessadok ◽  
Ehab Abouzinadah ◽  
Osama Rabie

Purpose This paper aims to investigate the relationship between the students’ digital activities and their academic performance through two stages. In the first stage, students’ digital activities were studied and clustered based on the attributes of their activity log of learning management system (LMS) data set. In the second stage, the significance of the relationship between these profiles and the associated academic performance was tested statistically. Design/methodology/approach The LMS delivers E-learning courses and keeps track of the students’ activities. Investigating these students’ digital activities became a real challenge. The diversity of students’ involvement in the learning process was proven through the LMS which characterize students’ specific profiles. The Educational Data Mining (EDM) approach was used to discover students’ learning profiles and associated academic performances, where the activity log file exemplified their activities hosted in the LMS. The sample study data is from an undergraduate e-course hosted on the platform of Blackboard LMS offered at a Saudi University during the first semester of the 2019–2020 academic year. The chosen undergraduate course had 25 sections, and the students attending came from science, technology, engineering and math background. Findings Results show three clusters based on the digital activities of the students. The correlation test shows the statistical significance and proves the effect of the student’s profile on his academic performance. The data analysis shows that students with different profiles can still get similar academic performance using LMS. Originality/value This empirical study emphasizes the importance of the EDM approach using clustering techniques which can help the instructor understand how students use the provided LMS content to learn and then can deliver them the best educational experience.


2019 ◽  
Vol 8 (4) ◽  
pp. 6566-6570

Data mining technologies allow collection, storage and processing huge amounts of data and carrying a large variety of data types and samples. Predicting academic performance of student is the most successive research in this era. Previous research work researchers are used different classification algorithm to predict the student performance. There is lot of research work to be taken in the field of educational data mining and big data in education to increase the accuracy of the classification algorithm and predict the academic performance of student. In this research work we used hybrid classification algorithm for predicting the performance of students. Two Popular classification algorithms ID3 and J48 were applied on the data set. To make hybrid classification voting technique is applied using weka machine learning tool. In this work we tested how the hybrid algorithm accurately predicts the student data set. To check the predicted result classification accuracy was computed. This hybrid classification algorithm gives accuracy with 62.67%.


Author(s):  
Alisa Bilal Zorić

We live in a world where we collect huge amounts of data, but if this data is not further analyzed, it remains only huge amounts of data. With new methods and techniques, we can use this data, analyze it and get a great advantage. The perfect method for this is data mining. Data mining is the process of extracting hidden and useful information and patterns from large data sets. Its application in various areas such as finance, telecommunications, healthcare, sales marketing, banking, etc. is already well known. In this paper, we want to introduce special use of data mining in education, called educational data mining. Educational Data Mining (EDM) is an interdisciplinary research area created as the application of data mining in the educational field. It uses different methods and techniques from machine learning, statistics, data mining and data analysis, to analyze data collected during teaching and learning. Educational Data Mining is the process of raw data transformation from large educational databases to useful and meaningful information which can be used for a better understanding of students and their learning conditions, improving teaching support as well as for decision making in educational systems.The goal of this paper is to introduce educational data mining and to present its application and benefits.


2021 ◽  
pp. 073563312110487
Author(s):  
Ruangsak Trakunphutthirak ◽  
Vincent C. S. Lee

Educators in higher education institutes often use statistical results obtained from their online Learning Management System (LMS) dataset, which has limitations, to evaluate student academic performance. This study differs from the current body of literature by including an additional dataset that advances the knowledge about factors affecting student academic performance. The key aims of this study are fourfold. First, is to fill the educational literature gap by applying machine learning techniques in educational data mining, making use of the Internet usage behaviour log files and LMS data. Second, LMS data and Internet usage log files were analysed with machine learning techniques for predicting at-risk-of-failure students, with greater explanation added by combining student demographic data. Third, the demographic features help to explain the prediction in understandable terms for educators. Fourth, the study used a range of Internet usage data, which were categorized according to type of usage data and type of web browsing data to increase prediction accuracy.


2014 ◽  
Vol 13 (9) ◽  
pp. 5020-5028
Author(s):  
Anurag Jindal ◽  
Er. Williamjeet Singh

Currently there is an increasing interest in data mining and educational systems, making educational data mining as a new growing research community. Higher education, throughout the world is delivered through universities, colleges affiliated to various universities and some other recognized academic institutes. The main objective of higher education institutes is to provide quality education to its students. Indian education sector has a lot of data that can produce valuable information which can be used to increase the quality of education. Good prediction of student’s success in higher learning institution is one way to reach the higher level of quality in higher education system. In this paper we analyzed the potential use of data mining in education section and survey the most relevant work in this area. Data Mining can be used for dropout students, student’s academic performance, teacher’s performance and student’s complaints. As we know large amount of data is stored in educational database, so in order to get required data and to find the hidden relationship, different data mining techniques are developed & used. Various algorithms and data mining techniques like Classification, Clustering, Regression, Artificial Intelligence, Neural Networks, Association Rules, Decision Trees (CART and CHIAD), Genetic algorithms, Nearest Neighbor method etc. are used for knowledge discovery from databases and helps in prediction of students academic performance. In future work we can apply different data mining techniques on an expanded data set with more distinct attributes to get more accurate results.


2020 ◽  
Author(s):  
Marc Philipp Bahlke ◽  
Natnael Mogos ◽  
Jonny Proppe ◽  
Carmen Herrmann

Heisenberg exchange spin coupling between metal centers is essential for describing and understanding the electronic structure of many molecular catalysts, metalloenzymes, and molecular magnets for potential application in information technology. We explore the machine-learnability of exchange spin coupling, which has not been studied yet. We employ Gaussian process regression since it can potentially deal with small training sets (as likely associated with the rather complex molecular structures required for exploring spin coupling) and since it provides uncertainty estimates (“error bars”) along with predicted values. We compare a range of descriptors and kernels for 257 small dicopper complexes and find that a simple descriptor based on chemical intuition, consisting only of copper-bridge angles and copper-copper distances, clearly outperforms several more sophisticated descriptors when it comes to extrapolating towards larger experimentally relevant complexes. Exchange spin coupling is similarly easy to learn as the polarizability, while learning dipole moments is much harder. The strength of the sophisticated descriptors lies in their ability to linearize structure-property relationships, to the point that a simple linear ridge regression performs just as well as the kernel-based machine-learning model for our small dicopper data set. The superior extrapolation performance of the simple descriptor is unique to exchange spin coupling, reinforcing the crucial role of choosing a suitable descriptor, and highlighting the interesting question of the role of chemical intuition vs. systematic or automated selection of features for machine learning in chemistry and material science.


Sign in / Sign up

Export Citation Format

Share Document