Institutional Research Using Data Mining

Author(s):  
Constanta-Nicoleta Bodea ◽  
Vasile Bodea ◽  
Radu Mogos

The aim of this chapter is to explore the application of data mining for analyzing academic performance in connection with the participatory behavior of the students enrolled in an online two-year Master degree program in project management. The main data sources were the operational database with the students’ records and the log files and statistics provided by the e-learning platform. One hundred eighty-one enrolled students, and more than 150 distinct characteristics/ variables per student were used. Due to the large number of variables, an exploratory data analysis through data mining was chosen, and a model-based discovery approach was designed and executed in Weka environment. The association rules, clustering, and classification were applied in order to identify the factors explaining the students’ performance and the relationship between academic performance and behavior in the virtual learning environment. Data mining has revealed interesting patterns in data. These patterns indicate that academic performance is related to the intensity of the student activities in virtual environment. If the student understands how to work and she/he is motivated to communicate with others, then he might have a good academic performance. Based on clustering analysis, different student profiles were discovered, explaining the academic performance. The results are very encouraging and suggest several future developments.

Author(s):  
Constanta-Nicoleta Bodea ◽  
Vasile Bodea ◽  
Ion Gh. Rosca ◽  
Radu Mogos ◽  
Maria-Iuliana Dascalu

The aim of this chapter is to explore the application of data mining for analyzing participatory behavior of the students enrolled in an online two-year Master degree programme in Project Management. The main data sources were the operational database with the students’ records and the log files and statistics provided by the e-learning platform. 129 enrolled students and more than 195 distinct characteristics/ variables per student were used. Due to the large number of variables, an exploratory data analysis through data mining is decided, and a model-based discovery approach was designed and executed in Weka environment. The association rules, clustering, and classification were applied in order to describe the participatory behavior of the students, as well as to identify the factors explaining the students’ behavior, and the relationship between academic performance and behavior in the virtual learning environment. The results are very encouraging and suggest several future developments.


Author(s):  
Eric Araka ◽  
Robert Oboko ◽  
Elizaphan Maina ◽  
Rhoda K. Gitonga

Self-regulated learning is attracting tremendous researches from various communities such as information communication technology. Recent studies have greatly contributed to the domain knowledge that the use self-regulatory skills enhance academic performance. Despite these developments in SRL, our understanding on the tools and instruments to measure SRL in online learning environments is limited as the use of traditional tools developed for face-to-face classroom settings are still used to measure SRL on e-learning systems. Modern learning management systems (LMS) allow storage of datasets on student activities. Subsequently, it is now possible to use Educational Data Mining to extract learner patterns which can be used to support SRL. This chapter discusses the current tools for measuring and promoting SRL on e-learning platforms and a conceptual model grounded on educational data mining for implementation as a solution to promoting SRL strategies.


2017 ◽  
Vol 9 (1) ◽  
pp. 38-49
Author(s):  
Fatma Önay Koçoğlu ◽  
İlkim Ecem Emre ◽  
Çiğdem Selçukcan Erol

The aim of this study is to analyze success in e-learning with data mining methods and find out potential patterns. In this context, 374.073 data of 2013-14 period taken from an institution serving in e-learning field in Turkey are used. Data set, which is collected from information technology, banking and pharmaceutical industries, includes success and industry of employees', trainings which they complete, whether the trainings are completed, first login and last logout dates, training completion date and duration of experience in training. Using this data set, success status of participants is observed by using data mining methods (C5.0, Random Forest and Gini). By observing using accuracy, error rate, specificity and f- score from performance evaluation criteria, C5.0 has chosen the algorithm which gives the best performance results. According to the results of the study, it has been determined that the sectors of the employees are not important, on the contrary the ones that are important are the completion status, the duration of experience and training.


Author(s):  
Jayanti Mehra ◽  
Ramjeevan Singh Thakur

Weblog analysis takes raw data from access logs and performs study on this data for extracting statistical information. This info incorporates a variety of data for the website activity such as average no. of hits, total no. of user visits, failed and successful cached hits, average time of view, average path length over a website; analytical information such as page was not found errors and server errors; server information, which includes exit and entry pages, single access pages, and top visited pages; requester information like which type of search engines is used, keywords and top referring sites, and so on. In general, the website administrator uses this kind of knowledge to make the system act better, helping in the manipulation process of site, then also forgiving marketing decisions support. Most of the advanced web mining systems practice this kind of information to take out more difficult or complex interpretations using data mining procedures like association rules, clustering, and classification.


Sign in / Sign up

Export Citation Format

Share Document