Early Identification of At-Risk Students Using Iterative Logistic Regression

Author(s):  
Li Zhang ◽  
Huzefa Rangwala
Author(s):  
Dennis Foung

Use of algorithms and data mining approaches are not new to Industry 4.0. However, these may not be common for students and educators in higher education. This chapter compares various classification techniques: classification tree, logistic regression, and artificial neural networks (ANN). The comparison focuses on each method's accuracy, algorithm, and practicality in higher education. This study made use of a dataset from two academic writing courses in a university in Hong Kong with more than 5,000 records. Results suggest that classification trees and logistic regression can be easily used in the higher education context, but ANN may not be applicable in higher educational settings. The research team suggests that higher education administrators take this research forward and design platforms to realize these classification algorithms to predict at-risk students.


2017 ◽  
Vol 1 (1) ◽  
pp. 8-18
Author(s):  
Adam Christian Haupt ◽  
Jonathan Alt ◽  
Samuel Buttrey

Purpose This paper aims to use a data-driven approach to identify the factors and metrics that provide the best indicators of academic attrition in the Korean language program at the Defense Language Institute Foreign Language Center. Design methodology approach This research develops logistic regression models to aid in the identification of at-risk students in the Defense Language Institute’s Korean language school. Findings The results from this research demonstrates that this methodology can detect significant factors and metrics that identify students at-risk. Additionally, this research shows that school policy changes can be detected using logistic regression models and stepwise regression. Originality value This research represents a real-world application of logistic regression modeling methods applied to the problem of identifying at-risk students for the purpose of academic intervention or other negative outcomes. By using logistic regression, the authors are able to gain a greater understanding of the problem and identify statistically significant predictors of student attrition that they believe can be converted into meaningful policy change.


Author(s):  
Joseph G. Glynn ◽  
Paul L. Sauer ◽  
Thomas E. Miller

A logistic regression model will be developed to provide early identification of freshmen at risk of attrition. The early identification is accomplished literally within a couple of weeks after freshman orientation. The dependent variable of interest is persistence, and it is a binary, nominal variable. Students who proceed from freshman matriculation to graduation without ever having dropped out are labeled persistors. Freshman matriculates who leave college either temporarily or permanently are classified as dropouts. The independent variables employed to predict attrition include demographics, high school experiences, and attitudes, opinions, and values as reported on a survey administered during freshman orientation. The model and its results will be presented along with a brief description of the institutional intervention program designed to enhance student persistence.


Author(s):  
Cheng-Huan Chen ◽  
Stephen J. H. Yang ◽  
Jian-Xuan Weng ◽  
Hiroaki Ogata ◽  
Chien-Yuan Su

Providing early predictions of academic performance is necessary for identifying at-risk students and subsequently providing them with timely intervention for critical factors affecting their academic performance. Although e-book systems are often used to provide students with teaching/learning materials in university courses, seldom has research made the early prediction based on their online reading behaviours by implementing machine learning classifiers. This study explored to what extent university students’ academic achievement can be predicted, based on their reading behaviours in an e-book supported course, using the classifiers. It further investigated which of the features extracted from the reading logs influence the predictions. The participants were 100 first-year undergraduates enrolled in a compulsory course at a university in Taiwan. The results suggest that logistic regression supports vector classification, decision trees, and random forests, and neural networks achieved moderate prediction performance with accuracy, precision, and recall metrics. The Bayes classifier identified almost all at-risk students. Additionally, student online reading behaviours affecting the prediction models included: turning pages, going back to previous pages and jumping to other pages, adding/deleting markers, and editing/removing memos. These behaviours were significantly positively correlated to academic achievement and should be encouraged during courses supported by e-books. Implications for practice or policy: For identifying at-risk students, educators could prioritise using Gaussian naïve Bayes in an e-book supported course, as it shows almost perfect recall performance. Assessors could give priority to logistic regression and neural networks in this context because they have stable achievement prediction performance with different evaluation metrics. The prediction models are strongly affected by student online reading behaviours, in particular by locating/returning to relevant pages and modifying markers.


Sign in / Sign up

Export Citation Format

Share Document