TY - JOUR
T1 - A comparative analysis of machine learning techniques for student retention management
AU - Delen, Dursun
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2010/5/1
Y1 - 2010/5/1
N2 - Student retention is an essential part of many enrollment management systems. It affects university rankings, school reputation, and financial wellbeing. Student retention has become one of the most important priorities for decision makers in higher education institutions. Improving student retention starts with a thorough understanding of the reasons behind the attrition. Such an understanding is the basis for accurately predicting at-risk students and appropriately intervening to retain them. In this study, using five years of institutional data along with several data mining techniques (both individuals as well as ensembles), we developed analytical models to predict and to explain the reasons behind freshmen student attrition. The comparative analyses results showed that the ensembles performed better than individual models, while the balanced dataset produced better prediction results than the unbalanced dataset. The sensitivity analysis of the models revealed that the educational and financial variables are among the most important predictors of the phenomenon.
AB - Student retention is an essential part of many enrollment management systems. It affects university rankings, school reputation, and financial wellbeing. Student retention has become one of the most important priorities for decision makers in higher education institutions. Improving student retention starts with a thorough understanding of the reasons behind the attrition. Such an understanding is the basis for accurately predicting at-risk students and appropriately intervening to retain them. In this study, using five years of institutional data along with several data mining techniques (both individuals as well as ensembles), we developed analytical models to predict and to explain the reasons behind freshmen student attrition. The comparative analyses results showed that the ensembles performed better than individual models, while the balanced dataset produced better prediction results than the unbalanced dataset. The sensitivity analysis of the models revealed that the educational and financial variables are among the most important predictors of the phenomenon.
KW - Classification
KW - Machine learning
KW - Prediction
KW - Retention management
KW - Sensitivity analysis
KW - Student attrition
UR - http://www.scopus.com/inward/record.url?scp=78049421754&partnerID=8YFLogxK
U2 - 10.1016/j.dss.2010.06.003
DO - 10.1016/j.dss.2010.06.003
M3 - Article
AN - SCOPUS:78049421754
SN - 0167-9236
VL - 49
SP - 498
EP - 506
JO - Decision Support Systems
JF - Decision Support Systems
IS - 4
ER -