Predictive Models as Early Warning Systems: A Bayesian Classification Model to Identify At-Risk Students of Programming




Veerasamy Ashok Kumar, Laakso Mikko-Jussi, D'Souza Daryl, Salakoski Tapio

Kohei Arai

Computing Conference

2021

Lecture notes in networks and systems

Intelligent Computing: Proceedings of the 2021 Computing Conference, Volume 2

Lecture Notes in Networks and Systems

284

174

195

978-3-030-80125-0

978-3-030-80126-7

2367-3370

DOIhttps://doi.org/10.1007/978-3-030-80126-7_14

https://link.springer.com/chapter/10.1007%2F978-3-030-80126-7_14

https://research.utu.fi/converis/portal/detail/Publication/62108806



The pursuit of a deeper understanding of factors that influence student performance outcomes has long been of interest to the computing education community. Among these include the development of effective predictive models to predict student academic performance. Predictive models may serve as early
warning systems to identify students at risk of failing or quitting early. This paper presents a class of machine learning predictive models based on Naive Bayes classification, to predict student performance in introductory programming. The models use formative assessment tasks and self-reported cognitive features such as prior programming knowledge and problem-solving skills. Our analysis revealed that the use of just three variables was a good fit for the models employed. The models that used in-class assessment and cognitive features as predictors returned best at-risk prediction accuracies, compared with models that used take-home assessment and cognitive features as predictors. The prediction accuracy in identifying
at-risk students on unknown data for the coursewas 71% (overall prediction accuracy) in compliance with the area under the curve (ROC) score (0.66). Based on these results we present a generic predictive model and its potential application as an early warning system for early identification of students at risk.


Last updated on 2024-26-11 at 23:12