Analyzing feature importance for a predictive undergraduate student dropout model

Alberto Jiménez-Macias1, Pedro Manuel Moreno-Marcos1, Pedro J. Muñoz-Merino1, Margarita Ortiz-Rojas2, 3 and Carlos Delgado Kloos1

  1. Universidad Carlos III de Madrid, Avda de la Universidad, 30
    E-28911, Leganes, Spain
    {albjimen,pemoreno,pedmume,cdk}@it.uc3m.es
  2. Escuela Superior Politécnica del Litoral, ESPOL, ,Campus Gustavo Galindo Km. 30.5 Vía Perimetral
    EC-090112 , Guayaquil, Ecuador
    maelorti@espol.edu.ec
  3. Ghent University, Henri Dunantlaan 2
    B-9000, Ghent, Belgium
    margaritaelizabeth.ortizrojas@ugent.be

Abstract

Worldwide, one of the main concerns of universities is to reduce the dropout rate. Several initiatives have been taken to avoid this problem; however, it is essential to recognize at-risk students as early as possible. This article is an extension of a previous study that proposed a predictive model to identify students at risk of dropout from the beginning of their university degree. The new contribution is the analysis of the feature importance for dropout segmented by faculty, degree program, and semester in the different predictive models. In addition, we propose a dropout model based on faculty characteristics to try to infer the dropout based on faculty features. We used data of 30,576 students enrolled in a Higher Education Institution ranging from years 2000 to 2020. The findings indicate that the variables related to Grade Point Average(GPA), socioeconomic factor, and a pass rate of courses taken have a more significant impact on the model, regardless of the semester, faculty, or program. Additionally, we found a significant difference in the predictive power between Science, Technology, Engineering, and Mathematics (STEM) and humanistic programs.

Key words

dropout model, features importance, data mining, learning analytics

Digital Object Identifier (DOI)

https://doi.org/10.2298/CSIS211110050J

Publication information

Volume 20, Issue 1 (January 2023)
Year of Publication: 2023
ISSN: 2406-1018 (Online)
Publisher: ComSIS Consortium

Full text

DownloadAvailable in PDF
Portable Document Format

How to cite

Jiménez-Macias, A., Moreno-Marcos, P. M., Muñoz-Merino, P. J., Ortiz-Rojas, M., Kloos, C. D.: Analyzing feature importance for a predictive undergraduate student dropout model. Computer Science and Information Systems, Vol. 20, No. 1, 175–194. (2023), https://doi.org/10.2298/CSIS211110050J