Using LMS Log Data to Identify At-Risk Students: A Systematic Review of Machine Learning Approaches and Bibliographic Analysis

Authors

  • B. Karim-Abdallah University of Energy and Natural Resources, Sunyani, Ghana
  • B. A. Weyori University of Energy and Natural Resources Sunyani, Ghana
  • P. K. Mensah University of Energy and Natural Resources Sunyani, Ghana

DOI:

https://doi.org/10.26437/ajar.v11i2.1038

Keywords:

At-Risk Students. drop-out. e-learning. learning management system. log data.

Abstract

Purpose: This study evaluates the effectiveness of machine learning algorithms in predicting student dropout using Learning Management System (LMS) log-in data.

Design/ Methodology/ Approach: The study used a systematic literature review and bibliographic analysis. The search encompassed papers from the Scopus database up to October 2024. Initially, 100 articles were identified. After applying exclusion criteria, including removing editorials, letters, comments, and conference papers, 61 studies were chosen for the final review. Performance criteria such as accuracy, precision, recall, and f1-score were employed to assess these studies.

Research Limitation: Several limitations were acknowledged, including potential publication bias due to the inclusion of only peer-reviewed articles, variability in educational contexts and LMS platforms, and heterogeneity in machine learning methods and performance metrics.

Findings: Random Forest emerged as the most commonly used machine learning algorithm for identifying at-risk students, followed by Convolutional Neural Networks.  In the analysed research, Random Forest outperformed all other algorithms, achieving a 99% accuracy rate in predicting at-risk students. Students' assessment scores emerged as the most significant feature in the model performances, followed by students' participation in a session.                             

Practical Implication: It is noted that most researchers do not report on significant features/variables or the contribution of features to the model’s performances.

Social Implication: These significant features or variables are essential for institutions employing a blended learning approach, as they provide insights into where to allocate limited resources most effectively.                                                                                        

Originality/Value: This study contributed to the pool of knowledge on how Machine learning techniques have been employed with Learning Management System log-in data to predict student dropout.

Author Biographies

  • B. Karim-Abdallah, University of Energy and Natural Resources, Sunyani, Ghana

    Mr. Bright Karim-Abdallah is a Research Fellow and Acting Head of the Quality Assurance Department at the Quality Assurance and Academic Planning Directorate, University of Energy and Natural Resources, Ghana.

  • B. A. Weyori, University of Energy and Natural Resources Sunyani, Ghana

    Prof. Benjamin Asubam Weyori  is an Associate Professor and the Head of the Computer and Electrical Engineering Department at the University of Energy and Natural Resources, Ghana.  

  • P. K. Mensah, University of Energy and Natural Resources Sunyani, Ghana

    Prof. Patrick Kwabena Mensah is an Associate Professor and the Head of the Computer Science and Informatics Department at the University of Energy and Natural Resources, Ghana.

References

Adejo, O. W., & Connolly, T. (2018). Predicting student academic performance using multi-model heterogeneous ensemble approach. Journal of Applied Research in Higher Education, 10(1), 61–75. https://doi.org/10.1108/JARHE-09-2017-0113

Ahajjam, T., Moutaib, M., Aissa, H., Azrour, M., Farhaoui, Y., & Fattah, M. (2022). Predicting Students’ Final Performance Using Artificial Neural Networks. Big Data Mining and Analytics, 5(4), 294–301. https://doi.org/10.26599/BDMA.2021.9020030

Ahmed, E. (2024). Student Performance Prediction Using Machine Learning Algorithms. Applied Computational Intelligence and Soft Computing, 2024. https://doi.org/10.1155/2024/4067721

Alsulami, A. A., AL-Ghamdi, A. S. A. M., & Ragab, M. (2023). Enhancement of E-Learning Student’s Performance Based on Ensemble Techniques. Electronics (Switzerland), 12(6). https://doi.org/10.3390/electronics12061508

Arizmendi, C. J., Bernacki, M. L., Raković, M., Plumley, R. D., Urban, C. J., Panter, A. T., Greene, J. A., & Gates, K. M. (2023). Predicting student outcomes using digital logs of learning behaviors: Review, current standards, and suggestions for future work. Behavior Research Methods, 55(6), 3026–3054. https://doi.org/10.3758/s13428-022-01939-9

Baker, R. S., Lindrum, D., Lindrum, M. J., & Perkowski, D. (n.d.). Analyzing Early At-Risk Factors in Higher Education e-Learning Courses.

Bañeres, D., Rodríguez-González, M. E., Guerrero-Roldán, A. E., & Cortadas, P. (2023). An early warning system to identify and intervene online dropout learners. International Journal of Educational Technology in Higher Education, 20(1). https://doi.org/10.1186/s41239-022-00371-5

Bervell, B., & Umar, I. N. (2017). A decade of LMS acceptance and adoption research in Sub-Sahara African higher education: A systematic review of models, methodologies, milestones and main challenges. Eurasia Journal of Mathematics, Science and Technology Education, 13(11), 7269–7286. https://doi.org/10.12973/ejmste/79444

Chung, J. Y., & Lee, S. (2019). Dropout early warning systems for high school students using machine learning. Children and Youth Services Review, 96, 346–353. https://doi.org/10.1016/j.childyouth.2018.11.030

Dasi, H., & Kanakala, S. (n.d.). International Journal of INTELLIGENT SYSTEMS AND APPLICATIONS IN ENGINEERING Student Dropout Prediction Using Machine Learning Techniques. In Original Research Paper International Journal of Intelligent Systems and Applications in Engineering IJISAE (Vol. 2022, Issue 4). www.ijisae.org

Figueroa-Canas, J., & Sancho-Vinuesa, T. (2020). Early prediction of dropout and final exam performance in an online statistics course. Revista Iberoamericana de Tecnologias Del Aprendizaje, 15(2), 86–94. https://doi.org/10.1109/RITA.2020.2987727

Fu, Q., Gao, Z., Zhou, J., & Zheng, Y. (2021). CLSA: A novel deep learning model for MOOC dropout prediction. Computers and Electrical Engineering, 94. https://doi.org/10.1016/j.compeleceng.2021.107315

Kamal, P., & Ahuja, S. (2019). An ensemble-based model for prediction of academic performance of students in undergrad professional course. Journal of Engineering, Design and Technology, 17(4), 769–781. https://doi.org/10.1108/JEDT-11-2018-0204

Karim-Abdallah, B., Ayitey Junior, M., Appiahene, P., Harris, E., & Binful, D. K. (2025a). Application of Machine Learning Algorithms in Predicting Academic Performance of Students in Higher Education Institutes (HEIs): A Systematic Review and Bibliographic Analysis. AFRICAN JOURNAL OF APPLIED RESEARCH, 11(1), 536–559. https://doi.org/10.26437/ajar.v11i1.869

Karim-Abdallah, B., & Harris, E. (2022). Modelling Customer Switching for Banks in Ghana. Journal of Energy and Natural Resource Management (JENRM), 8(1), 17–26. https://doi.org/10.26796/jenrm.v8i1.188

Karim-Abdallah, B., Okai Darko, G., Gyabaah, O., Oteng Fening, L., Chimsi, I., Derkyi, M. A. A., & Yeboah-Kyereh, A. (2025). Innovations, Technologies and Challenges Associated with Transnational Education. AFRICAN JOURNAL OF APPLIED RESEARCH, 11(1), 560–587. https://doi.org/10.26437/ajar.v11i1.870

Mertova, Patricie., & Nair, C. Sid. (2011). Student feedback : the cornerstone to an effective quality assurance system in higher education. Chandos Publishing.

Palmer, S. (n.d.). Modelling Engineering Student Academic Performance Using Academic Analytics*.

Porras, J. M., Lara, J. A., Romero, C., & Ventura, S. (2023). A Case-Study Comparison of Machine Learning Approaches for Predicting Student’s Dropout from Multiple Online Educational Entities. Algorithms, 16(12). https://doi.org/10.3390/a16120554

Prahani, B. K., Alfin, J., Fuad, A. Z., Saphira, H. V., Hariyono, E., & Suprapto, N. (2022). Learning Management System (LMS) Research During 1991–2021: How Technology Affects Education. International Journal of Emerging Technologies in Learning, 17(17), 28–49. https://doi.org/10.3991/ijet.v17i17.30763

Saluja, R., Rai, M., & Saluja, R. (2023). Designing new student performance prediction model using ensemble machine learning. Journal of Autonomous Intelligence, 6(1), 1–12. https://doi.org/10.32629/jai.v6i1.583

Tan, M., & Shao, P. (2015). Prediction of student dropout in E-learning program through the use of machine learning method. International Journal of Emerging Technologies in Learning, 10(1), 11–17. https://doi.org/10.3991/ijet.v10i1.4189

Wang, Q., & Mousavi, A. (2023). Which log variables significantly predict academic achievement? A systematic review and meta-analysis. In British Journal of Educational Technology (Vol. 54, Issue 1, pp. 142–191). John Wiley and Sons Inc. https://doi.org/10.1111/bjet.13282

Yousafzai, B. K., Afzal, S., Rahman, T., Khan, I., Ullah, I., Rehman, A. U., Baz, M., Hamam, H., & Cheikhrouhou, O. (2021). Student-performulator: Student academic performance using hybrid deep neural network. Sustainability (Switzerland), 13(17). https://doi.org/10.3390/su13179775

Zhen, Y., Luo, J. Der, & Chen, H. (2023). Prediction of Academic Performance of Students in Online Live Classroom Interactions - An Analysis Using Natural Language Processing and Deep Learning Methods. Journal of Social Computing, 4(1), 12–29. https://doi.org/10.23919/JSC.2023.0007

Zhou, Y., & Xu, Z. (2020). Multi-model stacking ensemble learning for dropout prediction in MOOCs. Journal of Physics: Conference Series, 1607(1). https://doi.org/10.1088/1742-6596/1607/1/012004

Downloads

Published

30-04-2025

How to Cite

Using LMS Log Data to Identify At-Risk Students: A Systematic Review of Machine Learning Approaches and Bibliographic Analysis. (2025). AFRICAN JOURNAL OF APPLIED RESEARCH, 11(2), 278-312. https://doi.org/10.26437/ajar.v11i2.1038

Most read articles by the same author(s)