Predictive model for students’ academic performance using machine learning approach
Abstract
The early prediction of students' academic performance using machine learning has emerged as a valuable approach for identifying at-risk learners and enabling timely intervention. Many factors, such as students' academic background, prior performance, institutional policies, and learning environment, influence educational outcomes; their complex interplay remains inadequately understood in many contexts. This study aimed to explore the effectiveness of machine learning algorithms in predicting students' academic performance at Adamawa State University, Mubi, Adamawa State, Nigeria. This study used 1,730 datasets from the academic records of first-year students from the Faculty of Science for the 2024/2025 academic session. The study split the datasets into 80% training and 20% for testing. Data were analysed using the Waikato Environment for Knowledge Analysis (Weka) and Python. The model was evaluated using students' cumulative grade point averages (CGPAs) from the academic session results. The machine learning algorithms used were Logistic Regression (LR), Random Forest (RF), Decision Trees (DT), Naïve Bayesian (NB), and Support Vector Machines (SVM). Experimental results based on various performance metrics indicate that the SVM model achieved the best result with an accuracy of 0.92, precision of 0.92, recall of 0.93, and F1-score of 0.93. The results revealed that the SVM approach outperforms individual benchmark methods and provides robust insight into factors that determine academic success. The findings offer evidence-based guidance for educators, departments, faculties, institutional management, and policymakers to design targeted interventions to improve learning outcomes.
Copyright (c) 2026 Wadzani Aduwamai Gadzama, Ogah Stephen Ugbowu, Lucy Bulus Dalhatu

This work is licensed under a Creative Commons Attribution 4.0 International License.
References
[1]Daniel B. Big Data and analytics in higher education: Opportunities and challenges. British Journal of Educational Technology. 2015; 46(5): 904–920. doi: 10.1111/bjet.12230
[2]Namoun A, Alshanqiti A. Predicting Student Performance Using Data Mining and Learning Analytics Techniques: A Systematic Literature Review. Applied Sciences. 2020; 11(1): 237. doi: 10.3390/app11010237
[3]Ofori F, Maina E, Gitonga R. Using Machine Learning Algorithms to Predict Students’ Performance and Improve Learning Outcome: A Literature Based Review. Journal of Information, Technology and Data Science. 2020; 4(1): 33–55. Available online: https://stratfordjournalpublishers.org/journals/index.php/Journal-of-Information-and-Techn/article/view/480
[4]Belachew EB, Gobena FA. Student Performance Prediction Model using Machine Learning Approach: The Case of Wolkite University. International Journal of Advanced Research in Computer Science and Software Engineering. 2017; 7(2): 46–50.
[5]Alsariera YA, Baashar Y, Alkawsi G, et al. Assessment and Evaluation of Different Machine Learning Algorithms for Predicting Student Performance. Computational Intelligence and Neuroscience. 2022; 2022: 1–11. doi: 10.1155/2022/4151487
[6]Shahiri AM, Husain W, Rashid NA. A Review on Predicting Student’s Performance Using Data Mining Techniques. Procedia Computer Science. 2015; 72: 414–422. doi: 10.1016/j.procs.2015.12.157
[7]Lakkaraju H, Aguiar E, Shan C, et al. A Machine Learning Framework to Identify Students at Risk of Adverse Academic Outcomes. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 10 August 2015; Sydney, NSW, Australia. pp. 1909–1918. doi: 10.1145/2783258.2788620
[8]Ashraf A, Anwer S, Gufran KM. A Comparative study of predicting student’s performance by use of data mining techniques. American Scientific Research Journal for Engineering, Technology, and Sciences. 2018; 44(1): 122–136. Available online: https://www.asrjetsjournal.org/index.php/American_Scientific_Journal/article/view/4170
[9]Mueen A, Zafar B, Manzoor U. Modeling and Predicting Students’ Academic Performance Using Data Mining Techniques. International Journal of Modern Education and Computer Science. 2016; 8(11): 36–42. doi: 10.5815/ijmecs.2016.11.05
[10]Abubakar AB, Gabi D, Garba M, et al. Hybrid predictive model for students’ academic performance based on machine learning approach. FUDMA Journal of Sciences. 2025; 9(4): 215–222. doi: 10.33003/fjs-2025-0904-3004
[11]Bhutto ES, Siddiqui IF, Arain QA, et al. Predicting Students’ Academic Performance Through Supervised Machine Learning. In: Proceedings of the 2020 International Conference on Information Science and Communication Technology (ICISCT); 8–9 February 2020; Karachi, Pakistan. pp. 1–6. doi: 10.1109/ICISCT49550.2020.9080033
[12]Osemwegie EE, Amadin FI, Uduehi OM. Student dropout prediction using machine learning. FUDMA Journal of Sciences. 2023; 7(6): 347–353. doi: 10.33003/fjs-2023-0706-2103
[13]Alamri LH, S. Almuslim R, S. Alotibi M, et al. Predicting Student Academic Performance using Support Vector Machine and Random Forest. In: Proceedings of the 2020 3rd International Conference on Education Technology Management; 17–19 December 2020; London, UK. pp. 100–107. doi: 10.1145/3446590.3446607
[14]Hussain S, Khan MQ. Student-Performulator: Predicting Students’ Academic Performance at Secondary and Intermediate Level Using Machine Learning. Annals of Data Science. 2023; 10(3): 637–655. doi: 10.1007/s40745-021-00341-0
[15]Chen M, Liu Z. Predicting performance of students by optimizing tree components of random forest using genetic algorithm. Heliyon. 2024; 10(12): e32570. doi: 10.1016/j.heliyon.2024.e32570
[16]Bello U. Predictive Modeling of Student Academic Performance Using Machine Learning Techniques: A Statistical Approach. International Journal of Innovation in Mathematics, Statistics & Energy Policies. 2025; 13(4), 79–91. doi: 10.5281/ZENODO.17478207
[17]Bello A, Ajibade I, Wonuola I, et al. Developing a predictive model for student academic performance using machine learning techniques. International Journal of Science and Research Archive. 2025; 16(3): 219–234. doi: 10.30574/ijsra.2025.16.3.2539
[18]Alkan BB, Kuzucuk S, Alkan N, et al. Using machine learning to predict student outcomes for early intervention and formative assessment. Scientific Reports. 2025; 15(1): 39797. doi: 10.1038/s41598-025-23409-w
[19]Shoukath TK, Midhunchakkravarthy. A Study on Predictive Modelling of Student Academic Performance using Machine Learning Method. Journal of Information Systems Engineering and Management. 2024; 10(1): 78–88. doi: 10.52783/jisem.v10i1s.103
[20]Amjad A, Ullah M, Khan MT, et al. Impact of Artificial Intelligence-Based Predictive Analytics on Improving Academic Performance in Pakistani Universities: The Moderating Role of Digital Literacy. Spectrum of Engineering Sciences. 2026; 4(3): 1–12.
[21]Guevara-Reyes R, Ortiz-Garcés I, Andrade R, et al. Machine learning models for academic performance prediction: interpretability and application in educational decision-making. Frontiers in Education. 2025; 10: 1632315. doi: 10.3389/feduc.2025.1632315
[22]Bouallegue S, Omri A, Al-Naemi S. Machine Learning Approaches for Early Student Performance Prediction in Programming Education. Information. 2026; 17(1): 60. doi: 10.3390/info17010060
[23]Suleiman IB. Performance of linear regression in predicting academic outcome. Journal of Electrical Systems and Information Technology. 2026; 13(1): 24. doi: 10.1186/s43067-026-00325-1
[24]Adedeji OB, Asanre AA, Omilabu AA, et al. Early Prediction of Student Academic Performance Using Machine Learning. Indonesian Journal of Pedagogy and Teacher Education. 2026; 4(1): 31–40. doi: 10.58723/ijopate.v4i1.625
[25]Zhou H, Shen S, Su Y, et al. LLM-EPSP: Large language model empowered early prediction of student performance. Information Processing & Management. 2026; 63(1): 104351. doi: 10.1016/j.ipm.2025.104351
[26]Kukkar A, Mohana R, Sharma A, et al. A novel methodology using RNN + LSTM + ML for predicting student’s academic performance. Education and Information Technologies. 2024; 29(11): 14365–14401. doi: 10.1007/s10639-023-12394-0
[27]Varnavsky AN, Romanova II. Using Brain-Computer Interface for Measuring Indicators of Cognitive Processes (A Case of Doing Educational Reading). In: Proceedings of the 2020 IEEE 40th International Conference on Electronics and Nanotechnology (ELNANO); 22–24 April 2020; Kyiv, Ukraine. pp. 484–487. doi: 10.1109/ELNANO50318.2020.9088759
[28]Gokalp M. Compare the Effect of Mastery Learning and Constructivist Approaches to the Academic Achievements of Teachers. International Journal of Management and Applied Science. 2017; 3(1): 123–128. Available online: https://www.iraj.in/journal/journal_file/journal_pdf/14-341-1489657999123-128.pdf
[29]Tinto V. Research and Practice of Student Retention: What Next? Journal of College Student Retention: Research, Theory & Practice. 2006; 8(1): 1–19. doi: 10.2190/4YNU-4TMB-22DJ-AN4W
[30]Sule BO, Saporu FWO. A logistic regression model of students’ academic performance in University of Maiduguri, Maiduguri, Nigeria. Mathematical Theory and Modeling. 2015; 5(10): 124–136. Available online: https://www.iiste.org/Journals/index.php/MTM/article/view/25766
[31]Balabied SAA, Eid HF. Utilizing random forest algorithm for early detection of academic underperformance in open learning environments. PeerJ Computer Science. 2023; 9: e1708. doi: 10.7717/peerj-cs.1708

