On-Time Student Graduation Prediction Modeling: A Comparative Analysis of Naive Bayes Algorithm and Other Data Mining Classifications

Pemodelan Prediksi Kelulusan Mahasiswa Tepat Waktu: Analisis Komparatif Algoritma Naive Bayes Dan Klasifikasi Data Mining Lainnya

Authors

  • Achmad Ridwan Universitas Muhammadiyah Kudus
  • Tole Sutikno Universitas Ahmad Dahlan Yogyakarta
  • Imam Riyadi Universitas Ahmad Dahlan Yogyakarta
  • Widya Cholid Wahyudin Universitas Muhammadiyah Kudus

DOI:

https://doi.org/10.21070/joincs.v8i2.1679

Keywords:

student graduation prediction, educational data mining, machine learning, classification, Naive Bayes, comparative analysis

Abstract

Predicting the on-time graduation of university students is a crucial task in higher education institutions, enabling proactive support and improving institutional effectiveness. This paper presents a comparative analysis of several machine learning algorithms for predicting on-time graduation, with a specific focus on challenging the performance of the Naive Bayes (NB) algorithm. Although often used as a baseline model, the effectiveness of NB in the complex domain of educational data is frequently debated.

We compare NB with MultinomialNB and Decision Tree (DT), both widely favored in recent literature. Using a public dataset containing students' academic records, we follow the CRISP-DM methodology, incorporating feature selection and SMOTE to address class imbalance. The models are evaluated using accuracy, precision, recall, and F1-score metrics.

Our results show that while Decision Tree achieves the highest accuracy, Naive Bayes offers an appealing balance of performance, computational efficiency, and interpretability, making it a strong candidate for implementation in early warning systems at universities. This study provides empirical evidence on the role of Naive Bayes in the current landscape of educational data mining. The classification results show an accuracy of 0.82 for Naive Bayes, 0.81 for MultinomialNB, and 0.85 for Decision Tree.

References

[1] A. Sadqui, M. Ertel, H. Sadiki, and S. Amali, “Evaluating Machine Learning Models for Predicting Graduation Timelines in Moroccan Universities,” International Journal of Advanced Computer Science and Applications, vol. 14, no. 7, 2023.

[2] A. Salam and J. Zeniarja, “Classification of deep learning convolutional neural network feature extraction for student graduation prediction,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 32, no. 1, pp. 335–341, 2023.

[3] M. S. Pallavi, A. L. A. D. Urs, K. G. Ravikala, and V. Hegde, “Exploring the Impact of Financial Stress on Students' Academic Performance: A Machine Learning Approach,” in 2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT), 2024.

[4] A. Singh, K. Singh, A. Kumar, A. Shrivastava, and S. Kumar, “Optimizing Well-Being: Unveiling Eustress and Distress through Machine Learning,” in 2024 International Conference on Inventive Computation Technologies (ICICT), 2024.

[5] I. Dawar, S. Negi, S. Lamba, and A. Kumar, “Enhancing Student Academic Performance Forecasting: A Comparative Analysis of Machine Learning Algorithms,” SN Computer Science, vol. 5, no. 758, 2024.

[6] T. T. Tin, L. S. Hock, and O. M. Ikumapayi, “Educational Big Data Mining: Comparison of Multiple Machine Learning Algorithms in Predictive Modelling of Student Academic Performance,” International Journal of Advanced Computer Science and Applications, vol. 15, no. 6, 2024.

[7] E. Ahmed, “Student Performance Prediction Using Machine Learning Algorithms,” Applied Computational Intelligence and Soft Computing, vol. 2024, pp. 1–15, 2024.

[8] [8] A. Alfahid, “Algorithmic Prediction of Students On-Time Graduation from the University,” TEM Journal, vol. 13, no. 1, pp. 692–698, 2024.

[9] [9] T. Sureshkumar, J. Charanya, T. Kumaresan, G. Rajeshkumar, P. S. Kavin kumar, and B. Anuj, “Envisioning Educational Success Through Advanced Analytics and Intelligent Performance Prediction,” in 2024 10th International Conference on Communication and Signal Processing (ICCSP), 2024.

[10] U. Farooq et al., “Transforming educational insights: strategic integration of federated learning for enhanced prediction of student learning outcomes,” The Journal of Supercomputing, vol. 80, pp. 16334–16367, 2024.

[11] M. Beseiso, “Enhancing Student Success Prediction: A Comparative Analysis of Machine Learning Technique,” TechTrends, vol. 69, pp. 372–384, 2025.

[12] S. Verma, K. Kholiya, and K. Bala, “Leveraging Ensemble Model and Optimized Feature Selection to Boost Prediction Accuracy in Educational Data Mining,” SN Computer Science, vol. 6, no. 487, 2025.

[13] H. Herianto, B. Kurniawan, Z. H. Hartomi, Y. Irawan, and M. K. Anam, “Machine Learning Algorithm Optimization using Stacking Technique for Graduation Prediction,” Journal of Applied Data Sciences, vol. 5, no. 3, pp. 1272–1285, 2024.

[14] S. Zhao, D. Zhou, H. Wang, D. Chen, and L. Yu, “Enhancing Student Academic Success Prediction Through Ensemble Learning and Image-Based Behavioral Data Transformation,” Applied Sciences, vol. 15, no. 1231, 2025.

[15] Z. Khoudi, N. Hafidi, M. Nachaoui, and S. Lyaqini, “Leveraging machine learning and clickstream data to improve student performance prediction in virtual learning environments,” Information Discovery and Delivery, 2025.

[16] V. Nakhipova, Y. Kerimbekov, Z. Umarova, H. I. Bulbul, L. Suleimenova, and E. Adylbekova, “Integration of Collaborative Filtering Into Naive Bayes Method to Enhance Student Performance Prediction,” International Journal of Information and Communication Technology Education, vol. 20, no. 1, 2024.

[17] A. A. M. E. Hussen and A. Saikhu, “Modeling Of Student Graduation Prediction Using the Naive Bayes Classifier Algorithm,” in 2024 3rd International Conference on Creative Communication and Innovative Technology (ICCIT), 2024.

[18] T. S. Devkishan, S. K. Singh, and A. K. Bharti, “Ensemble Learning for Student Performance Assessment: Identifying and Analyzing Significant Affecting Factors in Higher Education,” in 2024 7th International Conference on Contemporary Computing and Informatics (IC3I), 2024.

[19] H. Setiadi, K. Sanjaya, A. Wijayanto, D. W. Wardhani, and H. D. Cahyono, “Comparative Analysis of Classification Algorithms Using Feature Selection Techniques to Predict On-Time Student Graduation,” Ingénierie des Systèmes d'Information, vol. 29, no. 4, pp. 1365–1379, 2024.

[20] T. G. Wijaya, O. N. Pratiwi, and I. Darmawan, “Implementation of CRISP-DM to Predict Student Graduation on Time Using Naïve Bayes Algorithm,” in 2024 International Conference on Intelligent Cybernetics Technology & Applications (ICICYTA), 2024.

[21] A. Harif and M. A. Kassimi, “Predictive Modeling of Student Performance Using RFECV-RF for Feature Selection and Machine Learning Techniques,” International Journal of Advanced Computer Science and Applications, vol. 15, no. 7, 2024.

[22] K. Roy and D. M. Farid, “An Adaptive Feature Selection Algorithm for Student Performance Prediction,” IEEE Access, vol. 12, pp. 75577-75598, 2024.

[23] U. Smaragdina, D. D. Prasetya, G. K. Cakir, and U. Pujianto, “A Comparative Study of Machine Learning Algorithms for Graduation Prediction in Electrical Engineering and Informatics Department,” in 2024 Ninth International Conference on Informatics and Computing (ICIC), 2024.

[24] A. Durak and V. Bulut, “Classification and prediction-based machine learning algorithms to predict students’ low and high programming performance,” Computer Applications in Engineering Education, vol. 32, 2024.

[25] A. Hafizha, “Prediksi Kelulusan Mahasiswa” [Dataset], Kaggle, 2023. [Online]. Available: https://www.kaggle.com/datasets/hafizhathallah/kelulusan-mahasiswa

Downloads

Published

2025-11-30

How to Cite

Achmad Ridwan, Sutikno, T., Riyadi, I., & Cholid Wahyudin, W. (2025). On-Time Student Graduation Prediction Modeling: A Comparative Analysis of Naive Bayes Algorithm and Other Data Mining Classifications: Pemodelan Prediksi Kelulusan Mahasiswa Tepat Waktu: Analisis Komparatif Algoritma Naive Bayes Dan Klasifikasi Data Mining Lainnya. JOINCS (Journal of Informatics, Network, and Computer Science), 8(2), 128–135. https://doi.org/10.21070/joincs.v8i2.1679