Affiliations 

  • 1 Lovely Professional University, Phagwara, 144411, Punjab, India
  • 2 College of Arts and Science, Applied Science University, Al Ekir, 3201, Kingdom of Bahrain
  • 3 Management and Science University, Shah Alam, 40100, Selangor, Malaysia
  • 4 Hourani Center for Applied Scientific Research, Al-Ahliyya Amman University, Amman, 19328, Jordan
  • 5 Department of Mechanical and Production Engineering, Guru Nanak Dev Engineering College, Ludhiana, 141006, Punjab, India
  • 6 Department of Computer Science, College of Computer Science, King Khalid University, Abha, 61421, Asir, Saudi Arabia
  • 7 Department of Mechanical Engineering, Faculty of Engineering, Universitas Diponegoro, Semarang, 50275, Central Java, Indonesia. imamammarullah@gmail.com
Sci Rep, 2025 Mar 24;15(1):10172.
PMID: 40128279 DOI: 10.1038/s41598-025-94642-6

Abstract

Higher education is essential because it exposes students to a variety of areas. The academic performance of IT students is crucial and might fail if it isn't documented to identify the features influencing them, as well as their strengths and shortcomings. The student academic prediction system needs to be enhanced so that teachers can forecast their students' performance. Numerous studies have been conducted to increase the prediction accuracy of IT students, but they encountered difficulties with unbalanced data and algorithm tuning. To address these issues, the study proposed different machine learning (ML) algorithms that handled imbalanced data by applying the synthetic minority oversampling technique (SMOTE) and employing hyperparameter tuning algorithms to enhance prediction during the training process. The ML models we used were decision tree (DT), k-nearest neighbor, and XGBoost. The models were fine-tuned by applying Ant colony optimization (ACO) and artificial bee colony optimization techniques. Subsequently, these optimization techniques further enhanced the performance of the models. After comparing them, the results showed that SMOTE and ACO combined with the DT model outperformed other models for academic prediction. Additionally, the study utilized the Kendall Tau correlation coefficient technique to analyze the correlation between features and identify factors that positively or negatively impact student success.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.