Affiliations 

  • 1 Hospital Tuanku Ampuan Najihah, Jalan Melang, 72000, Kuala Pilah, Negeri Sembilan, Malaysia
  • 2 Institute for Tropical Biology and Conservation, University Malaysia Sabah, Jalan UMS, 88400, Kota Kinabalu, Sabah, Malaysia. songquan.ong@ums.edu.my
  • 3 Hospital Tuanku Ja'afar, Jalan Rasah, 70300, Seremban, Negeri Sembilan, Malaysia
Sci Rep, 2025 Jan 24;15(1):3131.
PMID: 39856094 DOI: 10.1038/s41598-024-80538-4

Abstract

The COVID-19 pandemic has burdened healthcare systems globally. To curb high hospital admission rates, only patients with genuine medical needs are admitted. However, machine learning (ML) models to predict COVID-19 hospitalization in Asian children are lacking. This study aimed to develop and validate ML models to predict pediatric COVID-19 hospitalization. We collected secondary data with 2200 patients and 65 variables from Malaysian aged 0 to 12 with COVID-19 between 1st February 2020 and 31st March 2022. The sample was partitioned into training, internal, and external validation groups. Recursive Feature Elimination (RFE) was employed for feature selection, and we trained seven supervised classifiers. Grid Search was used to optimize the hyperparameters of each algorithm. The study analyzed 1988 children and 30 study variables after data were processed. The RFE algorithm selected 12 highly predicted variables for COVID-19 hospitalization, including age, male sex, fever, cough, rhinorrhea, shortness of breath, vomiting, diarrhea, seizures, body temperature, chest indrawing, and abnormal breath sounds. With external validation, Adaptive Boosting was the highest-performing classifier (AUROC = 0.95) to predict COVID-19 hospital admission in children. We validated AdaBoost as the best to predict COVID-19 hospitalization among children. This model may assist front-line clinicians in making medical disposition decisions.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.