Affiliations 

  • 1 Advanced Analytics Engineering Centre, Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, 40450, Shah Alam, Selangor, Malaysia
  • 2 Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Cawangan Kelantan, Kampus Kota Bharu, Lembah Sirrh, 15050, Kota Bharu, Kelantan, Malaysia
  • 3 Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA
  • 4 Vector Borne Disease Sector, Disease Control Division, Ministry of Health Malaysia, Level 4, Block E10, Complex E, Federal Government Administration Complex, 62590, Putrajaya, Malaysia
  • 5 Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX, 76107, USA. mdubydul.haque@unthsc.edu
Sci Rep, 2021 01 13;11(1):939.
PMID: 33441678 DOI: 10.1038/s41598-020-79193-2

Abstract

Dengue fever is a mosquito-borne disease that affects nearly 3.9 billion people globally. Dengue remains endemic in Malaysia since its outbreak in the 1980's, with its highest concentration of cases in the state of Selangor. Predictors of dengue fever outbreaks could provide timely information for health officials to implement preventative actions. In this study, five districts in Selangor, Malaysia, that demonstrated the highest incidence of dengue fever from 2013 to 2017 were evaluated for the best machine learning model to predict Dengue outbreaks. Climate variables such as temperature, wind speed, humidity and rainfall were used in each model. Based on results, the SVM (linear kernel) exhibited the best prediction performance (Accuracy = 70%, Sensitivity = 14%, Specificity = 95%, Precision = 56%). However, the sensitivity for SVM (linear) for the testing sample increased up to 63.54% compared to 14.4% for imbalanced data (original data). The week-of-the-year was the most important predictor in the SVM model. This study exemplifies that machine learning has respectable potential for the prediction of dengue outbreaks. Future research should consider boosting, or using, nature inspired algorithms to develop a dengue prediction model.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.