Affiliations 

  • 1 School of Civil Engineering, Universiti Sains Malaysia, Nibong Tebal, Penang 14300, Malaysia E-mail: kailunyeoh@student.usm.my
  • 2 River Engineering and Urban Drainage Research Centre, Universiti Sains Malaysia, Nibong Tebal, Penang 14300, Malaysia
  • 3 School of Civil Engineering, Universiti Sains Malaysia, Nibong Tebal, Penang 14300, Malaysia
Water Sci Technol, 2023 Jul;88(1):75-91.
PMID: 37452535 DOI: 10.2166/wst.2023.193

Abstract

Short-term streamflow prediction is essential for managing flood early warning and water resources systems. Although numerical models are widely used for this purpose, they require various types of data and experience to operate the model and often tedious calibration processes. Under the digital revolution, the application of data-driven approaches to predict streamflow has increased in recent decades. In this work, multiple linear regression (MLR) and random forest (RF) models with three different input combinations are developed and assessed for multi-step ahead short-term streamflow predictions, using 14 years of hydrological datasets from the Kulim River catchment, Malaysia. Introducing more precedent streamflow events as predictor improves the performance of these data-driven models, especially in predicting peak streamflow during the high-flow event. The RF model (Nash-Sutcliffe efficiency (NSE): 0.599-0.962) outperforms the MLR model (NSE: 0.584-0.963) in terms of overall prediction accuracy. However, with the increasing lead-time length, the models' overall prediction accuracy on the arrival time and magnitude of peak streamflow decrease. These findings demonstrate the potential of decision tree-based models, such as RF, for short-term streamflow prediction and offer insights into enhancing the accuracy of these data-driven models.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.