Affiliations 

  • 1 Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Kajang, Selangor, Malaysia
  • 2 Cataclysmic Management and Sustainable Development Research Group (CAMSDE), Department of Civil Engineering, Kulliyyah of Engineering, International Islamic University Malaysia, Malaysia
  • 3 Department of Civil Engineering, College of Engineering, Qassim University, Unaizah, Saudi Arabia
  • 4 Faculty of Engineering & Quantity Surveying, INTI International University (INTI-IU), Persiaran Perdana BBN, Putra Nilai, Nilai, 71800, Negeri Sembilan, Malaysia
  • 5 Civil and Environmental Eng. Dept., College of Engineering, United Arab Emirates University, Al Ain, 15551, United Arab Emirates
  • 6 Department of Civil Engineering, Faculty of Engineering, University of Malaya (UM), 50603, Kuala Lumpur, Malaysia
Heliyon, 2023 Jul;9(7):e17689.
PMID: 37456046 DOI: 10.1016/j.heliyon.2023.e17689

Abstract

Accurate water level prediction for both lake and river is essential for flood warning and freshwater resource management. In this study, three machine learning algorithms: multi-layer perceptron neural network (MLP-NN), long short-term memory neural network (LSTM) and extreme gradient boosting XGBoost were applied to develop water level forecasting models in Muda River, Malaysia. The models were developed using limited amount of daily water level and meteorological data from 2016 to 2018. Different input scenarios were tested to investigate the performance of the models. The results of the evaluation showed that the MLP model outperformed both the LSTM and the XGBoost models in predicting water levels, with an overall accuracy score of 0.871 compared to 0.865 for LSTM and 0.831 for XGBoost. No noticeable improvement has been achieved after incorporating meteorological data into the models. Even though the lowest reported performance was reported by the XGBoost, it is the faster of the three algorithms due to its advanced parallel processing capabilities and distributed computing architecture. In terms of different time horizons, the LSTM model was found to be more accurate than the MLP and XGBoost model when predicting 7 days ahead, demonstrating its superiority in capturing long-term dependencies. Therefore, it can be concluded that each ML model has its own merits and weaknesses, and the performance of different ML models differs on each case because these models depends largely on the quantity and quality of data available for the model training.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.