Affiliations 

  • 1 Department of Civil Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman, Jalan Sg. Long, Bandar Sg. Long, 43000, Kajang, Selangor, Malaysia
  • 2 Faculty of Engineering & Quantity Surveying, INTI International University (INTI-IU), Persiaran Perdana BBN, Putra Nilai, Nilai, 71800, Negeri Sembilan, Malaysia
  • 3 Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional, Kajang, 43000, Selangor, Malaysia
  • 4 School of Civil Engineering, College of Engineering, Universiti Teknologi MARA (UiTM), 40450, Shah Alam, Selangor, Malaysia
  • 5 Department of Environmental Engineering, Faculty of Engineering & Green Technology, Universiti Tunku Abdul Rahman, Jalan Universiti, Bandar Barat, 31900, Kampar, Perak, Malaysia
  • 6 National Water and Energy Center, United Arab Emirates University, Al Ain, P.O. Box, 15551, United Arab Emirates
  • 7 Department of Civil Engineering, Faculty of Engineering, University of Malaya (UM), 50603, Kuala Lump, Malaysia
Heliyon, 2023 Sep;9(9):e19426.
PMID: 37662729 DOI: 10.1016/j.heliyon.2023.e19426

Abstract

In consideration of the distinct behavior of machine learning (ML) algorithms, six well-defined ML used were carried out in this study for predicting sea level on a day-to-day basis. Data compiled from 1985 to 2018 was utilized for training and testing the developed models. An assessment of the multiple statistics-driven regression algorithms resulted such that each tested location was associated with a particular preferred model. The following were the developed best models for their respective study areas: In Peninsular Malaysia, the interactions linear regression model was the best at Pulau Langkawi (RMSE = 19.066), the Matern 5/2 gaussian process regression model at Geting (RMSE = 49.891), and the trilayered artificial neural network at Pulau Pinang (RMSE = 20.026), while the linear regression model was the best at Sandakan in Sabah, East Malaysia (RMSE = 14.054). Other metrics, such as MAE and R-square, were also at their best values, each providing its best values, further substantiating the RMSE respectively, at each of the study areas. These empirical statistics (or metrics) also revealed that despite employing sea level as the sole parameter, results obtained were exceptional better when utilizing a 7-day lag, regardless of the model used. Notably, lag variables with less than a 7-day lag could degrade the model's accuracy in representing ground reality. The study emphasizes the importance of thorough training and testing of ML to aid decision-makers in developing mitigation actions for the climate change phenomena of sea level rise through reliable ML.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.