Skip to main navigation Skip to search Skip to main content

Advancing water level prediction using clustering-based machine learning techniques in data-scarce regions

  • Sang Hyun Lee
  • , Taeil Jang*
  • *Corresponding author for this work
  • Jeonbuk National University

Research output: Contribution to journalJournal articlepeer-review

Abstract

Accurate and scalable water level forecasting is essential for effective water resources management, particularly in regions with limited long-term records. We present a clustering-based framework for one- and three-day-ahead water level prediction in the Saemangeum Watershed, South Korea. Twenty-five monitoring stations were grouped into six hydrologically similar clusters using k-means clustering with wavelet-entropy features. Within each cluster, multilayer perceptron (MLP) models were trained using two strategies: (1) training only at the centroid station and (2) training at the station with the longest record in each cluster. The longest-record strategy showed strong agreement with observations, achieving mean Nash–Sutcliffe efficiency and root-mean-square error values of 0.97 and 0.06 for one-day-ahead forecasts, and 0.83 and 0.14 for three-day-ahead forecasts across all stations. By training one MLP per cluster and transferring it to all member stations, the framework reduces computational cost and provides a practical solution for large-scale water level forecasting in data-scarce environments.

Original languageEnglish
Article number106899
JournalEnvironmental Modelling and Software
Volume198
DOIs
StatePublished - 2026.03

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 6 - Clean Water and Sanitation
    SDG 6 Clean Water and Sanitation

Keywords

  • Clustering
  • Data-scarce
  • Long-term time-series
  • MLP
  • Water level forecasting
  • Wavelet transform

Fingerprint

Dive into the research topics of 'Advancing water level prediction using clustering-based machine learning techniques in data-scarce regions'. Together they form a unique fingerprint.

Cite this