Feature-Enhanced Erroneous Outlier Detection in Hydrological Time Series Using Ensemble Methods
Date
2026
Authors
Kuhaneswaran, B.
Sorwar, G.
Alaei, A.R.
Tong, F.
Type
Journal article
Citation
Water (Switzerland), 2026; 18(4):446-1-446-29
Statement of Responsibility
Banujan Kuhaneswaran, Golam Sorwar, Ali Reza Alaei, and Feifei Tong
Abstract
Data quality issues in hydrological time series directly affect hydrological modelling applications, including flood forecasting and water resource management. A critical challenge in hydrological monitoring is distinguishing erroneous outliers, caused by sensor malfunctions or data transmission errors, from natural extreme events such as floods. The two exhibit similar statistical characteristics but require opposite treatments in forecasting models. Current detection practices rely on generic algorithms without systematic validation or adaptation to hydrological temporal dependencies, which limits their effectiveness in operational contexts. This study addresses these gaps through a comprehensive framework for detecting erroneous outliers in daily hydrological time series. We engineered 19 features that capture temporal dependencies and hydrological patterns, and reduced them to six key features covering raw measurements, temporal patterns, and hydrological dynamics. We evaluated 13 detection algorithms across three categories: statistical methods (e.g., Extreme Studentised Deviate and Hampel filter), machine learning (ML) approaches (e.g., Isolation Forest and Local Outlier Factor), and feature-enhanced variants. Three data-driven ensemble strategies were developed: Accurate (maximising F1-score), Diverse (balancing performance with method diversity), and Fast (prioritising computational efficiency). The framework was validated by injecting controlled outliers into recorded hydrological data from five gauge stations in the Tweed River catchment, Australia. The ensemble methods achieved satisfactory F1-scores (0.6–0.9) in detecting the erroneous outliers, and statistical testing identified the top-performing detection algorithms. The framework provides a validated tool for quality control in hydrological analysis, with potential applications in drought monitoring and flood forecasting systems.
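The validation idea described in the abstract (inject controlled outliers into a series, run a detector, score it with F1) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the Hampel filter is one of the statistical methods the abstract names, but the window size, threshold, spike magnitude, synthetic sine-wave series, and the function names `hampel_flags`, `inject_spikes`, and `f1_score` are all assumptions made for illustration.

```python
import numpy as np


def hampel_flags(x, half_window=3, n_sigmas=3.0):
    """Flag points that deviate from the rolling median by more than
    n_sigmas scaled MADs (a basic Hampel filter; parameters are assumed)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    flags = np.zeros(n, dtype=bool)
    k = 1.4826  # makes the MAD comparable to a standard deviation for Gaussian data
    for i in range(n):
        lo = max(0, i - half_window)
        hi = min(n, i + half_window + 1)  # window is truncated at the series edges
        window = x[lo:hi]
        med = np.median(window)
        mad = k * np.median(np.abs(window - med))
        if mad > 0 and abs(x[i] - med) > n_sigmas * mad:
            flags[i] = True
    return flags


def inject_spikes(x, n_outliers, magnitude, rng):
    """Add +/- magnitude spikes at random positions; return the modified
    series and a boolean ground-truth mask (hypothetical injection scheme)."""
    y = np.asarray(x, dtype=float).copy()
    idx = rng.choice(len(y), size=n_outliers, replace=False)
    signs = rng.choice([-1.0, 1.0], size=n_outliers)
    y[idx] += signs * magnitude
    truth = np.zeros(len(y), dtype=bool)
    truth[idx] = True
    return y, truth


def f1_score(truth, pred):
    """F1 = harmonic mean of precision and recall on boolean masks."""
    tp = np.sum(truth & pred)
    fp = np.sum(~truth & pred)
    fn = np.sum(truth & ~pred)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return float(2 * precision * recall / (precision + recall))


# Synthetic daily "water level" series (an assumption, not gauge data).
rng = np.random.default_rng(0)
t = np.arange(365)
clean = 2.0 + 0.5 * np.sin(2 * np.pi * t / 365)
noisy, truth = inject_spikes(clean, n_outliers=10, magnitude=5.0, rng=rng)
pred = hampel_flags(noisy)
print(f"F1 on synthetic series: {f1_score(truth, pred):.2f}")
```

Because the injected positions are known, the ground-truth mask makes precision, recall, and F1 directly computable, which is what allows different detectors or ensembles to be compared on the same series.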
Rights
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.