Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction

Read original: arXiv:2406.07484 - Published 6/12/2024 by Bekir Z. Demiray, Ibrahim Demir

🔮

Overview

This study explores the use of a Transformer model for predicting 120-hour streamflow across 125 diverse locations in Iowa, US.
The model utilizes data from the preceding 72 hours, including precipitation, evapotranspiration, and discharge values, to make predictions.
The Transformer model's performance is benchmarked against three deep learning models (LSTM, GRU, and Seq2Seq) and the Persistence approach.
Evaluation metrics include Nash-Sutcliffe Efficiency (NSE), Kling-Gupta Efficiency (KGE), Pearson's r, and Normalized Root Mean Square Error (NRMSE).

Plain English Explanation

The researchers in this study wanted to find a way to accurately predict how much water will flow in rivers and streams over the next 5 days (120 hours) across many different locations in Iowa. To do this, they developed a machine learning model called a Transformer. This Transformer model used information about the weather and water levels from the previous 3 days to make its predictions.

The researchers compared the Transformer model's performance to three other popular machine learning models (LSTM, GRU, and Seq2Seq) as well as a simple "Persistence" approach that just assumes the water levels will stay the same. They used several different metrics to evaluate how well each model did, including measures of how closely the predictions matched the actual observed water levels.

The key finding was that the Transformer model outperformed the other approaches, producing more accurate predictions that better captured the variability in water levels across the different locations in Iowa. This suggests the Transformer model has promise as an advanced tool for hydrological modeling and streamflow forecasting, offering significant improvements over traditional methods.

Technical Explanation

The researchers developed a Transformer model to predict 120-hour streamflow across 125 diverse locations in Iowa, USA. They utilized data from the preceding 72 hours, including precipitation, evapotranspiration, and discharge values, to train their generalized model. This approach contrasts with traditional methods that rely on location-specific models.

To evaluate the Transformer model's performance, the researchers benchmarked it against three deep learning models: LSTM, GRU, and Seq2Seq. They also included the Persistence approach as a baseline. The evaluation metrics used were Nash-Sutcliffe Efficiency (NSE), Kling-Gupta Efficiency (KGE), Pearson's r, and Normalized Root Mean Square Error (NRMSE).

The results showed the Transformer model had superior performance, maintaining higher median NSE and KGE scores and exhibiting the lowest NRMSE values compared to the other models. This indicates the Transformer model's capability to accurately simulate and predict streamflow, adapting effectively to varying hydrological conditions and geographical variances.

The researchers' findings suggest the Transformer model has significant potential as an advanced tool in hydrological modeling, offering improvements over traditional approaches like those discussed in this paper on snowmelt-driven streamflow forecasting and this paper on transformer-based multi-step water level forecasting. The study also complements research on parsimonious streamflow forecasting using CNNs and LSTMs and routing river water in land surface models.

Critical Analysis

The paper provides a thorough evaluation of the Transformer model's performance, but there are a few potential areas for further research and consideration:

Explainability: While the Transformer model demonstrated superior predictive accuracy, the paper does not delve into the model's interpretability or explainability. Understanding the key factors driving the Transformer's performance could lead to further insights and potential model improvements.
Generalization: The study focused on 125 locations in Iowa, so additional research would be needed to assess the Transformer model's ability to generalize to other regions or watersheds with different hydrological characteristics.
Operational Feasibility: The paper does not discuss the computational requirements or real-world implementation challenges of using the Transformer model for operational streamflow forecasting. These practical considerations would be important to address before the model could be widely adopted.
Ensemble Modeling: Combining the Transformer model with other approaches, as explored in this paper on multi-channel spatial-temporal transformer models for traffic forecasting, could potentially lead to further performance gains and more robust forecasting.

Overall, this study offers promising results for the application of Transformer models in hydrological modeling and streamflow prediction. However, additional research is needed to fully understand the model's capabilities, limitations, and practical implementation challenges.

Conclusion

This study demonstrates the efficacy of a Transformer model for 120-hour streamflow prediction across diverse locations in Iowa, USA. The model's superior performance, as evidenced by higher median NSE and KGE scores and lower NRMSE values, highlights its potential as an advanced tool in hydrological modeling. The findings suggest the Transformer model can adapt effectively to varying hydrological conditions and geographical variances, offering significant improvements over traditional and contemporary approaches.

While the study provides a solid foundation, further research is needed to address aspects such as model explainability, generalization to other regions, operational feasibility, and potential benefits of ensemble modeling. Nonetheless, this work contributes valuable insights to the ongoing efforts in improving streamflow forecasting and strengthening hydrological modeling capabilities.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔮

Towards Generalized Hydrological Forecasting using Transformer Models for 120-Hour Streamflow Prediction

Bekir Z. Demiray, Ibrahim Demir

This study explores the efficacy of a Transformer model for 120-hour streamflow prediction across 125 diverse locations in Iowa, US. Utilizing data from the preceding 72 hours, including precipitation, evapotranspiration, and discharge values, we developed a generalized model to predict future streamflow. Our approach contrasts with traditional methods that typically rely on location-specific models. We benchmarked the Transformer model's performance against three deep learning models (LSTM, GRU, and Seq2Seq) and the Persistence approach, employing Nash-Sutcliffe Efficiency (NSE), Kling-Gupta Efficiency (KGE), Pearson's r, and Normalized Root Mean Square Error (NRMSE) as metrics. The study reveals the Transformer model's superior performance, maintaining higher median NSE and KGE scores and exhibiting the lowest NRMSE values. This indicates its capability to accurately simulate and predict streamflow, adapting effectively to varying hydrological conditions and geographical variances. Our findings underscore the Transformer model's potential as an advanced tool in hydrological modeling, offering significant improvements over traditional and contemporary approaches.

6/12/2024

Comparative Analysis on Snowmelt-Driven Streamflow Forecasting Using Machine Learning Techniques

Ukesh Thapa, Bipun Man Pati, Samit Thapa, Dhiraj Pyakurel, Anup Shrestha

The rapid advancement of machine learning techniques has led to their widespread application in various domains including water resources. However, snowmelt modeling remains an area that has not been extensively explored. In this study, we propose a state-of-the-art (SOTA) deep learning sequential model, leveraging the Temporal Convolutional Network (TCN), for snowmelt-driven discharge modeling in the Himalayan basin of the Hindu Kush Himalayan Region. To evaluate the performance of our proposed model, we conducted a comparative analysis with other popular models including Support Vector Regression (SVR), Long Short Term Memory (LSTM), and Transformer. Furthermore, Nested cross-validation (CV) is used with five outer folds and three inner folds, and hyper-parameter tuning is performed on the inner folds. To evaluate the performance of the model mean absolute error (MAE), root mean square error (RMSE), R square ($R^{2}$), Kling-Gupta Efficiency (KGE), and Nash-Sutcliffe Efficiency (NSE) are computed for each outer fold. The average metrics revealed that TCN outperformed the other models, with an average MAE of 0.011, RMSE of 0.023, $R^{2}$ of 0.991, KGE of 0.992, and NSE of 0.991. The findings of this study demonstrate the effectiveness of the deep learning model as compared to traditional machine learning approaches for snowmelt-driven streamflow forecasting. Moreover, the superior performance of TCN highlights its potential as a promising deep learning model for similar hydrological applications.

4/24/2024

🔗

A Transformer variant for multi-step forecasting of water level and hydrometeorological sensitivity analysis based on explainable artificial intelligence technology

Mingyu Liu, Nana Bao, Xingting Yan, Chenyang Li, Kai Peng

Understanding the combined influences of meteorological and hydrological factors on water level and flood events is essential, particularly in today's changing climate environments. Transformer, as one kind of the cutting-edge deep learning methods, offers an effective approach to model intricate nonlinear processes, enables the extraction of key features and water level predictions. EXplainable Artificial Intelligence (XAI) methods play important roles in enhancing the understandings of how different factors impact water level. In this study, we propose a Transformer variant by integrating sparse attention mechanism and introducing nonlinear output layer for the decoder module. The variant model is utilized for multi-step forecasting of water level, by considering meteorological and hydrological factors simultaneously. It is shown that the variant model outperforms traditional Transformer across different lead times with respect to various evaluation metrics. The sensitivity analyses based on XAI technology demonstrate the significant influence of meteorological factors on water level evolution, in which temperature is shown to be the most dominant meteorological factor. Therefore, incorporating both meteorological and hydrological factors is necessary for reliable hydrological prediction and flood prevention. In the meantime, XAI technology provides insights into certain predictions, which is beneficial for understanding the prediction results and evaluating the reasonability.

5/24/2024

🔮

Long-term drought prediction using deep neural networks based on geospatial weather data

Alexander Marusov, Vsevolod Grabar, Yury Maximov, Nazar Sotiriadi, Alexander Bulkin, Alexey Zaytsev

The problem of high-quality drought forecasting up to a year in advance is critical for agriculture planning and insurance. Yet, it is still unsolved with reasonable accuracy due to data complexity and aridity stochasticity. We tackle drought data by introducing an end-to-end approach that adopts a spatio-temporal neural network model with accessible open monthly climate data as the input. Our systematic research employs diverse proposed models and five distinct environmental regions as a testbed to evaluate the efficacy of the Palmer Drought Severity Index (PDSI) prediction. Key aggregated findings are the exceptional performance of a Transformer model, EarthFormer, in making accurate short-term (up to six months) forecasts. At the same time, the Convolutional LSTM excels in longer-term forecasting. Both models achieved high ROC AUC scores: 0.948 for one month ahead and 0.617 for twelve months ahead forecasts, becoming closer to perfect ROC-AUC by $54%$ and $16%$, respectively, c.t. classic approaches.

7/2/2024