Towards Interpretable Physical-Conceptual Catchment-Scale Hydrological Modeling using the Mass-Conserving-Perceptron

Read original: arXiv:2401.14521 - Published 7/30/2024 by Yuan-Heng Wang, Hoshin V. Gupta

Overview

The paper investigates using machine learning techniques to create parsimonious, interpretable, catchment-scale hydrologic models using directed-graph architectures based on the mass-conserving perceptron (MCP) as the fundamental computational unit.
The focus is on architectural complexity (depth) at a single location, rather than universal applicability (breadth) across large samples of catchments.
The goal is to discover a minimal representation (numbers of cell-states and flow paths) that can explain the input-state-output behaviors of a given catchment, with emphasis on simulating the full range of flow dynamics (high, medium, and low).

Plain English Explanation

The researchers are exploring how machine learning can be used to develop hydrologic models that are simple to understand, yet still accurately predict the flow of water in a specific catchment area. They are using a type of machine learning model called a mass-conserving perceptron (MCP) as the building block for these models.

The key idea is to find the simplest possible model, one with the fewest internal states and flow paths, that can still capture the full range of water flow, from high to low. This could make the models more interpretable and easier to use, while still maintaining accuracy.

The researchers focused on a single catchment area, rather than trying to create a model that works across many different locations. This allows them to home in on the specific characteristics of that one area and find the minimal representation that works best.

Technical Explanation

The paper explores the use of directed-graph architectures based on the mass-conserving perceptron (MCP) as the fundamental computational unit for developing parsimonious, interpretable, catchment-scale hydrologic models. The focus is on architectural complexity (depth) at a single location, rather than universal applicability (breadth) across large samples of catchments.
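
To make the idea of a mass-conserving computational unit concrete, here is a minimal sketch of a single storage cell whose gates split the stored mass into outflow, loss, and retained fractions that always sum to one. The softmax gating, the linear dependence of the gates on the state, and the names `gate_fractions`, `mcp_step`, and `simulate` are illustrative assumptions, not the formulation used in the paper.

```python
import math


def gate_fractions(state, w_out, b_out, w_loss, b_loss):
    """Split the stored mass into (outflow, loss, retained) fractions.

    Assumption: a softmax over three logits, with the "retain" logit
    fixed at 0, so the three fractions are positive and sum to 1.
    """
    a = w_out * state + b_out
    b = w_loss * state + b_loss
    m = max(a, b, 0.0)  # subtract the max logit for numerical stability
    ea, eb, er = math.exp(a - m), math.exp(b - m), math.exp(-m)
    total = ea + eb + er
    return ea / total, eb / total, er / total


def mcp_step(state, inflow, params):
    """Advance one cell by one time step; returns (new_state, outflow, loss)."""
    g_out, g_loss, g_keep = gate_fractions(state, *params)
    outflow = g_out * state
    loss = g_loss * state
    # Mass balance: what is not released or lost stays, plus the new input.
    new_state = g_keep * state + inflow
    return new_state, outflow, loss


def simulate(inflows, params, state0=0.0):
    """Run the cell over an inflow series; returns (outflows, losses, final_state)."""
    state = state0
    outflows, losses = [], []
    for u in inflows:
        state, o, l = mcp_step(state, u, params)
        outflows.append(o)
        losses.append(l)
    return outflows, losses, state


if __name__ == "__main__":
    inflows = [5.0, 0.0, 2.0, 0.0, 0.0, 1.0]  # hypothetical inputs
    params = (0.5, -1.0, 0.2, -2.0)           # hypothetical gate weights
    outflows, losses, final_state = simulate(inflows, params, state0=3.0)
    mass_in = 3.0 + sum(inflows)
    mass_out = sum(outflows) + sum(losses) + final_state
    print(f"mass in = {mass_in:.6f}, mass accounted for = {mass_out:.6f}")
```

Because the three fractions sum to one by construction, the mass entering the cell always equals the mass leaving it plus the mass still stored, whatever values the gate parameters take during training.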

The goal is to discover a minimal representation (numbers of cell-states and flow paths) that can explain the input-state-output behaviors of a given catchment, with particular emphasis on simulating the full range of flow dynamics (high, medium, and low). The researchers find that a HyMod-like architecture with three cell-states and two major flow pathways achieves such a representation at their study location.
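
As a rough illustration of what "three cell-states and two major flow pathways" can look like as a directed graph, the sketch below wires a soil-moisture cell to a quick-flow cell and a slow-flow cell whose outputs are summed into streamflow. The wiring, the constant release fractions, the `split` parameter, and all names are assumptions made for illustration; in the paper the gating functions are learned from data.

```python
from dataclasses import dataclass


@dataclass
class Cell:
    """A storage cell that releases fixed fractions of its state each step."""
    out_frac: float          # fraction of storage passed downstream
    loss_frac: float = 0.0   # fraction of storage lost (e.g. to evaporation)
    state: float = 0.0

    def step(self, inflow):
        outflow = self.out_frac * self.state
        loss = self.loss_frac * self.state
        self.state = self.state - outflow - loss + inflow
        return outflow, loss


def run_three_cell(precip, split=0.6):
    """Route precipitation through soil -> (quick, slow) -> streamflow.

    `split` is the hypothetical share of soil release sent to the quick path;
    the remainder goes to the slow path.
    """
    soil = Cell(out_frac=0.3, loss_frac=0.1)
    quick = Cell(out_frac=0.5)
    slow = Cell(out_frac=0.05)
    streamflow, total_loss = [], 0.0
    for p in precip:
        released, loss = soil.step(p)
        q_fast, _ = quick.step(split * released)          # pathway 1
        q_slow, _ = slow.step((1.0 - split) * released)   # pathway 2
        streamflow.append(q_fast + q_slow)
        total_loss += loss
    storage = soil.state + quick.state + slow.state
    return streamflow, total_loss, storage


if __name__ == "__main__":
    precip = [10.0, 0.0, 5.0, 0.0, 0.0]  # hypothetical forcing
    q, lost, stored = run_three_cell(precip)
    print("streamflow:", [round(v, 3) for v in q])
    print("mass check:", round(sum(precip) - (sum(q) + lost + stored), 9))
```

Each cell only moves mass between its state, its outputs, and its losses, so the graph as a whole conserves mass regardless of how many cells or pathways are added.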

The researchers also find that adding an input-bypass mechanism significantly improves the timing and shape of the simulated hydrograph, while including bi-directional groundwater mass exchanges significantly enhances the simulation of baseflow. Because these gains appear in different aspects of the hydrograph, the results demonstrate the importance of using multiple diagnostic metrics for model evaluation. They also highlight the need to select and design training metrics, based on information-theoretic foundations, that are better suited to extracting information across the full range of flow dynamics.
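
The point about multiple diagnostic metrics can be illustrated with three scores that are widely used in hydrology and that weight different parts of the flow range. This summary does not name the metrics used in the paper, so the choice of Nash-Sutcliffe efficiency (NSE), Kling-Gupta efficiency (KGE), and NSE on log-transformed flows is an assumption, as are the function names and the `eps` offset.

```python
import math
from statistics import mean, pstdev


def nse(obs, sim):
    """Nash-Sutcliffe efficiency; dominated by errors at high flows."""
    mu = mean(obs)
    num = sum((s - o) ** 2 for o, s in zip(obs, sim))
    den = sum((o - mu) ** 2 for o in obs)
    return 1.0 - num / den


def kge(obs, sim):
    """Kling-Gupta efficiency: combines correlation, variability, and bias.

    Undefined if either series is constant (zero standard deviation).
    """
    mo, ms = mean(obs), mean(sim)
    so, ss = pstdev(obs), pstdev(sim)
    cov = mean([(o - mo) * (s - ms) for o, s in zip(obs, sim)])
    r = cov / (so * ss)   # linear correlation
    alpha = ss / so       # variability ratio
    beta = ms / mo        # bias ratio
    return 1.0 - math.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)


def log_nse(obs, sim, eps=1e-6):
    """NSE on log-transformed flows; gives more weight to low flows."""
    log_obs = [math.log(o + eps) for o in obs]
    log_sim = [math.log(s + eps) for s in sim]
    return nse(log_obs, log_sim)


if __name__ == "__main__":
    obs = [1.0, 3.0, 10.0, 4.0, 2.0, 1.5]   # hypothetical observed flows
    sim = [1.2, 2.5, 9.0, 4.5, 2.2, 1.0]    # hypothetical simulated flows
    for name, fn in (("NSE", nse), ("KGE", kge), ("log-NSE", log_nse)):
        print(f"{name}: {fn(obs, sim):.3f}")
```

A model can score well on NSE while still misrepresenting baseflow, which is why looking at a low-flow-sensitive score alongside it can reveal structural deficiencies that a single metric would hide.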

The findings set the stage for interpretable regional-scale MCP-based hydrological modeling (using large sample data) by using neural architecture search to determine appropriate minimal representations for catchments in different hydroclimatic regimes.

Critical Analysis

The paper makes a strong case for the use of directed-graph architectures and MCPs in developing interpretable, catchment-scale hydrologic models. The focus on finding a minimal representation that can capture the full range of flow dynamics is particularly noteworthy, as it could lead to models that are both accurate and easy to understand.

However, the research is limited to a single catchment area, and the findings may not necessarily generalize to other locations with different hydroclimatic regimes. The authors acknowledge this and suggest that further research using neural architecture search to determine appropriate minimal representations for different catchments would be a valuable next step.

Additionally, the paper does not address the potential challenges of scaling these models to larger regional or national scales, where the diversity of catchment characteristics may require more complex representations. The authors' mention of MCP-based hydrological modeling at the regional scale is intriguing, but the specifics of how this would be achieved are not elaborated upon.

Overall, the research presented in this paper is a promising step towards the development of interpretable, machine learning-based hydrologic models, but further work is needed to fully realize the potential of this approach.

Conclusion

This paper explores the use of machine learning, specifically directed-graph architectures and mass-conserving perceptrons (MCPs), to create parsimonious, interpretable, catchment-scale hydrologic models. The key finding is that a minimal representation with just three cell-states and two major flow pathways can effectively capture the full range of flow dynamics in a specific catchment area.

Adding an input-bypass mechanism improved the timing and shape of the hydrograph, and adding bi-directional groundwater mass exchanges improved the simulation of baseflow. The results also underline the importance of using multiple diagnostic metrics for evaluation. The researchers suggest that this work lays the foundation for interpretable regional-scale MCP-based hydrological modeling using neural architecture search to determine appropriate minimal representations for different hydroclimatic regimes.

While the findings are promising, the limited scope of the study and the potential challenges of scaling the models to larger regions warrant further research. Nevertheless, this paper represents an important step towards the development of simple, yet accurate, hydrologic models that can be readily understood and applied by practitioners in the field.

Related Papers

Towards Interpretable Physical-Conceptual Catchment-Scale Hydrological Modeling using the Mass-Conserving-Perceptron

Yuan-Heng Wang, Hoshin V. Gupta

We investigate the applicability of machine learning technologies to the development of parsimonious, interpretable, catchment-scale hydrologic models using directed-graph architectures based on the mass-conserving perceptron (MCP) as the fundamental computational unit. Here, we focus on architectural complexity (depth) at a single location, rather than universal applicability (breadth) across large samples of catchments. The goal is to discover a minimal representation (numbers of cell-states and flow paths) that represents the dominant processes that can explain the input-state-output behaviors of a given catchment, with particular emphasis given to simulating the full range (high, medium, and low) of flow dynamics. We find that a HyMod Like architecture with three cell-states and two major flow pathways achieves such a representation at our study location, but that the additional incorporation of an input-bypass mechanism significantly improves the timing and shape of the hydrograph, while the inclusion of bi-directional groundwater mass exchanges significantly enhances the simulation of baseflow. Overall, our results demonstrate the importance of using multiple diagnostic metrics for model evaluation, while highlighting the need for properly selecting and designing the training metrics based on information-theoretic foundations that are better suited to extracting information across the full range of flow dynamics. This study sets the stage for interpretable regional-scale MCP-based hydrological modeling (using large sample data) by using neural architecture search to determine appropriate minimal representations for catchments in different hydroclimatic regimes.

7/30/2024

A Mass-Conserving-Perceptron for Machine Learning-Based Modeling of Geoscientific Systems

Yuan-Heng Wang, Hoshin V. Gupta

Although decades of effort have been devoted to building Physical-Conceptual (PC) models for predicting the time-series evolution of geoscientific systems, recent work shows that Machine Learning (ML) based Gated Recurrent Neural Network technology can be used to develop models that are much more accurate. However, the difficulty of extracting physical understanding from ML-based models complicates their utility for enhancing scientific knowledge regarding system structure and function. Here, we propose a physically-interpretable Mass Conserving Perceptron (MCP) as a way to bridge the gap between PC-based and ML-based modeling approaches. The MCP exploits the inherent isomorphism between the directed graph structures underlying both PC models and GRNNs to explicitly represent the mass-conserving nature of physical processes while enabling the functional nature of such processes to be directly learned (in an interpretable manner) from available data using off-the-shelf ML technology. As a proof of concept, we investigate the functional expressivity (capacity) of the MCP, explore its ability to parsimoniously represent the rainfall-runoff (RR) dynamics of the Leaf River Basin, and demonstrate its utility for scientific hypothesis testing. To conclude, we discuss extensions of the concept to enable ML-based physical-conceptual representation of the coupled nature of mass-energy-information flows through geoscientific systems.

5/14/2024

Machine learning surrogates for efficient hydrologic modeling: Insights from stochastic simulations of managed aquifer recharge

Timothy Dai, Kate Maher, Zach Perzan

Process-based hydrologic models are invaluable tools for understanding the terrestrial water cycle and addressing modern water resources problems. However, many hydrologic models are computationally expensive and, depending on the resolution and scale, simulations can take on the order of hours to days to complete. While techniques such as uncertainty quantification and optimization have become valuable tools for supporting management decisions, these analyses typically require hundreds of model simulations, which are too computationally expensive to perform with a process-based hydrologic model. To address this gap, we propose a hybrid modeling workflow in which a process-based model is used to generate an initial set of simulations and a machine learning (ML) surrogate model is then trained to perform the remaining simulations required for downstream analysis. As a case study, we apply this workflow to simulations of variably saturated groundwater flow at a prospective managed aquifer recharge (MAR) site. We compare the accuracy and computational efficiency of several ML architectures, including deep convolutional networks, recurrent neural networks, vision transformers, and networks with Fourier transforms. Our results demonstrate that ML surrogate models can achieve under 10% mean absolute percentage error and yield order-of-magnitude runtime savings over process-based models. We also offer practical recommendations for training hydrologic surrogate models, including implementing data normalization to improve accuracy, using a normalized loss function to improve training stability and downsampling input features to decrease memory requirements.

7/31/2024

Physics-aware Machine Learning Revolutionizes Scientific Paradigm for Machine Learning and Process-based Hydrology

Qingsong Xu, Yilei Shi, Jonathan Bamber, Ye Tuo, Ralf Ludwig, Xiao Xiang Zhu

Accurate hydrological understanding and water cycle prediction are crucial for addressing scientific and societal challenges associated with the management of water resources, particularly under the dynamic influence of anthropogenic climate change. Existing reviews predominantly concentrate on the development of machine learning (ML) in this field, yet there is a clear distinction between hydrology and ML as separate paradigms. Here, we introduce physics-aware ML as a transformative approach to overcome the perceived barrier and revolutionize both fields. Specifically, we present a comprehensive review of the physics-aware ML methods, building a structured community (PaML) of existing methodologies that integrate prior physical knowledge or physics-based modeling into ML. We systematically analyze these PaML methodologies with respect to four aspects: physical data-guided ML, physics-informed ML, physics-embedded ML, and physics-aware hybrid learning. PaML facilitates ML-aided hypotheses, accelerating insights from big data and fostering scientific discoveries. We first conduct a systematic review of hydrology in PaML, including rainfall-runoff hydrological processes and hydrodynamic processes, and highlight the most promising and challenging directions for different objectives and PaML methods. Finally, a new PaML-based hydrology platform, termed HydroPML, is released as a foundation for hydrological applications. HydroPML enhances the explainability and causality of ML and lays the groundwork for the digital water cycle's realization. The HydroPML platform is publicly available at https://hydropml.github.io/.

7/15/2024