On the importance of learning non-local dynamics for stable data-driven climate modeling: A 1D gravity wave-QBO testbed

Read original: arXiv:2407.05224 - Published 7/17/2024 by Hamid A. Pahlavan, Pedram Hassanzadeh, M. Joan Alexander

On the importance of learning non-local dynamics for stable data-driven climate modeling: A 1D gravity wave-QBO testbed

Overview

Explores the importance of modeling non-local dynamics for stable data-driven climate modeling
Presents a 1D gravity wave-QBO (Quasi-Biennial Oscillation) testbed to study this challenge
Investigates the instability of neural network-based parameterizations and the role of receptive field in learning non-local dynamics

Plain English Explanation

The paper focuses on an important challenge in using machine learning for climate modeling: the need to capture non-local, or long-range, interactions between different parts of the climate system.

The researchers created a simplified 1D model of gravity waves and the Quasi-Biennial Oscillation (QBO) - a important climate phenomenon where winds in the tropical stratosphere oscillate between eastward and westward directions.

They found that standard neural network-based approaches to modeling this system can be unstable and fail to accurately capture the non-local nature of the gravity wave-QBO interactions. The size of the neural network's "receptive field" - how much of the input it can "see" at once - plays a crucial role in determining whether it can learn these non-local dynamics.

The findings suggest that to build reliable data-driven climate models, it's important to design neural networks that can effectively capture long-range interactions between different parts of the climate system, rather than just local, short-range effects. This is a key challenge that must be addressed for machine learning to become a robust tool for climate modeling and prediction.

Technical Explanation

The paper investigates the importance of modeling non-local dynamics for stable and reliable data-driven climate modeling, using a 1D gravity wave-QBO testbed.

The researchers first demonstrate the instability of neural network-based parameterizations when applied to this 1D system. They show that standard neural network architectures fail to capture the non-local interactions between gravity waves and the QBO, leading to unstable and inaccurate predictions.

The authors then analyze the role of the neural network's receptive field in learning these non-local dynamics. They find that the size of the receptive field is crucial - networks with a larger receptive field are better able to learn the long-range interactions between gravity waves and the QBO, resulting in more stable and accurate simulations.

These findings highlight the importance of designing machine learning models that can effectively capture non-local, long-range effects in complex climate systems, rather than just local, short-range interactions. This is a key challenge that must be addressed for machine learning to become a robust tool for climate modeling and prediction.

Critical Analysis

The paper provides valuable insights into the challenges of using neural networks for climate modeling, particularly in capturing non-local dynamics. The 1D gravity wave-QBO testbed is a useful simplification that allows the researchers to isolate and study this important problem.

One potential limitation is the use of a highly simplified 1D model, which may not fully capture the complexity of real-world climate systems. While this approach allows for controlled experimentation, further research is needed to understand how these findings translate to more comprehensive climate models.

Additionally, the paper does not explore potential solutions or alternative neural network architectures that could better handle non-local dynamics. Investigating novel model designs or hybrid approaches that combine machine learning with physical understanding may be a fruitful direction for future work.

Overall, this research highlights a critical challenge that must be addressed for machine learning to become a reliable tool for climate modeling and prediction. The findings encourage readers to think critically about the limitations of current approaches and the importance of designing models that can capture the full complexity of the climate system.

Conclusion

This paper emphasizes the importance of learning non-local dynamics for stable and reliable data-driven climate modeling. By using a 1D gravity wave-QBO testbed, the researchers demonstrate the instability of standard neural network-based parameterizations and the crucial role of receptive field size in capturing long-range interactions.

The findings suggest that to build effective machine learning models for climate applications, it is essential to design architectures that can effectively handle non-local, long-range effects, rather than just local, short-range interactions. This is a key challenge that must be addressed for machine learning to become a robust tool for climate modeling and prediction, with significant implications for our understanding and forecasting of complex environmental systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

On the importance of learning non-local dynamics for stable data-driven climate modeling: A 1D gravity wave-QBO testbed

Hamid A. Pahlavan, Pedram Hassanzadeh, M. Joan Alexander

Machine learning (ML) techniques, especially neural networks (NNs), have shown promise in learning subgrid-scale parameterizations for climate models. However, a major problem with data-driven parameterizations, particularly those learned with supervised algorithms, is model instability. Current remedies are often ad-hoc and lack a theoretical foundation. Here, we combine ML theory and climate physics to address a source of instability in NN-based parameterization. We demonstrate the importance of learning spatially $textit{non-local}$ dynamics using a 1D model of the quasi-biennial oscillation (QBO) with gravity wave (GW) parameterization as a testbed. While common offline metrics fail to identify shortcomings in learning non-local dynamics, we show that the concept of receptive field (RF) can identify instability a-priori. We find that NN-based parameterizations that seem to accurately predict GW forcings from wind profiles ($mathbf{R^2 approx 0.99}$) cause unstable simulations when RF is too small to capture the non-local dynamics, while NNs of the same size but large-enough RF are stable. We examine three broad classes of architectures, namely convolutional NNs, Fourier neural operators, and fully-connected NNs; the latter two have inherently large RFs. We also demonstrate that learning non-local dynamics is crucial for the stability and accuracy of a data-driven spatiotemporal emulator of the zonal wind field. Given the ubiquity of non-local dynamics in the climate system, we expect the use of effective RF, which can be computed for any NN architecture, to be important for many applications. This work highlights the necessity of integrating ML theory with physics to design and analyze data-driven algorithms for weather and climate modeling.

7/17/2024

Machine Learning Global Simulation of Nonlocal Gravity Wave Propagation

Aman Gupta, Aditi Sheshadri, Sujit Roy, Vishal Gaur, Manil Maskey, Rahul Ramachandran

Global climate models typically operate at a grid resolution of hundreds of kilometers and fail to resolve atmospheric mesoscale processes, e.g., clouds, precipitation, and gravity waves (GWs). Model representation of these processes and their sources is essential to the global circulation and planetary energy budget, but subgrid scale contributions from these processes are often only approximately represented in models using parameterizations. These parameterizations are subject to approximations and idealizations, which limit their capability and accuracy. The most drastic of these approximations is the single-column approximation which completely neglects the horizontal evolution of these processes, resulting in key biases in current climate models. With a focus on atmospheric GWs, we present the first-ever global simulation of atmospheric GW fluxes using machine learning (ML) models trained on the WINDSET dataset to emulate global GW emulation in the atmosphere, as an alternative to traditional single-column parameterizations. Using an Attention U-Net-based architecture trained on globally resolved GW momentum fluxes, we illustrate the importance and effectiveness of global nonlocality, when simulating GWs using data-driven schemes.

6/24/2024

A probabilistic framework for learning non-intrusive corrections to long-time climate simulations from short-time training data

Benedikt Barthel Sorensen, Leonardo Zepeda-N'u~nez, Ignacio Lopez-Gomez, Zhong Yi Wan, Rob Carver, Fei Sha, Themistoklis Sapsis

Chaotic systems, such as turbulent flows, are ubiquitous in science and engineering. However, their study remains a challenge due to the large range scales, and the strong interaction with other, often not fully understood, physics. As a consequence, the spatiotemporal resolution required for accurate simulation of these systems is typically computationally infeasible, particularly for applications of long-term risk assessment, such as the quantification of extreme weather risk due to climate change. While data-driven modeling offers some promise of alleviating these obstacles, the scarcity of high-quality simulations results in limited available data to train such models, which is often compounded by the lack of stability for long-horizon simulations. As such, the computational, algorithmic, and data restrictions generally imply that the probability of rare extreme events is not accurately captured. In this work we present a general strategy for training neural network models to non-intrusively correct under-resolved long-time simulations of chaotic systems. The approach is based on training a post-processing correction operator on under-resolved simulations nudged towards a high-fidelity reference. This enables us to learn the dynamics of the underlying system directly, which allows us to use very little training data, even when the statistics thereof are far from converged. Additionally, through the use of probabilistic network architectures we are able to leverage the uncertainty due to the limited training data to further improve extrapolation capabilities. We apply our framework to severely under-resolved simulations of quasi-geostrophic flow and demonstrate its ability to accurately predict the anisotropic statistics over time horizons more than 30 times longer than the data seen in training.

8/7/2024

Higher order quantum reservoir computing for non-intrusive reduced-order models

Vinamr Jain, Romit Maulik

Forecasting dynamical systems is of importance to numerous real-world applications. When possible, dynamical systems forecasts are constructed based on first-principles-based models such as through the use of differential equations. When these equations are unknown, non-intrusive techniques must be utilized to build predictive models from data alone. Machine learning (ML) methods have recently been used for such tasks. Moreover, ML methods provide the added advantage of significant reductions in time-to-solution for predictions in contrast with first-principle based models. However, many state-of-the-art ML-based methods for forecasting rely on neural networks, which may be expensive to train and necessitate requirements for large amounts of memory. In this work, we propose a quantum mechanics inspired ML modeling strategy for learning nonlinear dynamical systems that provides data-driven forecasts for complex dynamical systems with reduced training time and memory costs. This approach, denoted the quantum reservoir computing technique (QRC), is a hybrid quantum-classical framework employing an ensemble of interconnected small quantum systems via classical linear feedback connections. By mapping the dynamical state to a suitable quantum representation amenable to unitary operations, QRC is able to predict complex nonlinear dynamical systems in a stable and accurate manner. We demonstrate the efficacy of this framework through benchmark forecasts of the NOAA Optimal Interpolation Sea Surface Temperature dataset and compare the performance of QRC to other ML methods.

8/1/2024