A probabilistic framework for learning non-intrusive corrections to long-time climate simulations from short-time training data

Read original: arXiv:2408.02688 - Published 8/7/2024 by Benedikt Barthel Sorensen, Leonardo Zepeda-N'u~nez, Ignacio Lopez-Gomez, Zhong Yi Wan, Rob Carver, Fei Sha, Themistoklis Sapsis

A probabilistic framework for learning non-intrusive corrections to long-time climate simulations from short-time training data

Overview

Presents a probabilistic framework for learning corrections to long-term climate simulations from short-term training data
Aims to improve the accuracy of climate models by incorporating observed data into the simulation process
Proposes a non-intrusive approach that can be applied without modifying the underlying climate model

Plain English Explanation

The paper describes a new way to improve the accuracy of long-term climate simulations by using short-term observational data. Climate models are complex computer programs that simulate the Earth's climate, but they often have biases or errors that can accumulate over time. The researchers developed a probabilistic framework that can learn corrections to the simulations based on real-world data, without needing to modify the underlying climate model itself.

The key idea is to train a machine learning model on short-term observational data, and then use that model to make non-intrusive corrections to the long-term climate simulation. This allows the simulation to stay close to the observed data, while still capturing the complex dynamics of the climate system. The approach is "non-intrusive" because it doesn't require changing the climate model itself, which can be a challenging and time-consuming process.

Technical Explanation

The paper presents a probabilistic framework for learning corrections to long-term climate simulations from short-term training data. The framework consists of three key components:

Discrepancy Model: A machine learning model that learns the discrepancy between the climate model output and the observed data, using short-term training data.
Bayesian Inference: A Bayesian inference procedure that updates the discrepancy model parameters based on new observations, allowing the corrections to adapt over time.
Non-intrusive Correction: A method for applying the learned discrepancy corrections to the long-term climate simulation without modifying the underlying model.

The authors demonstrate the effectiveness of this approach using numerical experiments on a simplified climate model, showing that it can significantly improve the accuracy of long-term simulations compared to the original climate model.

Critical Analysis

The paper presents a promising approach for improving the accuracy of long-term climate simulations, but it also has some limitations and caveats:

The framework assumes that the discrepancy between the climate model and observations can be learned from short-term data, which may not always be the case, especially for rare or extreme events.
The method relies on the availability of high-quality observational data, which can be scarce or unevenly distributed, particularly in remote or inaccessible regions.
The non-intrusive correction approach may not be able to capture all the complex interactions and feedbacks within the climate system, potentially limiting the effectiveness of the corrections.

Further research could explore ways to address these limitations, such as developing more robust discrepancy models, incorporating additional sources of observational data, or exploring more sophisticated correction techniques that better capture the underlying climate dynamics.

Conclusion

This paper presents a novel probabilistic framework for improving the accuracy of long-term climate simulations by learning non-intrusive corrections from short-term training data. The approach offers a promising way to leverage observational data to enhance the performance of climate models, without the need for extensive modifications to the underlying simulation code. While the method has some limitations, it represents an important step towards developing more reliable and accurate climate projections, which are essential for informing policy decisions and addressing the challenges of climate change.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

A probabilistic framework for learning non-intrusive corrections to long-time climate simulations from short-time training data

Benedikt Barthel Sorensen, Leonardo Zepeda-N'u~nez, Ignacio Lopez-Gomez, Zhong Yi Wan, Rob Carver, Fei Sha, Themistoklis Sapsis

Chaotic systems, such as turbulent flows, are ubiquitous in science and engineering. However, their study remains a challenge due to the large range scales, and the strong interaction with other, often not fully understood, physics. As a consequence, the spatiotemporal resolution required for accurate simulation of these systems is typically computationally infeasible, particularly for applications of long-term risk assessment, such as the quantification of extreme weather risk due to climate change. While data-driven modeling offers some promise of alleviating these obstacles, the scarcity of high-quality simulations results in limited available data to train such models, which is often compounded by the lack of stability for long-horizon simulations. As such, the computational, algorithmic, and data restrictions generally imply that the probability of rare extreme events is not accurately captured. In this work we present a general strategy for training neural network models to non-intrusively correct under-resolved long-time simulations of chaotic systems. The approach is based on training a post-processing correction operator on under-resolved simulations nudged towards a high-fidelity reference. This enables us to learn the dynamics of the underlying system directly, which allows us to use very little training data, even when the statistics thereof are far from converged. Additionally, through the use of probabilistic network architectures we are able to leverage the uncertainty due to the limited training data to further improve extrapolation capabilities. We apply our framework to severely under-resolved simulations of quasi-geostrophic flow and demonstrate its ability to accurately predict the anisotropic statistics over time horizons more than 30 times longer than the data seen in training.

8/7/2024

Conditional diffusion models for downscaling & bias correction of Earth system model precipitation

Michael Aich, Philipp Hess, Baoxiang Pan, Sebastian Bathiany, Yu Huang, Niklas Boers

Climate change exacerbates extreme weather events like heavy rainfall and flooding. As these events cause severe losses of property and lives, accurate high-resolution simulation of precipitation is imperative. However, existing Earth System Models (ESMs) struggle with resolving small-scale dynamics and suffer from biases, especially for extreme events. Traditional statistical bias correction and downscaling methods fall short in improving spatial structure, while recent deep learning methods lack controllability over the output and suffer from unstable training. Here, we propose a novel machine learning framework for simultaneous bias correction and downscaling. We train a generative diffusion model in a supervised way purely on observational data. We map observational and ESM data to a shared embedding space, where both are unbiased towards each other and train a conditional diffusion model to reverse the mapping. Our method can be used to correct any ESM field, as the training is independent of the ESM. Our approach ensures statistical fidelity, preserves large-scale spatial patterns and outperforms existing methods especially regarding extreme events and small-scale spatial features that are crucial for impact assessments.

4/24/2024

On the importance of learning non-local dynamics for stable data-driven climate modeling: A 1D gravity wave-QBO testbed

Hamid A. Pahlavan, Pedram Hassanzadeh, M. Joan Alexander

Machine learning (ML) techniques, especially neural networks (NNs), have shown promise in learning subgrid-scale parameterizations for climate models. However, a major problem with data-driven parameterizations, particularly those learned with supervised algorithms, is model instability. Current remedies are often ad-hoc and lack a theoretical foundation. Here, we combine ML theory and climate physics to address a source of instability in NN-based parameterization. We demonstrate the importance of learning spatially $textit{non-local}$ dynamics using a 1D model of the quasi-biennial oscillation (QBO) with gravity wave (GW) parameterization as a testbed. While common offline metrics fail to identify shortcomings in learning non-local dynamics, we show that the concept of receptive field (RF) can identify instability a-priori. We find that NN-based parameterizations that seem to accurately predict GW forcings from wind profiles ($mathbf{R^2 approx 0.99}$) cause unstable simulations when RF is too small to capture the non-local dynamics, while NNs of the same size but large-enough RF are stable. We examine three broad classes of architectures, namely convolutional NNs, Fourier neural operators, and fully-connected NNs; the latter two have inherently large RFs. We also demonstrate that learning non-local dynamics is crucial for the stability and accuracy of a data-driven spatiotemporal emulator of the zonal wind field. Given the ubiquity of non-local dynamics in the climate system, we expect the use of effective RF, which can be computed for any NN architecture, to be important for many applications. This work highlights the necessity of integrating ML theory with physics to design and analyze data-driven algorithms for weather and climate modeling.

7/17/2024

Capturing Climatic Variability: Using Deep Learning for Stochastic Downscaling

Kiri Daust, Adam Monahan

Adapting to the changing climate requires accurate local climate information, a computationally challenging problem. Recent studies have used Generative Adversarial Networks (GANs), a type of deep learning, to learn complex distributions and downscale climate variables efficiently. Capturing variability while downscaling is crucial for estimating uncertainty and characterising extreme events - critical information for climate adaptation. Since downscaling is an undetermined problem, many fine-scale states are physically consistent with the coarse-resolution state. To quantify this ill-posed problem, downscaling techniques should be stochastic, able to sample realisations from a high-resolution distribution conditioned on low-resolution input. Previous stochastic downscaling attempts have found substantial underdispersion, with models failing to represent the full distribution. We propose approaches to improve the stochastic calibration of GANs in three ways: a) injecting noise inside the network, b) adjusting the training process to explicitly account for the stochasticity, and c) using a probabilistic loss metric. We tested our models first on a synthetic dataset with known distributional properties, and then on a realistic downscaling scenario, predicting high-resolution wind components from low-resolution climate covariates. Injecting noise, on its own, substantially improved the quality of conditional and full distributions in tests with synthetic data, but performed less well for wind field downscaling, where models remained underdispersed. For wind downscaling, we found that adjusting the training method and including the probabilistic loss improved calibration. The best model, with all three changes, showed much improved skill at capturing the full variability of the high-resolution distribution and thus at characterising extremes.

6/6/2024