Sampling Hybrid Climate Simulation at Scale to Reliably Improve Machine Learning Parameterization

Read original: arXiv:2309.16177 - Published 7/8/2024 by Jerry Lin, Sungduk Yu, Liran Peng, Tom Beucler, Eliot Wong-Toi, Zeyuan Hu, Pierre Gentine, Margarita Geleta, Mike Pritchard

Overview

Machine learning (ML) models can potentially replace traditional physics-based parameterizations for simulating complex processes like turbulence, convection, and radiation in climate models.
However, it has been unclear whether improved performance of ML models in offline tests translates to better performance when coupled to a full climate model.
This research aims to address this question by extensively testing the coupled behavior of ML parameterizations using large ensembles of climate model simulations.

Plain English Explanation

Climate models use simplified representations, or parameterizations, of complex physical processes that occur at scales smaller than the model's grid resolution. Traditional parameterizations are based on physical laws and empirical data.

Machine learning offers a potential alternative, where the parameterizations are learned from high-resolution simulations or observations. The hope is that ML models can capture the complex dynamics more accurately than traditional approaches.

However, a key challenge has been whether the improved performance of ML models in isolated offline tests translates to better performance when they are actually used in a full climate model. This matters because, once coupled, the ML model's outputs change the model state that becomes its next input, so small errors can feed back and compound in ways that offline tests never expose.

This research tackles this challenge by extensively testing the coupled behavior of ML parameterizations: the researchers ran 2,970 "hybrid" climate model simulations in which ML parameterizations are coupled to a conventional climate model. This allows them to statistically analyze how ML model design choices affect overall climate model performance.

Technical Explanation

The researchers developed full-physics ML parameterizations for key processes like turbulence, convection, and radiation. They then ran large ensembles of "hybrid" climate model simulations, where these ML parameterizations were coupled to the rest of the climate model.
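To make the offline/online distinction concrete, here is a minimal toy sketch of a hybrid time-stepping loop. It is not the paper's setup: the functions `dynamics_step`, `true_subgrid_tendency`, and `ml_subgrid_tendency`, the relaxation dynamics, and the constant `bias` are all hypothetical stand-ins chosen only to show where each error is measured.

```python
import numpy as np

def dynamics_step(state, dt):
    """Hypothetical stand-in for the resolved-scale dynamics (simple relaxation)."""
    return state + dt * (-0.1 * state)

def true_subgrid_tendency(state):
    """Hypothetical stand-in for the reference subgrid physics."""
    return np.sin(state)

def ml_subgrid_tendency(state, bias=0.02):
    """Hypothetical stand-in for a trained emulator: the reference plus a small error."""
    return np.sin(state) + bias

def run(state0, n_steps, dt, subgrid):
    """Alternate a dynamics step with a subgrid update and record every state."""
    state = state0.copy()
    trajectory = [state.copy()]
    for _ in range(n_steps):
        state = dynamics_step(state, dt)
        state = state + dt * subgrid(state)  # the parameterization feeds back into the state
        trajectory.append(state.copy())
    return np.array(trajectory)

state0 = np.linspace(-1.0, 1.0, 8)
reference = run(state0, 200, 0.1, true_subgrid_tendency)
hybrid = run(state0, 200, 0.1, ml_subgrid_tendency)

# Offline error: emulator vs. reference tendencies, both evaluated on reference states.
offline_rmse = np.sqrt(
    np.mean((ml_subgrid_tendency(reference) - true_subgrid_tendency(reference)) ** 2)
)

# Online error: state mismatch at each step of the coupled rollout.
online_rmse = np.sqrt(np.mean((hybrid - reference) ** 2, axis=1))

print(f"offline RMSE: {offline_rmse:.4f}")
print(f"online RMSE after final step: {online_rmse[-1]:.4f}")
```

In this toy, the offline error is fixed by construction, while the online error depends on how the emulator's error interacts with the dynamics over the rollout. That interaction is what the ensemble of hybrid simulations is designed to sample.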

By extensively sampling the impact of different ML model design choices and tuning, the researchers were able to:

Statistically confirm that reducing the offline error of the ML models does indeed lead to reduced error in the online, coupled climate simulations, under certain constraints.
Reveal that decisions that improve online performance, like removing dropout, can trade off against the overall stability of the hybrid climate model.
Identify specific design choices, like incorporating memory and training on multiple climate regimes, that yielded clear improvements to both offline and online performance.
Find that converting the moisture input from specific humidity to relative humidity enhanced online stability, while using a Mean Absolute Error (MAE) loss broke the relationship between offline and online error.
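As an illustration of the moisture-input conversion mentioned in the last finding, the sketch below converts specific humidity to relative humidity. The summary does not say which formula the researchers used, so this is an assumption: it uses the Bolton (1980) approximation for saturation vapor pressure over liquid water and ignores ice. The function names are hypothetical.

```python
import numpy as np

EPSILON = 0.622  # ratio of the gas constants of dry air and water vapor

def saturation_vapor_pressure_hpa(temp_k):
    """Saturation vapor pressure over liquid water in hPa (Bolton 1980 approximation)."""
    temp_c = np.asarray(temp_k, dtype=float) - 273.15
    return 6.112 * np.exp(17.67 * temp_c / (temp_c + 243.5))

def specific_to_relative_humidity(q, temp_k, pressure_hpa):
    """Convert specific humidity (kg/kg) to relative humidity as a fraction.

    Assumption: relative humidity is defined as e / e_sat over liquid water.
    """
    q = np.asarray(q, dtype=float)
    pressure_hpa = np.asarray(pressure_hpa, dtype=float)
    # Invert q = EPSILON * e / (p - (1 - EPSILON) * e) for the vapor pressure e.
    vapor_pressure = q * pressure_hpa / (EPSILON + (1.0 - EPSILON) * q)
    return vapor_pressure / saturation_vapor_pressure_hpa(temp_k)

# Example column (illustrative values only): three levels from near-surface upward.
q = np.array([0.012, 0.006, 0.001])          # kg/kg
temp_k = np.array([295.0, 280.0, 255.0])     # K
pressure_hpa = np.array([1000.0, 850.0, 500.0])
rh = specific_to_relative_humidity(q, temp_k, pressure_hpa)
print(rh)
```

One plausible reason such a conversion helps is that relative humidity stays within a similar range across warm and cold climates, whereas specific humidity varies by orders of magnitude, so the network sees inputs that are less likely to fall outside its training distribution.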

These insights help address the previously unresolved questions about how to design ML parameterizations that perform well when coupled to a full climate model.

Critical Analysis

The researchers acknowledge several caveats and limitations in their work:

The study focuses on a specific set of physical processes and a particular climate model; the generalizability to other processes and models is not guaranteed.
The computational expense of the large ensemble simulations limits the ability to explore the entire design space of ML parameterizations.
Some of the observed tradeoffs, like between online performance and model stability, may require further investigation to fully understand the underlying mechanisms.

Additionally, one could question whether the statistical correlations observed between offline and online performance are sufficient to draw firm conclusions. Further research may be needed to establish more definitive causal relationships.
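A minimal sketch of how such an offline/online relationship might be checked across an ensemble is shown below. It computes a Spearman rank correlation and a permutation p-value using NumPy only. The error values are synthetic, generated purely for illustration, and the summary does not state which statistical test the researchers actually applied. As the paragraph above notes, a significant correlation of this kind does not by itself establish causation.

```python
import numpy as np

def ranks(x):
    """Ranks 0..n-1 of the values in x (assumes no ties, as with continuous data)."""
    order = np.argsort(x)
    r = np.empty(len(x), dtype=float)
    r[order] = np.arange(len(x))
    return r

def spearman_rho(x, y):
    """Spearman rank correlation: the Pearson correlation of the ranks."""
    return float(np.corrcoef(ranks(x), ranks(y))[0, 1])

def permutation_p_value(x, y, n_perm=2000, seed=0):
    """Two-sided p-value: fraction of shuffled pairings at least as correlated as observed."""
    rng = np.random.default_rng(seed)
    observed = abs(spearman_rho(x, y))
    null = np.array([abs(spearman_rho(x, rng.permutation(y))) for _ in range(n_perm)])
    return float(np.mean(null >= observed))

# Synthetic ensemble (NOT data from the paper): one offline and one online error per member.
rng = np.random.default_rng(42)
offline_error = rng.uniform(0.1, 1.0, size=60)
online_error = 2.0 * offline_error + rng.normal(0.0, 0.2, size=60)

rho = spearman_rho(offline_error, online_error)
p_value = permutation_p_value(offline_error, online_error)
print(f"Spearman rho = {rho:.2f}, permutation p-value = {p_value:.4f}")
```

A rank-based statistic is used here because it only assumes a monotonic relationship, not a linear one.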

Overall, this work represents an important step forward in addressing a critical challenge in the development of ML-based climate parameterizations. The insights provided here can inform future research and help guide the design of more effective hybrid physics-ML climate models.

Conclusion

This research takes a significant step towards understanding how to design machine learning-based parameterizations that can be effectively coupled to full-scale climate models. By extensively testing the online behavior of these hybrid models, the researchers were able to shed light on the complex relationships between offline performance, online performance, and model stability.

The findings suggest that carefully designed ML parameterizations, such as those that incorporate memory and are trained on diverse climate regimes, can indeed lead to improvements in overall climate model fidelity. However, the researchers also uncovered important tradeoffs that will need to be navigated.

This work contributes to ongoing efforts to use machine learning to improve the realism and predictive skill of climate models, with potential implications for our understanding of the Earth's climate system and our ability to prepare for future climate change.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Sampling Hybrid Climate Simulation at Scale to Reliably Improve Machine Learning Parameterization

Jerry Lin, Sungduk Yu, Liran Peng, Tom Beucler, Eliot Wong-Toi, Zeyuan Hu, Pierre Gentine, Margarita Geleta, Mike Pritchard

Machine-learning (ML) parameterizations of subgrid processes (here of turbulence, convection, and radiation) may one day replace conventional parameterizations by emulating high-resolution physics without the cost of explicit simulation. However, their development has been stymied by uncertainty surrounding whether or not improved offline performance translates to improved online performance (i.e., when coupled to a large-scale general circulation model (GCM)). A key barrier has been the limited sampling of the online effects of the ML design decisions and tuning due to the complexity of performing large ensembles of hybrid physics-ML climate simulations. Our work examines the coupled behavior of full-physics ML parameterizations using large ensembles of hybrid simulations, totalling 2,970 in our case. With extensive sampling, we statistically confirm that lowering offline error lowers online error (given certain constraints). However, we also reveal that decisions decreasing online error, like removing dropout, can trade off against hybrid model stability and vice versa. Nevertheless, we are able to identify design decisions that yield unambiguous improvements to offline and online performance, namely incorporating memory and training on multiple climates. We also find that converting moisture input from specific to relative humidity enhances online stability and that using a Mean Absolute Error (MAE) loss breaks the aforementioned offline/online error relationship. By enabling rapid online experimentation at scale, we empirically answer previously unresolved questions regarding subgrid ML parameterization design.

7/8/2024

Embedding machine-learnt sub-grid variability improves climate model biases

Daniel Giles, James Briant, Cyril J. Morcrette, Serge Guillas

The under-representation of cloud formation is a long-standing bias associated with climate simulations. Parameterisation schemes are required to capture cloud processes within current climate models but have known biases. We overcome these biases by embedding a Multi-Output Gaussian Process (MOGP) trained on high resolution Unified Model simulations to represent the variability of temperature and specific humidity within a climate model. A trained MOGP model is coupled in-situ with a simplified Atmospheric General Circulation Model named SPEEDY. The temperature and specific humidity profiles of SPEEDY are perturbed at fixed intervals according to the variability predicted from the MOGP. Ten-year predictions are generated for both control and ML-hybrid models. The hybrid model reduces the global precipitation bias by 18% and over the tropics by 22%. To further understand the drivers of these improvements, physical quantities of interest are explored, such as the distribution of lifted index values and the alteration of the Hadley cell. The control and hybrid set-ups are also run in a plus 4K sea-surface temperature experiment to explore the effects of the approach on patterns relating to cloud cover and precipitation in a warmed climate setting.

6/17/2024

Towards Physically Consistent Deep Learning For Climate Model Parameterizations

Birgit Kuhbacher, Fernando Iglesias-Suarez, Niki Kilbertus, Veronika Eyring

Climate models play a critical role in understanding and projecting climate change. Due to their complexity, their horizontal resolution of about 40-100 km remains too coarse to resolve processes such as clouds and convection, which need to be approximated via parameterizations. These parameterizations are a major source of systematic errors and large uncertainties in climate projections. Deep learning (DL)-based parameterizations, trained on data from computationally expensive short, high-resolution simulations, have shown great promise for improving climate models in that regard. However, their lack of interpretability and tendency to learn spurious non-physical correlations result in reduced trust in the climate simulation. We propose an efficient supervised learning framework for DL-based parameterizations that leads to physically consistent models with improved interpretability and negligible computational overhead compared to standard supervised training. First, key features determining the target physical processes are uncovered. Subsequently, the neural network is fine-tuned using only those relevant features. We show empirically that our method robustly identifies a small subset of the inputs as actual physical drivers, therefore, removing spurious non-physical relationships. This results in by design physically consistent and interpretable neural networks while maintaining the predictive performance of unconstrained black-box DL-based parameterizations.

8/2/2024

Exploring the Potential of Hybrid Machine-Learning/Physics-Based Modeling for Atmospheric/Oceanic Prediction Beyond the Medium Range

Dhruvit Patel, Troy Arcomano, Brian Hunt, Istvan Szunyogh, Edward Ott

This paper explores the potential of a hybrid modeling approach that combines machine learning (ML) with conventional physics-based modeling for weather prediction beyond the medium range. It extends the work of Arcomano et al. (2022), which tested the approach for short- and medium-range weather prediction, and the work of Arcomano et al. (2023), which investigated its potential for climate modeling. The hybrid model used for the forecast experiments of the paper is based on the low-resolution, simplified parameterization atmospheric general circulation model (AGCM) SPEEDY. In addition to the hybridized prognostic variables of SPEEDY, the current version of the model has three purely ML-based prognostic variables. One of these is 6 h cumulative precipitation, another is the sea surface temperature, while the third is the heat content of the top 300 m deep layer of the ocean. The model has skill in predicting the El Niño cycle and its global teleconnections with precipitation for 3-7 months depending on the season. The model captures equatorial variability of the precipitation associated with Kelvin and Rossby waves and MJO. Predictions of the precipitation in the equatorial region have skill for 15 days in the East Pacific and 11.5 days in the West Pacific. Though the model has low spatial resolution, for these tasks it has prediction skill comparable to what has been published for high-resolution, purely physics-based, conventional operational forecast models.

5/31/2024