Embedding machine-learnt sub-grid variability improves climate model biases

2406.09551

Published 6/17/2024 by Daniel Giles, James Briant, Cyril J. Morcrette, Serge Guillas
Abstract

The under-representation of cloud formation is a long-standing bias associated with climate simulations. Parameterisation schemes are required to capture cloud processes within current climate models but have known biases. We overcome these biases by embedding a Multi-Output Gaussian Process (MOGP) trained on high resolution Unified Model simulations to represent the variability of temperature and specific humidity within a climate model. A trained MOGP model is coupled in-situ with a simplified Atmospheric General Circulation Model named SPEEDY. The temperature and specific humidity profiles of SPEEDY are perturbed at fixed intervals according to the variability predicted from the MOGP. Ten-year predictions are generated for both control and ML-hybrid models. The hybrid model reduces the global precipitation bias by 18% and over the tropics by 22%. To further understand the drivers of these improvements, physical quantities of interest are explored, such as the distribution of lifted index values and the alteration of the Hadley cell. The control and hybrid set-ups are also run in a plus 4K sea-surface temperature experiment to explore the effects of the approach on patterns relating to cloud cover and precipitation in a warmed climate setting.

Data generation, coarse-graining and training design

This section describes how the researchers generated data, processed it to a coarser resolution, and designed the machine learning model training.

The researchers used a high-resolution climate model, the Unified Model, to simulate the atmosphere and land surface at a fine spatial scale. They then averaged this high-resolution output onto a coarser grid that mimics the typical resolution of global climate models. The spread of the high-resolution temperature and specific humidity values inside each coarse grid cell is the sub-grid variability that the machine learning model is trained to predict.
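The coarse-graining step can be illustrated with a minimal sketch. It assumes square, non-overlapping blocks and uses the population standard deviation within each block as the measure of sub-grid variability; the function name `coarse_grain`, the block size and the sample values are all hypothetical and are not taken from the paper.

```python
from statistics import fmean, pstdev


def coarse_grain(field, block):
    """Tile a 2-D grid into block x block cells; return per-cell means and stds."""
    n_rows, n_cols = len(field), len(field[0])
    if n_rows % block or n_cols % block:
        raise ValueError("grid dimensions must be divisible by the block size")
    means, stds = [], []
    for i in range(0, n_rows, block):
        mean_row, std_row = [], []
        for j in range(0, n_cols, block):
            # Gather every fine-grid value that falls inside this coarse cell.
            tile = [field[r][c]
                    for r in range(i, i + block)
                    for c in range(j, j + block)]
            mean_row.append(fmean(tile))   # coarse-grained value
            std_row.append(pstdev(tile))   # sub-grid variability (assumed metric)
        means.append(mean_row)
        stds.append(std_row)
    return means, stds


# Synthetic 4 x 4 temperature field in kelvin, for illustration only.
fine = [
    [280.0, 282.0, 290.0, 290.0],
    [284.0, 286.0, 290.0, 290.0],
    [300.0, 300.0, 270.0, 274.0],
    [300.0, 300.0, 278.0, 282.0],
]
means, stds = coarse_grain(fine, 2)
print(means)
print(stds)
```

In this toy case the uniform blocks have zero spread, while the two blocks with internal gradients keep a non-zero standard deviation that plain averaging would discard.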

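The machine learning model, a Multi-Output Gaussian Process (MOGP), was trained to learn the relationship between the coarse-grained data and the underlying high-resolution variability. The trained model was then embedded within SPEEDY, a simplified atmospheric general circulation model, where it perturbs the temperature and specific humidity profiles at fixed intervals to represent sub-grid scale processes that the coarse model does not resolve.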
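The coupling pattern described above, a host model stepped forward with its profiles perturbed at fixed intervals, might look roughly like the sketch below. Everything in it is an assumption for illustration: `predicted_std` is a hypothetical stand-in for the trained MOGP, `toy_model_step` is a placeholder rather than SPEEDY, and drawing zero-mean Gaussian noise scaled by the predicted standard deviation is one plausible reading of "perturbed according to the variability", not the authors' confirmed scheme.

```python
import random


def predicted_std(temperature, humidity):
    """Hypothetical stand-in for the trained MOGP: per-level (sigma_T, sigma_q)."""
    return [0.5 for _ in temperature], [0.1 * q for q in humidity]


def toy_model_step(temperature, humidity):
    """Placeholder host-model step (NOT SPEEDY): relax temperature towards 280 K."""
    return [t + 0.1 * (280.0 - t) for t in temperature], list(humidity)


def run_hybrid(temperature, humidity, n_steps, perturb_every, seed=0):
    """Advance the toy model, perturbing the profiles every `perturb_every` steps."""
    rng = random.Random(seed)
    n_perturbations = 0
    for step in range(1, n_steps + 1):
        temperature, humidity = toy_model_step(temperature, humidity)
        if step % perturb_every == 0:
            sigma_t, sigma_q = predicted_std(temperature, humidity)
            # Assumed perturbation: zero-mean Gaussian noise with predicted spread.
            temperature = [t + rng.gauss(0.0, s)
                           for t, s in zip(temperature, sigma_t)]
            # Clip at zero so specific humidity never becomes negative.
            humidity = [max(0.0, q + rng.gauss(0.0, s))
                        for q, s in zip(humidity, sigma_q)]
            n_perturbations += 1
    return temperature, humidity, n_perturbations


# Synthetic three-level column: temperature in K, specific humidity in kg/kg.
t_final, q_final, n_pert = run_hybrid(
    [300.0, 270.0, 230.0], [0.015, 0.004, 0.0005], n_steps=12, perturb_every=4
)
print(n_pert, t_final, q_final)
```

Keeping the predictor behind a single function call means the stand-in could be swapped for a real trained model without touching the time-stepping loop.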

Global mean precipitation over 10-year simulation (1982-1992)

This section compares the global mean precipitation simulated by the climate model with and without the embedded machine learning component.

The results show that the hybrid model, with the embedded machine learning component, matches the observed global mean precipitation more closely than the control model. According to the abstract, the global precipitation bias falls by 18%. The improvement is attributed to the machine learning component representing sub-grid scale variability that the control model lacks.
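A percentage bias reduction of this kind can be computed as in the sketch below. The metric is an assumption: it uses the cosine-latitude weighted mean absolute error against a reference field, which may differ from the paper's definition of bias. The function names and all numbers are synthetic and hypothetical.

```python
import math


def area_weighted_mean(field, lats_deg):
    """Mean of a lat x lon field, weighting each latitude row by cos(latitude)."""
    total, weight_sum = 0.0, 0.0
    for row, lat in zip(field, lats_deg):
        w = math.cos(math.radians(lat))
        total += w * sum(row) / len(row)
        weight_sum += w
    return total / weight_sum


def mean_abs_bias(model, reference, lats_deg):
    """Area-weighted mean absolute difference between model and reference."""
    abs_diff = [[abs(m - r) for m, r in zip(m_row, r_row)]
                for m_row, r_row in zip(model, reference)]
    return area_weighted_mean(abs_diff, lats_deg)


def bias_reduction_percent(control, hybrid, reference, lats_deg):
    """Percentage by which the hybrid run lowers the control run's bias."""
    control_bias = mean_abs_bias(control, reference, lats_deg)
    hybrid_bias = mean_abs_bias(hybrid, reference, lats_deg)
    return 100.0 * (1.0 - hybrid_bias / control_bias)


# Synthetic precipitation fields in mm/day on a 3 x 2 grid, illustration only.
lats = [-60.0, 0.0, 60.0]
reference = [[3.0, 3.0], [3.0, 3.0], [3.0, 3.0]]
control = [[4.0, 4.0], [4.0, 4.0], [4.0, 4.0]]
hybrid = [[3.5, 3.5], [3.5, 3.5], [3.5, 3.5]]
reduction = bias_reduction_percent(control, hybrid, reference, lats)
print(reduction)
```

The cosine weighting matters on a regular latitude-longitude grid because rows near the poles cover far less area than rows near the equator.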

Evaluation of regional precipitation biases

The researchers evaluated how well the climate model with the embedded machine learning component performed at simulating precipitation patterns across different regions of the globe.

They found that the machine learning component reduced biases in simulated precipitation relative to the control model, with a 22% reduction over the tropics reported in the abstract. Improvements were also noted in regions with complex topography or other challenging features. The machine learning component captured more of the underlying high-resolution variability that the coarse-resolution climate model misses on its own.
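A regional breakdown of this sort can be sketched by restricting the area-weighted average to latitude bands. The band limits below, including 30 degrees as the edge of the tropics, are assumptions for illustration, as are the function names and the synthetic data; the paper's own region definitions are not given in this summary.

```python
import math


def band_mean(field, lats_deg, lat_min, lat_max):
    """Cosine-latitude weighted mean over rows with lat_min <= lat <= lat_max."""
    total, weight_sum = 0.0, 0.0
    for row, lat in zip(field, lats_deg):
        if lat_min <= lat <= lat_max:
            w = math.cos(math.radians(lat))
            total += w * sum(row) / len(row)
            weight_sum += w
    if weight_sum == 0.0:
        raise ValueError("no grid rows fall inside the requested band")
    return total / weight_sum


def regional_bias(model, reference, lats_deg, bands):
    """Signed mean bias (model minus reference) for each named latitude band."""
    diff = [[m - r for m, r in zip(m_row, r_row)]
            for m_row, r_row in zip(model, reference)]
    return {name: band_mean(diff, lats_deg, lo, hi)
            for name, (lo, hi) in bands.items()}


# Hypothetical band limits and synthetic precipitation in mm/day.
bands = {"tropics": (-30.0, 30.0), "northern extratropics": (30.0, 90.0)}
lats = [-45.0, -15.0, 15.0, 45.0]
reference = [[3.0, 3.0]] * 4
model = [[2.5, 2.5], [4.0, 4.0], [5.0, 5.0], [3.5, 3.5]]
biases = regional_bias(model, reference, lats, bands)
print(biases)
```

A signed bias is used here so that wet and dry errors stay distinguishable by region; an absolute-error variant would suit a single summary score better.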

Implications and future directions

This research demonstrates the potential for embedding machine learning models within climate models to improve their performance. By learning and representing important sub-grid scale processes, the machine learning component was able to reduce common biases in climate model simulations.

Going forward, the researchers suggest that further advances in machine learning and climate modeling could lead to even more substantial improvements in our ability to accurately simulate the Earth's climate. Continued collaboration between the climate science and machine learning communities will be crucial in realizing this potential.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Exploring the Potential of Hybrid Machine-Learning/Physics-Based Modeling for Atmospheric/Oceanic Prediction Beyond the Medium Range

Dhruvit Patel, Troy Arcomano, Brian Hunt, Istvan Szunyogh, Edward Ott

This paper explores the potential of a hybrid modeling approach that combines machine learning (ML) with conventional physics-based modeling for weather prediction beyond the medium range. It extends the work of Arcomano et al. (2022), which tested the approach for short- and medium-range weather prediction, and the work of Arcomano et al. (2023), which investigated its potential for climate modeling. The hybrid model used for the forecast experiments of the paper is based on the low-resolution, simplified parameterization atmospheric general circulation model (AGCM) SPEEDY. In addition to the hybridized prognostic variables of SPEEDY, the current version of the model has three purely ML-based prognostic variables. One of these is 6 h cumulative precipitation, another is the sea surface temperature, while the third is the heat content of the top 300 m deep layer of the ocean. The model has skill in predicting the El Niño cycle and its global teleconnections with precipitation for 3-7 months depending on the season. The model captures equatorial variability of the precipitation associated with Kelvin and Rossby waves and MJO. Predictions of the precipitation in the equatorial region have skill for 15 days in the East Pacific and 11.5 days in the West Pacific. Though the model has low spatial resolution, for these tasks it has prediction skill comparable to what has been published for high-resolution, purely physics-based, conventional operational forecast models.

5/31/2024

Conditional diffusion models for downscaling & bias correction of Earth system model precipitation

Michael Aich, Philipp Hess, Baoxiang Pan, Sebastian Bathiany, Yu Huang, Niklas Boers

Climate change exacerbates extreme weather events like heavy rainfall and flooding. As these events cause severe losses of property and lives, accurate high-resolution simulation of precipitation is imperative. However, existing Earth System Models (ESMs) struggle with resolving small-scale dynamics and suffer from biases, especially for extreme events. Traditional statistical bias correction and downscaling methods fall short in improving spatial structure, while recent deep learning methods lack controllability over the output and suffer from unstable training. Here, we propose a novel machine learning framework for simultaneous bias correction and downscaling. We train a generative diffusion model in a supervised way purely on observational data. We map observational and ESM data to a shared embedding space, where both are unbiased towards each other and train a conditional diffusion model to reverse the mapping. Our method can be used to correct any ESM field, as the training is independent of the ESM. Our approach ensures statistical fidelity, preserves large-scale spatial patterns and outperforms existing methods especially regarding extreme events and small-scale spatial features that are crucial for impact assessments.

4/24/2024

Probabilistic Emulation of a Global Climate Model with Spherical DYffusion

Salva Ruhling Cachay, Brian Henn, Oliver Watt-Meyer, Christopher S. Bretherton, Rose Yu

Data-driven deep learning models are on the verge of transforming global weather forecasting. It is an open question if this success can extend to climate modeling, where long inference rollouts and data complexity pose significant challenges. Here, we present the first conditional generative model able to produce global climate ensemble simulations that are accurate and physically consistent. Our model runs at 6-hourly time steps and is shown to be stable for 10-year-long simulations. Our approach beats relevant baselines and nearly reaches a gold standard for successful climate model emulation. We discuss the key design choices behind our dynamics-informed diffusion model-based approach which enables this significant step towards efficient, data-driven climate simulations that can help us better understand the Earth and adapt to a changing climate.

6/24/2024

Diffusion-Based Joint Temperature and Precipitation Emulation of Earth System Models

Katie Christensen, Lyric Otto, Seth Bassetti, Claudia Tebaldi, Brian Hutchinson

Earth system models (ESMs) are the principal tools used in climate science to generate future climate projections under various atmospheric emissions scenarios on a global or regional scale. Generative deep learning approaches are suitable for emulating these tools due to their computational efficiency and ability, once trained, to generate realizations in a fraction of the time required by ESMs. We extend previous work that used a generative probabilistic diffusion model to emulate ESMs by targeting the joint emulation of multiple variables, temperature and precipitation, by a single diffusion model. Joint generation of multiple variables is critical to generate realistic samples of phenomena resulting from the interplay of multiple variables. The diffusion model emulator takes in the monthly mean-maps of temperature and precipitation and produces the daily values of each of these variables that exhibit statistical properties similar to those generated by ESMs. Our results show the outputs from our extended model closely resemble those from ESMs on various climate metrics including dry spells and hot streaks, and that the joint distribution of temperature and precipitation in our sample closely matches those of ESMs.

4/16/2024