Aurora: A Foundation Model of the Atmosphere

2405.13063

Published 5/29/2024 by Cristian Bodnar, Wessel P. Bruinsma, Ana Lucic, Megan Stanley, Johannes Brandstetter, Patrick Garvan, Maik Riechert, Jonathan Weyn, Haiyu Dong, Anna Vaughan and 7 others

cs.LG

📈

Abstract

Deep learning foundation models are revolutionizing many facets of science by leveraging vast amounts of data to learn general-purpose representations that can be adapted to tackle diverse downstream tasks. Foundation models hold the promise to also transform our ability to model our planet and its subsystems by exploiting the vast expanse of Earth system data. Here we introduce Aurora, a large-scale foundation model of the atmosphere trained on over a million hours of diverse weather and climate data. Aurora leverages the strengths of the foundation modelling approach to produce operational forecasts for a wide variety of atmospheric prediction problems, including those with limited training data, heterogeneous variables, and extreme events. In under a minute, Aurora produces 5-day global air pollution predictions and 10-day high-resolution weather forecasts that outperform state-of-the-art classical simulation tools and the best specialized deep learning models. Taken together, these results indicate that foundation models can transform environmental forecasting.

Create account to get full access

Overview

Foundation models, trained on vast amounts of data, are revolutionizing many scientific fields by learning general-purpose representations that can be adapted to diverse downstream tasks.
The paper introduces Aurora, a large-scale foundation model of the atmosphere trained on over a million hours of weather and climate data.
Aurora can produce operational forecasts for a wide variety of atmospheric prediction problems, outperforming state-of-the-art classical simulation tools and specialized deep learning models.

Plain English Explanation

Foundation models are a powerful new approach in machine learning that involves training a single, general-purpose model on a huge amount of diverse data. This model can then be adapted to tackle a wide variety of specific problems, rather than having to build a new model from scratch for each task.

The researchers have created a foundation model called Aurora that has been trained on over a million hours of weather and climate data. This allows Aurora to make predictions about all kinds of atmospheric conditions, from air pollution to high-resolution weather forecasts. Remarkably, Aurora can do this in less than a minute, and its predictions are better than the current state-of-the-art tools used for these tasks.

The key advantage of Aurora is its versatility. Rather than having separate, specialized models for different atmospheric prediction problems, Aurora can handle them all using the general knowledge it has learned from the vast training dataset. This represents a significant breakthrough that could transform environmental forecasting by making it much faster, more accurate, and more comprehensive.

Technical Explanation

The paper introduces Aurora, a large-scale foundation model of the atmosphere trained on over a million hours of diverse weather and climate data. Aurora leverages the strengths of the foundation modeling approach to produce operational forecasts for a wide variety of atmospheric prediction problems, including those with limited training data, heterogeneous variables, and extreme events.

The researchers demonstrate that in under a minute, Aurora can produce 5-day global air pollution predictions and 10-day high-resolution weather forecasts that outperform state-of-the-art classical simulation tools and the best specialized deep learning models. This is achieved by Aurora's ability to learn general-purpose representations that can be efficiently adapted to diverse downstream tasks, as opposed to traditional approaches that require building separate models for each task.

The paper also shows that Aurora's performance continues to improve as more data is added to the training set, indicating the potential for further advancements in environmental forecasting through the use of large-scale foundation models. These results suggest that foundation models can transform the field of environmental forecasting by providing a unified, flexible, and high-performing solution.

Critical Analysis

The paper provides compelling evidence that foundation models can be highly effective for environmental forecasting tasks. However, the research also acknowledges several caveats and areas for further study.

One limitation is that the performance of Aurora was only evaluated on a limited set of atmospheric prediction problems. The suitability of foundation models may vary depending on the specific task and dataset, and further research is needed to understand their broader applicability in the environmental sciences.

Additionally, the paper does not address potential issues around the interpretability and explainability of Aurora's predictions, which may be an important consideration for mission-critical applications like weather forecasting. The environmental impacts of the large computational resources required to train and run such a model are also not discussed.

Overall, the research represents a significant step forward in the use of foundation models for environmental modeling and forecasting. However, continued investigation and careful consideration of the technology's limitations and ethical implications will be necessary to fully realize its potential benefits.

Conclusion

This paper demonstrates the remarkable capabilities of Aurora, a large-scale foundation model of the atmosphere that can produce highly accurate and versatile forecasts for a wide range of environmental prediction tasks. By leveraging the general-purpose representations learned from a massive training dataset, Aurora outperforms specialized models and classical simulation tools, suggesting that foundation models have the potential to transform environmental forecasting.

While the research highlights the significant advantages of this approach, it also underscores the need for further investigation into the limitations and ethical considerations of such powerful AI systems. As foundation models become more prevalent in scientific domains, it will be crucial to carefully evaluate their suitability, interpretability, and environmental impact to ensure they are developed and deployed responsibly.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

📈

ORBIT: Oak Ridge Base Foundation Model for Earth System Predictability

Xiao Wang, Aristeidis Tsaris, Siyan Liu, Jong-Youl Choi, Ming Fan, Wei Zhang, Junqi Yin, Moetasim Ashfaq, Dan Lu, Prasanna Balaprakash

Earth system predictability is challenged by the complexity of environmental dynamics and the multitude of variables involved. Current AI foundation models, although advanced by leveraging large and heterogeneous data, are often constrained by their size and data integration, limiting their effectiveness in addressing the full range of Earth system prediction challenges. To overcome these limitations, we introduce the Oak Ridge Base Foundation Model for Earth System Predictability (ORBIT), an advanced vision-transformer model that scales up to 113 billion parameters using a novel hybrid tensor-data orthogonal parallelism technique. As the largest model of its kind, ORBIT surpasses the current climate AI foundation model size by a thousandfold. Performance scaling tests conducted on the Frontier supercomputer have demonstrated that ORBIT achieves 230 to 707 PFLOPS, with scaling efficiency maintained at 78% to 96% across 24,576 AMD GPUs. These breakthroughs establish new advances in AI-driven climate modeling and demonstrate promise to significantly improve the Earth system predictability.

4/24/2024

cs.AI cs.DC eess.IV

On the Foundations of Earth and Climate Foundation Models

Xiao Xiang Zhu, Zhitong Xiong, Yi Wang, Adam J. Stewart, Konrad Heidler, Yuanyuan Wang, Zhenghang Yuan, Thomas Dujardin, Qingsong Xu, Yilei Shi

Foundation models have enormous potential in advancing Earth and climate sciences, however, current approaches may not be optimal as they focus on a few basic features of a desirable Earth and climate foundation model. Crafting the ideal Earth foundation model, we define eleven features which would allow such a foundation model to be beneficial for any geoscientific downstream application in an environmental- and human-centric manner.We further shed light on the way forward to achieve the ideal model and to evaluate Earth foundation models. What comes after foundation models? Energy efficient adaptation, adversarial defenses, and interpretability are among the emerging directions.

5/8/2024

cs.AI eess.SP

Neural Plasticity-Inspired Multimodal Foundation Model for Earth Observation

Zhitong Xiong, Yi Wang, Fahong Zhang, Adam J. Stewart, Joelle Hanna, Damian Borth, Ioannis Papoutsis, Bertrand Le Saux, Gustau Camps-Valls, Xiao Xiang Zhu

The development of foundation models has revolutionized our ability to interpret the Earth's surface using satellite observational data. Traditional models have been siloed, tailored to specific sensors or data types like optical, radar, and hyperspectral, each with its own unique characteristics. This specialization hinders the potential for a holistic analysis that could benefit from the combined strengths of these diverse data sources. Our novel approach introduces the Dynamic One-For-All (DOFA) model, leveraging the concept of neural plasticity in brain science to integrate various data modalities into a single framework adaptively. This dynamic hypernetwork, adjusting to different wavelengths, enables a single versatile Transformer jointly trained on data from five sensors to excel across 12 distinct Earth observation tasks, including sensors never seen during pretraining. DOFA's innovative design offers a promising leap towards more accurate, efficient, and unified Earth observation analysis, showcasing remarkable adaptability and performance in harnessing the potential of multimodal Earth observation data.

6/10/2024

cs.CV

Urban Air Pollution Forecasting: a Machine Learning Approach leveraging Satellite Observations and Meteorological Forecasts

Giacomo Blanco, Luca Barco, Lorenzo Innocenti, Claudio Rossi

Air pollution poses a significant threat to public health and well-being, particularly in urban areas. This study introduces a series of machine-learning models that integrate data from the Sentinel-5P satellite, meteorological conditions, and topological characteristics to forecast future levels of five major pollutants. The investigation delineates the process of data collection, detailing the combination of diverse data sources utilized in the study. Through experiments conducted in the Milan metropolitan area, the models demonstrate their efficacy in predicting pollutant levels for the forthcoming day, achieving a percentage error of around 30%. The proposed models are advantageous as they are independent of monitoring stations, facilitating their use in areas without existing infrastructure. Additionally, we have released the collected dataset to the public, aiming to stimulate further research in this field. This research contributes to advancing our understanding of urban air quality dynamics and emphasizes the importance of amalgamating satellite, meteorological, and topographical data to develop robust pollution forecasting models.

5/31/2024

cs.LG