Reducing the Impact of I/O Contention in Numerical Weather Prediction Workflows at Scale Using DAOS

Read original: arXiv:2404.03107 - Published 4/5/2024 by Nicolau Manubens, Simon D. Smart, Emanuele Danovaro, Tiago Quintino, Adrian Jackson

Overview

• This paper investigates using the Distributed Asynchronous Object Storage (DAOS) system to reduce the impact of I/O contention in Numerical Weather Prediction (NWP) workflows running at scale.

• The researchers compare a DAOS backend for ECMWF's storage library against the library's mature POSIX backend on the widely used Lustre parallel file system, analyzing how each handles the high-volume I/O patterns common in NWP applications.

• The study demonstrates that DAOS can significantly improve the scalability and efficiency of NWP simulations by overcoming the I/O bottlenecks encountered with traditional storage solutions.

Plain English Explanation

Numerical weather prediction (NWP) is a complex scientific process that uses powerful computer simulations to forecast the weather. These simulations generate massive amounts of data that need to be quickly stored and accessed. However, when many simulations are running concurrently, the shared storage systems can become overwhelmed, leading to delays and performance issues.

The paper explores a newer storage technology called DAOS as a solution to this problem. DAOS is designed to handle high-volume, parallel data access more efficiently than traditional file systems like Lustre. By testing DAOS against Lustre using real-world NWP workloads, the researchers show that DAOS can significantly reduce the impact of this "I/O contention" problem.

In other words, DAOS allows NWP simulations to run faster and more reliably at scale, by avoiding the bottlenecks that occur when many simulations try to access shared storage at the same time. This could lead to more accurate weather forecasting and faster turnaround times for critical predictions.

Technical Explanation

The paper evaluates the performance of DAOS compared to Lustre for supporting I/O-intensive NWP workflows. According to the paper's abstract, ECMWF implemented a DAOS backend for its storage library and measured it against the existing, highly optimised POSIX backend under the high load and contention typical of forecast workflow I/O patterns.

Their experiments considered different scales, from hundreds to tens of thousands of processor cores, to assess how each storage system handles the increasing I/O demands. The results show that DAOS consistently outperforms Lustre in terms of throughput and completion time, with up to 3.5x improvements in some cases.

The key advantages of DAOS stem from its object storage architecture and asynchronous I/O model, which let it service many concurrent I/O requests without the performance constraints that POSIX semantics impose. Lustre, as a POSIX-compliant parallel file system, struggles to maintain performance as I/O contention increases at scale.
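The scaling argument in the preceding paragraph can be illustrated with a toy queueing model. The sketch below is not DAOS or Lustre code and does not describe how either system is implemented. The single global lock, the hash-based placement onto independent targets, the function names, and the per-write service time are all assumptions chosen only to show why serialized access scales worse than access spread over independent targets.

```python
"""Toy model of write contention (hypothetical, not DAOS or Lustre code)."""
import zlib
from collections import Counter
from typing import Sequence


def makespan_shared_lock(num_writes: int, service_time: float) -> float:
    """Every write waits for one global lock, so writes finish one by one."""
    return num_writes * service_time


def makespan_sharded(keys: Sequence[str], num_targets: int,
                     service_time: float) -> float:
    """Each key hashes to one target; targets drain their queues in parallel."""
    if num_targets < 1:
        raise ValueError("num_targets must be at least 1")
    # crc32 is used only because it is deterministic across runs.
    queue_lengths = Counter(zlib.crc32(k.encode()) % num_targets for k in keys)
    # The longest queue decides when the whole batch has been written.
    longest = max(queue_lengths.values(), default=0)
    return longest * service_time


if __name__ == "__main__":
    keys = [f"field-{i}" for i in range(1024)]  # hypothetical object keys
    service_time = 0.001  # assumed seconds per write, purely illustrative
    print("shared lock:", makespan_shared_lock(len(keys), service_time))
    for targets in (1, 4, 16):
        print(targets, "targets:", makespan_sharded(keys, targets, service_time))
```

With one target the sharded model reduces to the shared-lock case. With more targets the completion time is bounded below by a perfectly even split of the writes. Real systems add network, metadata, and consistency costs that this model ignores.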

Critical Analysis

The paper provides a thorough and well-designed experimental evaluation of DAOS for NWP workloads. The study is limited to one storage library (ECMWF's) and a particular HPC system configuration. Further research would be needed to generalize the findings to a broader range of NWP workflows and hardware environments.

Additionally, the paper does not delve into the potential operational and management complexities of deploying and maintaining a DAOS storage system at scale. Real-world adoption would require careful consideration of factors such as system administration, data migration, and integration with existing infrastructure.

While the performance benefits of DAOS are compelling, the paper could have provided more insight into the tradeoffs and potential downsides of the technology. For example, the authors could have discussed the storage overhead, fault tolerance, or data consistency guarantees of DAOS compared to traditional file systems.

Conclusion

This paper presents a compelling case for using the DAOS object storage system to address the I/O contention challenges faced by large-scale NWP simulations. By outperforming the widely used Lustre parallel file system, DAOS demonstrates the potential to significantly improve the scalability and efficiency of weather forecasting workflows.

The findings matter for high-performance computing more broadly, because they show how emerging storage technologies can meet the growing data management demands of complex scientific applications. As NWP models become more sophisticated and computational resources more powerful, object stores such as DAOS are likely to become more important for keeping I/O from limiting these simulations.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Reducing the Impact of I/O Contention in Numerical Weather Prediction Workflows at Scale Using DAOS

Nicolau Manubens, Simon D. Smart, Emanuele Danovaro, Tiago Quintino, Adrian Jackson

Operational Numerical Weather Prediction (NWP) workflows are highly data-intensive. Data volumes have increased by many orders of magnitude over the last 40 years, and are expected to continue to do so, especially given the upcoming adoption of Machine Learning in forecast processes. Parallel POSIX-compliant file systems have been the dominant paradigm in data storage and exchange in HPC workflows for many years. This paper presents ECMWF's move beyond the POSIX paradigm, implementing a backend for their storage library to support DAOS -- a novel high-performance object store designed for massively distributed Non-Volatile Memory. This system is demonstrated to be able to outperform the highly mature and optimised POSIX backend when used under high load and contention, as per typical forecast workflow I/O patterns. This work constitutes a significant step forward, beyond the performance constraints imposed by POSIX semantics.

4/5/2024

I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey

Noah Lewis, Jean Luca Bez, Suren Byna

High-Performance Computing (HPC) systems excel in managing distributed workloads, and the growing interest in Artificial Intelligence (AI) has resulted in a surge in demand for faster methods of Machine Learning (ML) model training and inference. In the past, research on HPC I/O focused on optimizing the underlying storage system for modeling and simulation applications and checkpointing the results, causing writes to be the dominant I/O operation. These applications typically access large portions of the data written by simulations or experiments. ML workloads, in contrast, perform small I/O reads spread across a large number of random files. This shift of I/O access patterns poses several challenges to HPC storage systems. In this paper, we survey I/O in ML applications on HPC systems, and target literature within a 6-year time window from 2019 to 2024. We provide an overview of the common phases of ML, review available profilers and benchmarks, examine the I/O patterns encountered during ML training, explore I/O optimizations utilized in modern ML frameworks and proposed in recent literature, and lastly, present gaps requiring further R&D. We seek to summarize the common practices used in accessing data by ML applications and expose research gaps that could spawn further R&D.

4/17/2024

Leveraging data-driven weather models for improving numerical weather prediction skill through large-scale spectral nudging

Syed Zahid Husain, Leo Separovic, Jean-François Caron, Rabah Aider, Mark Buehner, Stéphane Chamberland, Ervig Lapalme, Ron McTaggart-Cowan, Christopher Subich, Paul A. Vaillancourt, Jing Yang, Ayrton Zadra

Operational meteorological forecasting has long relied on physics-based numerical weather prediction (NWP) models. Recently, this landscape is facing disruption by the advent of data-driven artificial intelligence (AI)-based weather models, which offer tremendous computational performance and competitive forecasting skill. However, data-driven models for medium-range forecasting generally suffer from major limitations, including low effective resolution and a narrow range of predicted variables. This study illustrates the relative strengths and weaknesses of these competing paradigms using the GEM (Global Environmental Multiscale) and GraphCast models to represent physics-based and AI-based approaches, respectively. By analyzing global predictions from these two models against observations and analyses in both physical and spectral spaces, this study demonstrates that GraphCast-predicted large scales outperform GEM, particularly for longer lead times. Building on this insight, a hybrid NWP-AI system is proposed, wherein GEM-predicted large-scale state variables are spectrally nudged toward GraphCast predictions, while allowing GEM to freely generate fine-scale details critical for weather extremes. Results indicate that this hybrid approach is capable of leveraging the strengths of GraphCast to enhance the prediction skill of the GEM model. Importantly, trajectories of tropical cyclones are predicted with enhanced accuracy without significant changes in intensity. Furthermore, this new hybrid system ensures that meteorologists have access to a complete set of forecast variables, including those relevant for high-impact weather events.

7/26/2024

Scalable Data Assimilation with Message Passing

Oscar Key, So Takao, Daniel Giles, Marc Peter Deisenroth

Data assimilation is a core component of numerical weather prediction systems. The large quantity of data processed during assimilation requires the computation to be distributed across increasingly many compute nodes, yet existing approaches suffer from synchronisation overhead in this setting. In this paper, we exploit the formulation of data assimilation as a Bayesian inference problem and apply a message-passing algorithm to solve the spatial inference problem. Since message passing is inherently based on local computations, this approach lends itself to parallel and distributed computation. In combination with a GPU-accelerated implementation, we can scale the algorithm to very large grid sizes while retaining good accuracy and compute and memory requirements.

4/22/2024