Probabilistic Forecasting with Coherent Aggregation

Read original: arXiv:2307.09797 - Published 8/7/2024 by Kin G. Olivares, Geoffrey N'egiar, Ruijun Ma, O. Nangba Meetei, Mengfei Cao, Michael W. Mahoney

✨

Overview

Accurate probabilistic forecasting is crucial in many real-world applications, such as energy management, climate forecasting, and supply chain planning.
These forecasting problems often have a hierarchical structure, and forecasting systems that respect this structure are called "coherent."
Developing accurate and coherent forecasting systems is challenging, and current methods are still being adapted for this purpose.

Plain English Explanation

The paper describes a new machine learning model called the Deep Coherent Factor Model Neural Network (DeepCoFactor) that aims to produce accurate and coherent probabilistic forecasts. Probabilistic forecasting means the model doesn't just give a single prediction, but instead provides a range of possible outcomes and the likelihood of each one.

Many real-world forecasting problems, like predicting energy demand or weather patterns, have a hierarchical structure. For example, national energy demand can be broken down into regional and local demand. Coherent forecasting means the model's predictions at different levels of the hierarchy are consistent with each other.

Achieving both accuracy and coherence is difficult with existing forecasting methods. The DeepCoFactor model uses a novel approach to generate forecasts that are coherent by design. It combines a neural network architecture with a deep Gaussian factor model, which allows the model to be optimized for various objectives, like minimizing forecast error or maximizing the probability of the observed outcomes.

The paper shows that DeepCoFactor outperforms other state-of-the-art coherent forecasting methods on several public datasets, with improvements in forecast accuracy ranging from 4.16% to 54.40%.

Technical Explanation

The DeepCoFactor model is built upon the MQForecaster neural network architecture, which has been used for hierarchical forecasting tasks. The key innovation in DeepCoFactor is the addition of a deep Gaussian factor forecasting model that ensures the generated forecasts are coherent.

The deep Gaussian factor model represents the forecasted quantities as a linear combination of latent factors, which capture the underlying patterns in the data. These factors are modeled using a Gaussian distribution, which allows the model to generate coherent samples that respect the hierarchical structure of the forecasting problem.

The DeepCoFactor model is trained end-to-end using sample-based learning objectives, such as quantile loss and the scaled Continuous Ranked Probability Score (CRPS). These objectives directly optimize the model for accurate and coherent probabilistic forecasts, rather than just point estimates.

The paper evaluates DeepCoFactor on three publicly available hierarchical forecasting datasets and compares it to other state-of-the-art coherent forecasting methods. The results show that DeepCoFactor achieves significant improvements in scaled CRPS forecast accuracy, with gains ranging from 4.16% to 54.40%.

Critical Analysis

The paper provides a thorough evaluation of the DeepCoFactor model and its performance compared to other methods. However, the authors do not discuss any potential limitations or caveats of the approach.

One potential issue is the computational complexity of the deep Gaussian factor model, which may limit the scalability of the approach to very large-scale forecasting problems. The paper also does not address how the model would perform in the presence of missing data or other real-world challenges that are common in operational forecasting scenarios.

Additionally, the paper focuses solely on the technical aspects of the model and does not explore the broader implications of accurate and coherent probabilistic forecasting, such as how it could inform decision-making in the target application domains.

Conclusion

The DeepCoFactor model presented in this paper represents a promising advance in the field of hierarchical probabilistic forecasting. By combining a neural network architecture with a deep Gaussian factor model, the authors have developed a method that can generate accurate and coherent forecasts, which is a critical requirement in many real-world applications.

The significant performance improvements demonstrated on public datasets suggest that DeepCoFactor could have a meaningful impact on operational forecasting tasks, potentially leading to better decision-making and resource allocation. However, further research is needed to address the model's scalability and robustness to real-world challenges.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

✨

Probabilistic Forecasting with Coherent Aggregation

Kin G. Olivares, Geoffrey N'egiar, Ruijun Ma, O. Nangba Meetei, Mengfei Cao, Michael W. Mahoney

Obtaining accurate probabilistic forecasts is an important operational challenge in many applications, perhaps most obviously in energy management, climate forecasting, supply chain planning, and resource allocation. In many of these applications, there is a natural hierarchical structure over the forecasted quantities; and forecasting systems that adhere to this hierarchical structure are said to be coherent. Furthermore, operational planning benefits from accuracy at all levels of the aggregation hierarchy. Building accurate and coherent forecasting systems, however, is challenging: classic multivariate time series tools and neural network methods are still being adapted for this purpose. In this paper, we augment an MQForecaster neural network architecture with a novel deep Gaussian factor forecasting model that achieves coherence by construction, yielding a method we call the Deep Coherent Factor Model Neural Network (DeepCoFactor) model. DeepCoFactor generates samples that can be differentiated with respect to model parameters, allowing optimization on various sample-based learning objectives that align with the forecasting system's goals, including quantile loss and the scaled Continuous Ranked Probability Score (CRPS). In a comparison to state-of-the-art coherent forecasting methods, DeepCoFactor achieves significant improvements in scaled CRPS forecast accuracy, with gains between 4.16 and 54.40%, as measured on three publicly available hierarchical forecasting datasets.

8/7/2024

Weather-Informed Probabilistic Forecasting and Scenario Generation in Power Systems

Hanyu Zhang, Reza Zandehshahvar, Mathieu Tanneau, Pascal Van Hentenryck

The integration of renewable energy sources (RES) into power grids presents significant challenges due to their intrinsic stochasticity and uncertainty, necessitating the development of new techniques for reliable and efficient forecasting. This paper proposes a method combining probabilistic forecasting and Gaussian copula for day-ahead prediction and scenario generation of load, wind, and solar power in high-dimensional contexts. By incorporating weather covariates and restoring spatio-temporal correlations, the proposed method enhances the reliability of probabilistic forecasts in RES. Extensive numerical experiments compare the effectiveness of different time series models, with performance evaluated using comprehensive metrics on a real-world and high-dimensional dataset from Midcontinent Independent System Operator (MISO). The results highlight the importance of weather information and demonstrate the efficacy of the Gaussian copula in generating realistic scenarios, with the proposed weather-informed Temporal Fusion Transformer (WI-TFT) model showing superior performance.

9/14/2024

🧠

Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks

Joel Oskarsson, Tomas Landelius, Marc Peter Deisenroth, Fredrik Lindsten

In recent years, machine learning has established itself as a powerful tool for high-resolution weather forecasting. While most current machine learning models focus on deterministic forecasts, accurately capturing the uncertainty in the chaotic weather system calls for probabilistic modeling. We propose a probabilistic weather forecasting model called Graph-EFM, combining a flexible latent-variable formulation with the successful graph-based forecasting framework. The use of a hierarchical graph construction allows for efficient sampling of spatially coherent forecasts. Requiring only a single forward pass per time step, Graph-EFM allows for fast generation of arbitrarily large ensembles. We experiment with the model on both global and limited area forecasting. Ensemble forecasts from Graph-EFM achieve equivalent or lower errors than comparable deterministic models, with the added benefit of accurately capturing forecast uncertainty.

6/10/2024

👁️

CaFA: Global Weather Forecasting with Factorized Attention on Sphere

Zijie Li, Anthony Zhou, Saurabh Patil, Amir Barati Farimani

Accurate weather forecasting is crucial in various sectors, impacting decision-making processes and societal events. Data-driven approaches based on machine learning models have recently emerged as a promising alternative to numerical weather prediction models given their potential to capture physics of different scales from historical data and the significantly lower computational cost during the prediction stage. Renowned for its state-of-the-art performance across diverse domains, the Transformer model has also gained popularity in machine learning weather prediction. Yet applying Transformer architectures to weather forecasting, particularly on a global scale is computationally challenging due to the quadratic complexity of attention and the quadratic increase in spatial points as resolution increases. In this work, we propose a factorized-attention-based model tailored for spherical geometries to mitigate this issue. More specifically, it utilizes multi-dimensional factorized kernels that convolve over different axes where the computational complexity of the kernel is only quadratic to the axial resolution instead of overall resolution. The deterministic forecasting accuracy of the proposed model on $1.5^circ$ and 0-7 days' lead time is on par with state-of-the-art purely data-driven machine learning weather prediction models. We also showcase the proposed model holds great potential to push forward the Pareto front of accuracy-efficiency for Transformer weather models, where it can achieve better accuracy with less computational cost compared to Transformer based models with standard attention.

5/14/2024