Bayesian Vector AutoRegression with Factorised Granger-Causal Graphs

Read original: arXiv:2402.03614 - Published 5/27/2024 by He Zhao, Vassili Kitsios, Terence J. O'Kane, Edwin V. Bonilla

Overview

• This paper introduces a novel Poisson factorized Granger-causal graph model for understanding the causal relationships between different variables in complex systems.

• The model combines Granger causality, a widely used approach for inferring causal relationships from time series data, with a Poisson factorization method to handle high-dimensional datasets efficiently.

• The authors evaluate their approach on synthetic, semi-synthetic, and climate data, reporting better performance than competing approaches, especially when few observations are available.

Plain English Explanation

The paper presents a new way to analyze causal relationships in complex systems using time series data. The key idea is to combine two existing techniques: Granger causality and Poisson factorization.

Granger causality is a statistical method that looks at how past values of one variable predict the future values of another variable, which can suggest a causal relationship. Poisson factorization is a way to efficiently model high-dimensional data, like the kind you might have in a complex system with many interacting variables.

By bringing these two approaches together, the authors created a model that can uncover causal connections in large, complicated datasets. They tested it on synthetic, semi-synthetic, and climate data and report that it outperforms competing approaches, particularly when observations are scarce.

The benefit of this new method is that it allows researchers to better understand the underlying causal structure of complex phenomena, which is important for things like decision-making, forecasting, and intervention planning. It provides a more principled way to infer causality from observational data compared to previous approaches.

Technical Explanation

The core of the paper is the Poisson Factorised Granger-Causal Graph (PFGCG) model, which combines Granger causality with Poisson factorization to capture causal relationships in high-dimensional time series data.

The Granger causality component of the model looks at how past values of one variable (the "cause") help predict future values of another variable (the "effect"), beyond what can be predicted from the effect's own past values alone. This suggests a causal influence.
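The definition above can be illustrated with a minimal sketch, assuming a plain least-squares autoregression rather than the paper's Bayesian model. The helper names (`residual_variance`, `granger_gain`) and the simulated series are hypothetical and chosen only for illustration. The sketch compares how well the effect is predicted from its own past alone versus its own past plus the past of the candidate cause.

```python
import numpy as np

def residual_variance(target, predictors):
    """Fit target on predictors (plus an intercept) by least squares
    and return the mean squared residual."""
    X = np.column_stack([np.ones(len(target)), predictors])
    coef, *_ = np.linalg.lstsq(X, target, rcond=None)
    resid = target - X @ coef
    return float(np.mean(resid ** 2))

def granger_gain(cause, effect, lag=1):
    """Relative drop in the effect's one-step prediction error when lagged
    values of the cause are added to the effect's own lagged values."""
    T = len(effect)
    y = effect[lag:]
    own_past = np.column_stack([effect[lag - k:T - k] for k in range(1, lag + 1)])
    cause_past = np.column_stack([cause[lag - k:T - k] for k in range(1, lag + 1)])
    restricted = residual_variance(y, own_past)
    full = residual_variance(y, np.column_stack([own_past, cause_past]))
    return 1.0 - full / restricted

# Simulated example (an assumption for illustration): x drives y with a one-step delay.
rng = np.random.default_rng(0)
T = 2000
x = rng.normal(size=T)
y = np.zeros(T)
for t in range(1, T):
    y[t] = 0.5 * y[t - 1] + 0.8 * x[t - 1] + 0.1 * rng.normal()

gain_xy = granger_gain(x, y)  # large: the past of x helps predict y
gain_yx = granger_gain(y, x)  # near zero: the past of y does not help predict x
print(f"x -> y gain: {gain_xy:.3f}, y -> x gain: {gain_yx:.3f}")
```

A large gain in one direction and a negligible gain in the other is what a Granger-causal edge from x to y looks like in this simplified setting. A formal test would add a significance threshold, which is omitted here.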

The Poisson factorization part of the model is used to efficiently represent the high-dimensional time series data as a low-rank matrix, enabling the method to scale to large problems. Specifically, the authors use a Poisson Matrix Factorization (PMF) approach, which models the observed data as Poisson-distributed.
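As a rough illustration of the low-rank Poisson idea described above, the following sketch draws a count matrix whose Poisson rates are the product of two small nonnegative factor matrices. The sizes, the gamma priors, and the variable names are assumptions made for this example and are not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n_rows, n_cols, n_factors = 30, 20, 3  # hypothetical sizes

# Nonnegative factors; gamma priors are a common choice in Poisson factorization.
theta = rng.gamma(shape=1.0, scale=1.0, size=(n_rows, n_factors))
phi = rng.gamma(shape=1.0, scale=1.0, size=(n_cols, n_factors))

# The rate matrix has rank at most n_factors, regardless of its full size.
rate = theta @ phi.T

# Each observed entry is modelled as a Poisson draw around its rate.
counts = rng.poisson(rate)

full_params = n_rows * n_cols
factor_params = (n_rows + n_cols) * n_factors
print(f"full matrix: {full_params} parameters, factorised: {factor_params} parameters")
```

The parameter count grows with the sum of the two dimensions rather than their product, which is where the scalability of a factorised representation comes from.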

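The PFGCG model then combines these two elements, using the Granger causality framework to infer directed edges in a causal graph while relying on the PMF representation to handle high-dimensional data. According to the paper's abstract, the graph is binary and has its own hierarchical factorised prior, separate from the VAR coefficients, and the authors develop an efficient algorithm to infer the posterior over it.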
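One way to write down a VAR model with a separate binary graph is sketched below. The notation is an assumption made for illustration and is not taken from the paper: N series, maximum lag L, coefficient matrices A_\ell, and a binary mask G applied elementwise.

```latex
\[
  x_t \;=\; \sum_{\ell=1}^{L} \bigl(G \odot A_\ell\bigr)\, x_{t-\ell} \;+\; \varepsilon_t,
  \qquad \varepsilon_t \sim \mathcal{N}(0, \Sigma),
  \qquad G \in \{0,1\}^{N \times N}.
\]
```

Under this sketch, G_{ij} = 1 means that past values of series j are allowed to influence series i, so the Granger-causal graph is read directly from G rather than from thresholded entries of A_\ell.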

The authors evaluate their approach on synthetic, semi-synthetic, and climate data. They report that the model is more uncertainty-aware, has fewer hyperparameters, and performs better than competing approaches, especially in low-data regimes.

Critical Analysis

The PFGCG model represents an interesting advance in causal discovery from time series data, particularly for high-dimensional settings. By integrating Granger causality and Poisson factorization, the authors have developed a scalable approach that can uncover interpretable causal insights.

One limitation mentioned in the paper is that the model assumes linear Granger-causal relationships, which may not always reflect the true underlying causal mechanisms. Extensions to nonlinear Granger causality could be an area for future research.

Additionally, the authors note that the Poisson assumption may not always be appropriate for the data at hand. Exploring alternative matrix factorization approaches, such as nonnegative matrix factorization, could help relax this constraint.

It would also be valuable to further investigate the robustness of the PFGCG model to confounding variables and other threats to causal inference, as is often a concern when drawing causal conclusions from observational data. Bounding causal effects under Markov equivalence is one potential direction to explore.

Overall, this paper makes a compelling contribution to the field of causal discovery, providing a novel and scalable approach that can uncover meaningful insights from complex, high-dimensional time series data. Further research to address the limitations mentioned could help strengthen and expand the applicability of this work.

Conclusion

This paper introduces a new Poisson factorized Granger-causal graph model that combines two powerful techniques - Granger causality and Poisson factorization - to enable the discovery of causal relationships in high-dimensional time series data.

The authors evaluate their approach on synthetic, semi-synthetic, and climate data, reporting better performance than competing approaches, especially when observations are scarce. This represents a useful advance in causal discovery, with potential applications in fields like neuroscience, social science, and finance, where understanding the underlying causal structure of complex systems is critical for decision-making, forecasting, and intervention planning.

While the model has some limitations, such as the assumption of linear Granger causality and the Poisson data distribution, the authors have laid the groundwork for further research to address these issues and expand the capabilities of this causal discovery framework.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Bayesian Vector AutoRegression with Factorised Granger-Causal Graphs

He Zhao, Vassili Kitsios, Terence J. O'Kane, Edwin V. Bonilla

We study the problem of automatically discovering Granger causal relations from observational multivariate time-series data. Vector autoregressive (VAR) models have been time-tested for this problem, including Bayesian variants and more recent developments using deep neural networks. Most existing VAR methods for Granger causality use sparsity-inducing penalties/priors or post-hoc thresholds to interpret their coefficients as Granger causal graphs. Instead, we propose a new Bayesian VAR model with a hierarchical factorised prior distribution over binary Granger causal graphs, separately from the VAR coefficients. We develop an efficient algorithm to infer the posterior over binary Granger causal graphs. Comprehensive experiments on synthetic, semi-synthetic, and climate data show that our method is more uncertainty aware, has fewer hyperparameters, and achieves better performance than competing approaches, especially in low-data regimes where there are fewer observations.

5/27/2024

Online Graph Topology Learning from Matrix-valued Time Series

Yiye Jiang, Jérémie Bigot, Sofian Maabout

The focus is on the statistical analysis of matrix-valued time series, where data is collected over a network of sensors, typically at spatial locations, over time. Each sensor records a vector of features at each time point, creating a vectorial time series for each sensor. The goal is to identify the dependency structure among these sensors and represent it with a graph. When only one feature per sensor is observed, vector auto-regressive (VAR) models are commonly used to infer Granger causality, resulting in a causal graph. The first contribution extends VAR models to matrix-variate models for the purpose of graph learning. Additionally, two online procedures are proposed for both low and high dimensions, enabling rapid updates of coefficient estimates as new samples arrive. In the high-dimensional setting, a novel Lasso-type approach is introduced, and homotopy algorithms are developed for online learning. An adaptive tuning procedure for the regularization parameter is also provided. Given that the application of auto-regressive models to data typically requires detrending, which is not feasible in an online context, the proposed AR models are augmented by incorporating trend as an additional parameter, with a particular focus on periodic trends. The online algorithms are adapted to these augmented data models, allowing for simultaneous learning of the graph and trend from streaming samples. Numerical experiments using both synthetic and real data demonstrate the effectiveness of the proposed methods.

9/10/2024

Granger Causality using Neural Networks

Malik Shahid Sultan, Samuel Horvath, Hernando Ombao

Dependence between nodes in a network is an important concept that pervades many areas including finance, politics, sociology, genomics and the brain sciences. One way to characterize dependence between components of a multivariate time series is via Granger Causality (GC). Standard approaches to GC estimation and inference commonly assume linear dynamics; however, this simplification does not hold in many real-world applications where signals are inherently non-linear. In such cases, imposing linear models such as vector autoregressive (VAR) models can lead to mis-characterization of true Granger Causal interactions. To overcome this limitation, Tank et al (IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022) proposed a solution that uses neural networks with sparse regularization penalties. The regularization encourages learnable weights to be sparse, which enables inference on GC. This paper overcomes the limitations of current methods by leveraging advances in machine learning and deep learning which have been demonstrated to learn hidden patterns in the data. We propose novel classes of models that can handle underlying non-linearity in a computationally efficient manner, simultaneously providing GC and lag order selection. Firstly, we present the Learned Kernel VAR (LeKVAR) model that learns a kernel parameterized by a shared neural net, followed by penalization on learnable weights to discover GC structure. Secondly, we show one can directly decouple lags and individual time series importance via decoupled penalties. This is important as we want to select the lag order during the process of GC estimation. This decoupling acts as a filter and can be extended to any DL model including Multi-Layer Perceptrons (MLP), Recurrent Neural Networks (RNN), Long Short Term Memory Networks (LSTM), Transformers etc, for simultaneous GC estimation and lag selection.

8/9/2024

Effective Bayesian Causal Inference via Structural Marginalisation and Autoregressive Orders

Christian Toth, Christian Knoll, Franz Pernkopf, Robert Peharz

Bayesian causal inference (BCI) naturally incorporates epistemic uncertainty about the true causal model into downstream causal reasoning tasks by posterior averaging over causal models. However, this poses a tremendously hard computational problem due to the intractable number of causal structures to marginalise over. In this work, we decompose the structure learning problem into inferring (i) a causal order and (ii) a parent set for each variable given a causal order. By limiting the number of parents per variable, we can exactly marginalise over the parent sets in polynomial time, which leaves only the causal order to be marginalised. To this end, we propose a novel autoregressive model over causal orders (ARCO) learnable with gradient-based methods. Our method yields state-of-the-art results in structure learning on simulated non-linear additive noise benchmarks with scale-free and Erdős-Rényi graph structures, and competitive results on real-world data. Moreover, we illustrate that our method accurately infers interventional distributions, which allows us to estimate posterior average causal effects and many other causal quantities of interest.

7/17/2024