Granger Causality using Neural Networks

Read original: arXiv:2208.03703 - Published 8/9/2024 by Malik Shahid Sultan, Samuel Horvath, Hernando Ombao

🧠

Overview

The paper proposes new methods for characterizing dependence between components of multivariate time series data, which is an important concept in fields like finance, politics, and neuroscience.
Traditional approaches to Granger Causality (GC) estimation often assume linear dynamics, which can lead to mischaracterization of true causal interactions in real-world non-linear systems.
The authors present two novel models - Learned Kernel VAR (LeKVAR) and a decoupled penalty approach - that leverage advances in machine learning and deep learning to handle underlying non-linearity and provide GC estimation along with lag order selection.

Plain English Explanation

When we have data that changes over time, like stock prices or brain activity, we're often interested in how different parts of the system influence each other. Granger Causality is one way to study these relationships - it looks at whether knowing the history of one variable helps us predict another.

However, standard Granger Causality methods assume the relationships are linear, which isn't always true in the real world. In cases with more complex, non-linear dynamics, these simple models can miss important connections.

The authors of this paper propose some new approaches that use machine learning and deep learning techniques to better handle non-linear relationships. The first model, Learned Kernel VAR (LeKVAR), learns a flexible kernel function to capture non-linear patterns, while also using sparsity regularization to infer the Granger Causal structure.

The second approach directly separates the importance of different time lags and individual time series, which helps with selecting the appropriate lag order during the Granger Causality estimation process. This decoupled penalty can be applied to a variety of deep learning models, like neural networks and recurrent networks.

By leveraging advances in machine learning, these new methods can better uncover the complex interdependencies in real-world dynamic systems, with applications in fields like finance, neuroscience, and social science.

Technical Explanation

The paper proposes two novel models for Granger Causality (GC) estimation that can handle non-linear dynamics:

Learned Kernel VAR (LeKVAR): This model learns a kernel function parameterized by a shared neural network, followed by sparsity-inducing regularization to discover the GC structure. The learned kernel allows the model to capture non-linear relationships, while the regularization promotes sparse weights, enabling inference on the GC.
Decoupled Penalty Approach: This approach directly decouples the importance of lags and individual time series through separate penalty terms. This allows for simultaneous GC estimation and lag order selection, as the decoupled penalties act as a filtering mechanism. This decoupling can be extended to work with a variety of deep learning models, including MLPs, RNNs, LSTMs, and Transformers.

Both approaches leverage advances in machine learning and deep learning to overcome the limitations of traditional linear GC methods, which can fail to capture true causal interactions in non-linear real-world systems.

The authors evaluate the performance of their proposed models on both synthetic and real-world datasets, including financial time series and neuroimaging data. The results demonstrate that the new methods outperform standard GC estimation techniques, particularly in the presence of non-linear dynamics.

Critical Analysis

The paper presents a compelling solution to the limitations of traditional Granger Causality estimation methods, which often struggle with non-linear relationships in real-world data. By incorporating modern machine learning techniques, the proposed models are able to better capture the underlying complexity of the system under study.

One potential limitation of the work is the computational complexity of the deep learning-based approaches, which may limit their practical application to large-scale problems. The authors acknowledge this and suggest that future research could explore more efficient architectures or optimization techniques to address this.

Additionally, the paper does not provide extensive analysis of the interpretability and robustness of the learned GC structures. While the sparsity-inducing regularization aims to promote interpretable models, further investigation into the stability and reliability of the inferred causal relationships would be valuable.

Finally, the authors note that their methods rely on the assumption of stationarity in the time series data. Extending the approaches to handle non-stationary dynamics could broaden their applicability to a wider range of real-world scenarios.

Overall, the paper presents an important step forward in the field of causal inference for non-linear time series data, and the proposed models offer promising avenues for further research and development.

Conclusion

This paper introduces novel machine learning-based methods for characterizing dependence and Granger Causality in multivariate time series data. By leveraging advances in deep learning, the authors have developed models that can handle the non-linear dynamics often present in real-world systems, overcoming the limitations of traditional linear approaches.

The proposed LeKVAR and decoupled penalty models demonstrate improved performance in both synthetic and real-world datasets, highlighting their potential for applications in fields like finance, neuroscience, and social science, where understanding complex causal relationships is of critical importance.

While the methods show promise, the authors also identify areas for further research, such as improving computational efficiency and exploring the interpretability and robustness of the inferred causal structures. Nonetheless, this work represents a significant step forward in the field of causal inference for non-linear time series data, with valuable implications for our understanding of dynamic systems across diverse domains.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🧠

Granger Causality using Neural Networks

Malik Shahid Sultan, Samuel Horvath, Hernando Ombao

Dependence between nodes in a network is an important concept that pervades many areas including finance, politics, sociology, genomics and the brain sciences. One way to characterize dependence between components of a multivariate time series data is via Granger Causality (GC). Standard traditional approaches to GC estimation / inference commonly assume linear dynamics, however such simplification does not hold in many real-world applications where signals are inherently non-linear. In such cases, imposing linear models such as vector autoregressive (VAR) models can lead to mis-characterization of true Granger Causal interactions. To overcome this limitation, Tank et al (IEEE Transactions on Pattern Analysis and Machine Learning, 2022) proposed a solution that uses neural networks with sparse regularization penalties. The regularization encourages learnable weights to be sparse, which enables inference on GC. This paper overcomes the limitations of current methods by leveraging advances in machine learning and deep learning which have been demonstrated to learn hidden patterns in the data. We propose novel classes of models that can handle underlying non-linearity in a computationally efficient manner, simultaneously providing GC and lag order selection. Firstly, we present the Learned Kernel VAR (LeKVAR) model that learns kernel parameterized by a shared neural net followed by penalization on learnable weights to discover GC structure. Secondly, we show one can directly decouple lags and individual time series importance via decoupled penalties. This is important as we want to select the lag order during the process of GC estimation. This decoupling acts as a filtering and can be extended to any DL model including Multi-Layer Perceptrons (MLP), Recurrent Neural Networks (RNN), Long Short Term Memory Networks (LSTM), Transformers etc, for simultaneous GC estimation and lag selection.

8/9/2024

Jacobian Regularizer-based Neural Granger Causality

Wanqi Zhou, Shuanghao Bai, Shujian Yu, Qibin Zhao, Badong Chen

With the advancement of neural networks, diverse methods for neural Granger causality have emerged, which demonstrate proficiency in handling complex data, and nonlinear relationships. However, the existing framework of neural Granger causality has several limitations. It requires the construction of separate predictive models for each target variable, and the relationship depends on the sparsity on the weights of the first layer, resulting in challenges in effectively modeling complex relationships between variables as well as unsatisfied estimation accuracy of Granger causality. Moreover, most of them cannot grasp full-time Granger causality. To address these drawbacks, we propose a Jacobian Regularizer-based Neural Granger Causality (JRNGC) approach, a straightforward yet highly effective method for learning multivariate summary Granger causality and full-time Granger causality by constructing a single model for all target variables. Specifically, our method eliminates the sparsity constraints of weights by leveraging an input-output Jacobian matrix regularizer, which can be subsequently represented as the weighted causal matrix in the post-hoc analysis. Extensive experiments show that our proposed approach achieves competitive performance with the state-of-the-art methods for learning summary Granger causality and full-time Granger causality while maintaining lower model complexity and high scalability.

5/15/2024

NeuroKoopman Dynamic Causal Discovery

Rahmat Adesunkanmi, Balaji Sesha Srikanth Pokuri, Ratnesh Kumar

In many real-world applications where the system dynamics has an underlying interdependency among its variables (such as power grid, economics, neuroscience, omics networks, environmental ecosystems, and others), one is often interested in knowing whether the past values of one time series influences the future of another, known as Granger causality, and the associated underlying dynamics. This paper introduces a Koopman-inspired framework that leverages neural networks for data-driven learning of the Koopman bases, termed NeuroKoopman Dynamic Causal Discovery (NKDCD), for reliably inferring the Granger causality along with the underlying nonlinear dynamics. NKDCD employs an autoencoder architecture that lifts the nonlinear dynamics to a higher dimension using data-learned bases, where the lifted time series can be reliably modeled linearly. The lifting function, the linear Granger causality lag matrices, and the projection function (from lifted space to base space) are all represented as multilayer perceptrons and are all learned simultaneously in one go. NKDCD also utilizes sparsity-inducing penalties on the weights of the lag matrices, encouraging the model to select only the needed causal dependencies within the data. Through extensive testing on practically applicable datasets, it is shown that the NKDCD outperforms the existing nonlinear Granger causality discovery approaches.

4/26/2024

Bayesian Vector AutoRegression with Factorised Granger-Causal Graphs

He Zhao, Vassili Kitsios, Terence J. O'Kane, Edwin V. Bonilla

We study the problem of automatically discovering Granger causal relations from observational multivariate time-series data.Vector autoregressive (VAR) models have been time-tested for this problem, including Bayesian variants and more recent developments using deep neural networks. Most existing VAR methods for Granger causality use sparsity-inducing penalties/priors or post-hoc thresholds to interpret their coefficients as Granger causal graphs. Instead, we propose a new Bayesian VAR model with a hierarchical factorised prior distribution over binary Granger causal graphs, separately from the VAR coefficients. We develop an efficient algorithm to infer the posterior over binary Granger causal graphs. Comprehensive experiments on synthetic, semi-synthetic, and climate data show that our method is more uncertainty aware, has less hyperparameters, and achieves better performance than competing approaches, especially in low-data regimes where there are less observations.

5/27/2024