Granger Causal Inference in Multivariate Hawkes Processes by Minimum Message Length

Read original: arXiv:2309.02027 - Published 4/12/2024 by Katerina Hlavackova-Schindler, Anna Melnykova, Irene Tubikanec

🤯

Overview

Multivariate Hawkes processes (MHPs) are a versatile tool for modeling real-world phenomena like earthquakes, stock market activity, neuronal activity, and virus propagation.
This paper focuses on MHPs with exponential decay kernels and estimating the connectivity graphs that represent Granger causal relationships between the components.
The authors propose an optimization criterion and model selection algorithm based on the minimum message length (MML) principle, which prefers models that provide the most concise explanation of the observed data.
The MML-based method is shown to outperform other state-of-the-art methods, especially in scenarios with short time horizons, where the latter tend to overfit.

Plain English Explanation

Multivariate Hawkes processes (link) are mathematical models that can be used to study and understand various real-world events, such as earthquakes, stock market activity, brain cell behavior, and the spread of viruses. In this paper, the researchers focus on a specific type of Hawkes process that has an exponential decay pattern.

The main goal of the study is to estimate the relationships, or "connectivity graphs," between the different components of these Hawkes processes. These connectivity graphs represent the Granger causal connections between the variables being modeled. To do this, the researchers propose a new optimization method and model selection algorithm based on the "minimum message length" (MML) principle.

The MML principle states that, even if two models fit the observed data equally well, the simpler model that can be described more concisely is the better one. This is similar to the idea of Occam's razor, where the simplest explanation is often preferred. By using this MML approach, the researchers found that their method outperformed other state-of-the-art techniques, especially when dealing with datasets that cover a relatively short time period, where the other methods tend to overfit the data.

The researchers also applied their MML-based method to analyze government bond data from the G7 countries and were able to identify causal connections that align with expert knowledge in the field.

Technical Explanation

The paper presents a method for estimating the connectivity graphs, which represent Granger causal relationships, in the context of multivariate Hawkes processes (MHPs) with exponential decay kernels.

The authors propose an optimization criterion and model selection algorithm based on the minimum message length (MML) principle. MML compares different Granger causal models using the Occam's razor principle - even when models have similar goodness-of-fit to the observed data, the one that provides the most concise explanation of the data is preferred.

The key innovation is that, while most existing state-of-the-art methods using lasso-type penalization tend to overfit in scenarios with short time horizons, the proposed MML-based method is shown to achieve high F1 scores in these settings.

The authors conduct a numerical study comparing their proposed algorithm to other classical and state-of-the-art methods. They find that their MML-based approach achieves the highest F1 scores in specific sparse graph settings.

The method is also illustrated on G7 sovereign bond data, and the obtained causal connections are found to be in agreement with expert knowledge available in the literature.

Critical Analysis

The paper presents a well-designed and thorough study, with a clear focus on addressing the limitations of existing methods, particularly in scenarios with short time horizons. The authors' use of the MML principle as the basis for their optimization and model selection criteria is a novel and promising approach.

One potential area for further research could be to explore the performance of the MML-based method on a wider range of real-world datasets and application domains, beyond the G7 bond data example provided. This would help to further validate the generalizability and robustness of the proposed technique.

Additionally, while the paper provides a comprehensive technical explanation, there may be value in exploring more intuitive analogies or examples to help a general audience better understand the significance and implications of the research (link).

Overall, this paper presents a valuable contribution to the field of Multivariate Hawkes Processes and causal inference, with a novel approach that shows promising results, particularly in challenging data regimes.

Conclusion

This paper introduces a new method for estimating connectivity graphs, representing Granger causal relationships, in the context of multivariate Hawkes processes with exponential decay kernels. The key innovation is the use of a minimum message length (MML) principle-based optimization and model selection approach, which outperforms other state-of-the-art techniques, especially in scenarios with short time horizons.

The proposed MML-based method has the potential to provide valuable insights into a wide range of real-world phenomena, from neuronal activity to virus propagation. The ability to accurately infer causal relationships from limited data could have important implications for fields such as epidemiology, finance, and neuroscience.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤯

Granger Causal Inference in Multivariate Hawkes Processes by Minimum Message Length

Katerina Hlavackova-Schindler, Anna Melnykova, Irene Tubikanec

Multivariate Hawkes processes (MHPs) are versatile probabilistic tools used to model various real-life phenomena: earthquakes, operations on stock markets, neuronal activity, virus propagation and many others. In this paper, we focus on MHPs with exponential decay kernels and estimate connectivity graphs, which represent the Granger causal relations between their components. We approach this inference problem by proposing an optimization criterion and model selection algorithm based on the minimum message length (MML) principle. MML compares Granger causal models using the Occam's razor principle in the following way: even when models have a comparable goodness-of-fit to the observed data, the one generating the most concise explanation of the data is preferred. While most of the state-of-art methods using lasso-type penalization tend to overfitting in scenarios with short time horizons, the proposed MML-based method achieves high F1 scores in these settings. We conduct a numerical study comparing the proposed algorithm to other related classical and state-of-art methods, where we achieve the highest F1 scores in specific sparse graph settings. We illustrate the proposed method also on G7 sovereign bond data and obtain causal connections, which are in agreement with the expert knowledge available in the literature.

4/12/2024

Flexible Parametric Inference for Space-Time Hawkes Processes

Emilia Siviero, Guillaume Staerman, Stephan Cl'emenc{c}on, Thomas Moreau

Many modern spatio-temporal data sets, in sociology, epidemiology or seismology, for example, exhibit self-exciting characteristics, triggering and clustering behaviors both at the same time, that a suitable Hawkes space-time process can accurately capture. This paper aims to develop a fast and flexible parametric inference technique to recover the parameters of the kernel functions involved in the intensity function of a space-time Hawkes process based on such data. Our statistical approach combines three key ingredients: 1) kernels with finite support are considered, 2) the space-time domain is appropriately discretized, and 3) (approximate) precomputations are used. The inference technique we propose then consists of a $ell_2$ gradient-based solver that is fast and statistically accurate. In addition to describing the algorithmic aspects, numerical experiments have been carried out on synthetic and real spatio-temporal data, providing solid empirical evidence of the relevance of the proposed methodology.

6/18/2024

Mamba Hawkes Process

Anningzhe Gao, Shan Dai, Yan Hu

Irregular and asynchronous event sequences are prevalent in many domains, such as social media, finance, and healthcare. Traditional temporal point processes (TPPs), like Hawkes processes, often struggle to model mutual inhibition and nonlinearity effectively. While recent neural network models, including RNNs and Transformers, address some of these issues, they still face challenges with long-term dependencies and computational efficiency. In this paper, we introduce the Mamba Hawkes Process (MHP), which leverages the Mamba state space architecture to capture long-range dependencies and dynamic event interactions. Our results show that MHP outperforms existing models across various datasets. Additionally, we propose the Mamba Hawkes Process Extension (MHP-E), which combines Mamba and Transformer models to enhance predictive capabilities. We present the novel application of the Mamba architecture to Hawkes processes, a flexible and extensible model structure, and a theoretical analysis of the synergy between state space models and Hawkes processes. Experimental results demonstrate the superior performance of both MHP and MHP-E, advancing the field of temporal point process modeling.

7/9/2024

Network reconstruction via the minimum description length principle

Tiago P. Peixoto

A fundamental problem associated with the task of network reconstruction from dynamical or behavioral data consists in determining the most appropriate model complexity in a manner that prevents overfitting, and produces an inferred network with a statistically justifiable number of edges. The status quo in this context is based on $L_{1}$ regularization combined with cross-validation. However, besides its high computational cost, this commonplace approach unnecessarily ties the promotion of sparsity with weight shrinkage. This combination forces a trade-off between the bias introduced by shrinkage and the network sparsity, which often results in substantial overfitting even after cross-validation. In this work, we propose an alternative nonparametric regularization scheme based on hierarchical Bayesian inference and weight quantization, which does not rely on weight shrinkage to promote sparsity. Our approach follows the minimum description length (MDL) principle, and uncovers the weight distribution that allows for the most compression of the data, thus avoiding overfitting without requiring cross-validation. The latter property renders our approach substantially faster to employ, as it requires a single fit to the complete data. As a result, we have a principled and efficient inference scheme that can be used with a large variety of generative models, without requiring the number of edges to be known in advance. We also demonstrate that our scheme yields systematically increased accuracy in the reconstruction of both artificial and empirical networks. We highlight the use of our method with the reconstruction of interaction networks between microbial communities from large-scale abundance samples involving in the order of $10^{4}$ to $10^{5}$ species, and demonstrate how the inferred model can be used to predict the outcome of interventions in the system.

5/8/2024