A Measure-Theoretic Axiomatisation of Causality

Read original: arXiv:2305.17139 - Published 6/7/2024 by Junhyung Park, Simon Buchholz, Bernhard Scholkopf, Krikamol Muandet

🤯

Overview

The paper proposes a new framework for understanding causality, viewing it as an extension of probability theory and a study of what happens when intervening on a system.
The authors argue for building on Kolmogorov's measure-theoretic axiomatization of probability as a starting point for an axiomatization of causality.
They introduce the concept of a "causal space", which consists of a probability space and a collection of "causal kernels" that encode the causal information of the space.

Plain English Explanation

The paper tackles the fundamental question of how to formally define and reason about causality. Causality is a crucial concept in many fields, from science to philosophy, but there is still no universally agreed-upon way to define it mathematically.

The authors propose building on the well-established mathematical theory of probability, proposed by Andrey Kolmogorov. They suggest that causality can be seen as an extension of probability, where we not only look at how events are correlated, but also what happens when we actively intervene and change the system.

To capture this idea, the authors introduce the concept of a "causal space". This consists of a standard probability space, along with a set of "causal kernels" that describe how the probabilities of different events change when we intervene on the system. This framework allows the authors to rigorously analyze various aspects of causality, including cycles, latent variables, and stochastic processes, which have been challenging for other approaches.

Technical Explanation

The key idea in the paper is to view causality as an extension of probability theory, rather than as a separate concept. The authors start by arguing that Kolmogorov's measure-theoretic axiomatization of probability provides a solid foundation for building a formal theory of causality.

They then introduce the concept of a "causal space", which consists of a probability space (a set of events, a sigma-algebra, and a probability measure) along with a collection of "causal kernels". These causal kernels are transition probability functions that describe how the probabilities of different events change when we intervene on the system.

For example, if we have a variable X that causally influences a variable Y, the causal kernel would describe how the probabilities of different values of Y change when we set X to a particular value. This allows the authors to formally capture the notion of "what happens when we intervene on a system", which is central to many causal reasoning tasks.

The authors show how this framework can shed light on various long-standing challenges in causal reasoning, such as handling cycles, latent variables, and stochastic processes. By grounding their approach in measure theory, they are able to provide a rigorous mathematical foundation for causal reasoning.

Critical Analysis

The authors' proposal for a measure-theoretic axiomatization of causality is a valuable contribution to the ongoing efforts to formally define and reason about causality. By building on the well-established foundations of probability theory, the authors have created a framework that is both mathematically rigorous and conceptually appealing.

One potential limitation of the approach is that it may require significant technical expertise to fully understand and apply. The concepts of causal spaces and causal kernels, while well-defined, may be challenging for some readers to grasp. Additionally, the authors do not provide extensive examples or applications of their framework, which could make it harder for researchers to understand how to put it into practice.

Furthermore, the authors acknowledge that their framework does not yet address all the complexities of causal reasoning, such as the role of time and the potential for feedback loops. While they argue that their approach can shed light on these issues, further research may be needed to fully incorporate these aspects into the framework.

Despite these potential limitations, the authors' work represents an important step forward in the effort to develop a comprehensive and rigorous theory of causality. By bridging the gap between probability theory and causal reasoning, they have laid the groundwork for fruitful future research in this critical area.

Conclusion

The paper proposes a novel framework for understanding causality, grounded in the measure-theoretic foundations of probability theory. By introducing the concept of a "causal space", the authors have created a rigorous mathematical foundation for reasoning about causal relationships and the effects of interventions on a system.

While the technical complexity of the framework may present some challenges, the authors' work represents an important contribution to the ongoing quest to formally define and reason about causality. Their approach has the potential to shed light on long-standing issues in causal reasoning, and may pave the way for further advancements in this crucial area of research.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤯

A Measure-Theoretic Axiomatisation of Causality

Junhyung Park, Simon Buchholz, Bernhard Scholkopf, Krikamol Muandet

Causality is a central concept in a wide range of research areas, yet there is still no universally agreed axiomatisation of causality. We view causality both as an extension of probability theory and as a study of textit{what happens when one intervenes on a system}, and argue in favour of taking Kolmogorov's measure-theoretic axiomatisation of probability as the starting point towards an axiomatisation of causality. To that end, we propose the notion of a textit{causal space}, consisting of a probability space along with a collection of transition probability kernels, called textit{causal kernels}, that encode the causal information of the space. Our proposed framework is not only rigorously grounded in measure theory, but it also sheds light on long-standing limitations of existing frameworks including, for example, cycles, latent variables and stochastic processes.

6/7/2024

New!Towards Definition of Higher Order Causality in Complex Systems

Jakub Kov{r}enek, Pavel Sanda, Jaroslav Hlinka

The description of the dynamics of complex systems, in particular the capture of the interaction structure and causal relationships between elements of the system, is one of the central questions of interdisciplinary research. While the characterization of pairwise causal interactions is a relatively ripe field with established theoretical concepts and the current focus is on technical issues of their efficient estimation, it turns out that the standard concepts such as Granger causality or transfer entropy may not faithfully reflect possible synergies or interactions of higher orders, phenomena highly relevant for many real-world complex systems. In this paper, we propose a generalization and refinement of the information-theoretic approach to causal inference, enabling the description of truly multivariate, rather than multiple pairwise, causal interactions, and moving thus from causal networks to causal hypernetworks. In particular, while keeping the ability to control for mediating variables or common causes, in case of purely synergetic interactions such as the exclusive disjunction, it ascribes the causal role to the multivariate causal set but emph{not} to individual inputs, distinguishing it thus from the case of e.g. two additive univariate causes. We demonstrate this concept by application to illustrative theoretical examples as well as a biophysically realistic simulation of biological neuronal dynamics recently reported to employ synergetic computations.

9/16/2024

💬

Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability

Atticus Geiger, Duligur Ibeling, Amir Zur, Maheep Chaudhary, Sonakshi Chauhan, Jing Huang, Aryaman Arora, Zhengxuan Wu, Noah Goodman, Christopher Potts, Thomas Icard

Causal abstraction provides a theoretical foundation for mechanistic interpretability, the field concerned with providing intelligible algorithms that are faithful simplifications of the known, but opaque low-level details of black box AI models. Our contributions are (1) generalizing the theory of causal abstraction from mechanism replacement (i.e., hard and soft interventions) to arbitrary mechanism transformation (i.e., functionals from old mechanisms to new mechanisms), (2) providing a flexible, yet precise formalization for the core concepts of modular features, polysemantic neurons, and graded faithfulness, and (3) unifying a variety of mechanistic interpretability methodologies in the common language of causal abstraction, namely activation and path patching, causal mediation analysis, causal scrubbing, causal tracing, circuit analysis, concept erasure, sparse autoencoders, differential binary masking, distributed alignment search, and activation steering.

7/9/2024

Teaching Transformers Causal Reasoning through Axiomatic Training

Aniket Vashishtha, Abhinav Kumar, Abbavaram Gowtham Reddy, Vineeth N Balasubramanian, Amit Sharma

For text-based AI systems to interact in the real world, causal reasoning is an essential skill. Since interventional data is costly to generate, we study to what extent an agent can learn causal reasoning from passive data. Specifically, we consider an axiomatic training setup where an agent learns from multiple demonstrations of a causal axiom (or rule), rather than incorporating the axiom as an inductive bias or inferring it from data values. A key question is whether the agent would learn to generalize from the axiom demonstrations to new scenarios. For example, if a transformer model is trained on demonstrations of the causal transitivity axiom over small graphs, would it generalize to applying the transitivity axiom over large graphs? Our results, based on a novel axiomatic training scheme, indicate that such generalization is possible. We consider the task of inferring whether a variable causes another variable, given a causal graph structure. We find that a 67 million parameter transformer model, when trained on linear causal chains (along with some noisy variations) can generalize well to new kinds of graphs, including longer causal chains, causal chains with reversed order, and graphs with branching; even when it is not explicitly trained for such settings. Our model performs at par (or even better) than many larger language models such as GPT-4, Gemini Pro, and Phi-3. Overall, our axiomatic training framework provides a new paradigm of learning causal reasoning from passive data that can be used to learn arbitrary axioms, as long as sufficient demonstrations can be generated.

7/11/2024