Probabilities of the third type: Statistical Relational Learning and Reasoning with Relative Frequencies

Read original: arXiv:2202.10367 - Published 8/21/2024 by Felix Weitkamper

Overview

The paper introduces a new formalism called "Functional Lifted Bayesian Networks" (FLBNs) that explicitly incorporates continuous dependencies on relative frequencies into statistical relational artificial intelligence.
FLBNs are compared and contrasted with "Lifted Bayesian Networks for Conditional Probability Logic" (LBNCPLs), a recently introduced approach that expresses discrete dependencies on probabilistic data.
The paper also provides a representation of the asymptotic probability distributions induced by FLBNs on domains of increasing sizes, which can be used to estimate parameters for a large domain consistently from randomly sampled subpopulations.

Plain English Explanation

When modelling relational data, probabilities often depend on the relative frequency of a state in the domain. For example, the likelihood of a school closure during an epidemic might depend on whether the proportion of infected pupils exceeds a certain threshold. Often the dependency is continuous rather than threshold-based: the likelihood of any one mosquito bite transmitting an illness, for instance, depends on the proportion of carrier mosquitoes.
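The difference between a threshold-based and a continuous dependency can be made concrete with a small sketch. This is an illustration only, not the paper's formalism: the function names, the threshold of 0.3, the probabilities 0.05 and 0.9, and the logistic shape with its weight and bias are all assumptions chosen for the example.

```python
import math

def relative_frequency(flags):
    """Fraction of domain elements for which the property holds."""
    flags = list(flags)
    return sum(flags) / len(flags) if flags else 0.0

def closure_prob_threshold(freq, threshold=0.3, p_low=0.05, p_high=0.9):
    """Discrete dependence: the probability jumps once freq exceeds the threshold.

    The threshold and the two probabilities are hypothetical values.
    """
    return p_high if freq > threshold else p_low

def transmission_prob_continuous(freq, weight=6.0, bias=-3.0):
    """Continuous dependence: a logistic function of the relative frequency.

    The logistic form and its parameters are assumptions for illustration.
    """
    return 1.0 / (1.0 + math.exp(-(weight * freq + bias)))

# Hypothetical domain of 10 pupils, 2 of them infected.
infected = [True, False, False, True, False, False, False, False, False, False]
freq = relative_frequency(infected)  # 0.2

# Near the threshold the discrete model jumps, the continuous one barely moves.
for f in (freq, 0.29, 0.31):
    print(f, closure_prob_threshold(f), round(transmission_prob_continuous(f), 3))
```

Under these assumptions, moving the frequency from 0.29 to 0.31 flips the threshold model from its low to its high probability, while the continuous model changes only slightly.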

Current approaches usually only consider probabilities over possible worlds, rather than over domain elements themselves. The recently introduced Lifted Bayesian Networks for Conditional Probability Logic addressed this by expressing discrete dependencies on probabilistic data.

The new "Functional Lifted Bayesian Networks" (FLBNs) introduced in this paper go a step further by explicitly incorporating continuous dependencies on relative frequencies. This approach not only improves modeling, but also provides a more rigorous way to handle learning problems where training and application domains have different sizes.

By representing the asymptotic probability distributions induced by FLBNs on domains of increasing sizes, the paper shows that parameters for a large domain can be consistently estimated from randomly sampled subpopulations. Furthermore, the authors prove that for parametric families of FLBNs, convergence to the asymptotic probabilities is uniform in the parameters, which ensures that the asymptotic probabilities depend meaningfully on the model parameters.
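The intuition behind estimating from subpopulations is that relative frequencies are stable under uniform random sampling, so a model driven by frequencies sees similar inputs on a sample and on the full domain. The simulation below sketches only that intuition. It is not the paper's asymptotic representation or its estimation procedure, and the domain size, the 30% carrier rate, and the helper names are hypothetical.

```python
import random

def relative_frequency(flags):
    """Fraction of elements for which the property holds."""
    return sum(flags) / len(flags)

def sample_frequency(domain, sample_size, rng):
    """Relative frequency within a uniformly sampled subpopulation
    (sampling without replacement)."""
    return relative_frequency(rng.sample(domain, sample_size))

rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical large domain: 100,000 mosquitoes, exactly 30% carriers.
domain = [True] * 30_000 + [False] * 70_000
full_freq = relative_frequency(domain)

# Larger subpopulations tend to track the full-domain frequency more closely.
for size in (100, 1_000, 10_000):
    est = sample_frequency(domain, size, rng)
    print(size, round(est, 3), round(abs(est - full_freq), 3))
```

A quantity that depends continuously on the frequency inherits this stability, whereas a hard threshold can flip if the sampled frequency lands on the other side of it.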

Technical Explanation

The paper introduces "Functional Lifted Bayesian Networks" (FLBNs), a new formalism that explicitly incorporates continuous dependencies on relative frequencies into statistical relational artificial intelligence. This is in contrast to current approaches, which usually only consider probabilities over possible worlds rather than over domain elements themselves.
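To fix ideas, a relative frequency and the two kinds of dependency can be written as follows. The notation is assumed here for illustration and may differ from the paper's: $D$ is a finite domain, $\varphi(x)$ a property of domain elements, $t$ a threshold, $p_{\text{low}}$ and $p_{\text{high}}$ two fixed probabilities, and $f$ a continuous function.

```latex
% Relative frequency of \varphi in a finite domain D
\[
  \|\varphi(x)\|_x \;=\; \frac{\bigl|\{\, a \in D : \varphi(a) \,\}\bigr|}{|D|}
\]

% Discrete (threshold) dependence of an event R on the frequency
\[
  P(R) \;=\;
  \begin{cases}
    p_{\text{high}} & \text{if } \|\varphi(x)\|_x > t, \\
    p_{\text{low}}  & \text{otherwise.}
  \end{cases}
\]

% Continuous dependence, with f : [0,1] \to [0,1] continuous
\[
  P(R) \;=\; f\bigl(\|\varphi(x)\|_x\bigr)
\]
```

The first form corresponds to the school-closure example and the last to the mosquito example from the abstract.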

FLBNs are compared and contrasted with the recently introduced Lifted Bayesian Networks for Conditional Probability Logic (LBNCPLs), which express discrete dependencies on probabilistic data.

The paper provides a representation of the asymptotic probability distributions induced by FLBNs on domains of increasing sizes. This representation has well-understood scaling behavior across domain sizes, which can be leveraged to estimate parameters for a large domain consistently from randomly sampled subpopulations. The authors show that in parametric families of FLBNs, convergence to the asymptotic probabilities is uniform in the parameters, which ensures that the asymptotic probabilities depend meaningfully on the model parameters.
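One way to read the uniformity claim is through the standard definition of uniform convergence. The symbols are assumptions made for this sketch, not taken from the paper: $P_{n,\theta}(\psi)$ is the probability of a query $\psi$ on a domain of size $n$ under parameters $\theta \in \Theta$, and $P_{\infty,\theta}(\psi)$ is its asymptotic limit.

```latex
% Pointwise convergence: for each fixed \theta separately
\[
  \lim_{n \to \infty} P_{n,\theta}(\psi) \;=\; P_{\infty,\theta}(\psi)
\]

% Uniform convergence: the worst-case gap over all parameters vanishes
\[
  \sup_{\theta \in \Theta}
  \bigl|\, P_{n,\theta}(\psi) - P_{\infty,\theta}(\psi) \,\bigr|
  \;\xrightarrow[n \to \infty]{}\; 0
\]
```

Under this reading, if each $P_{n,\theta}(\psi)$ is continuous in $\theta$, the uniform limit theorem makes $P_{\infty,\theta}(\psi)$ continuous in $\theta$ as well, which is one sense in which the asymptotic probabilities depend meaningfully on the parameters.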

Critical Analysis

The paper introduces a novel approach to incorporating continuous dependencies on relative frequencies into statistical relational AI, an advance over most previous methods, which usually consider only probabilities over possible worlds rather than over domain elements. The authors provide a rigorous mathematical treatment of the asymptotic behavior of FLBNs, which is a key strength of the work.

However, the paper does not extensively discuss potential limitations or areas for further research. For example, it would be valuable to understand the computational complexity of working with FLBNs, the scalability of the approach to large domains, and how the performance of FLBNs compares to other state-of-the-art techniques in real-world applications.

Additionally, the paper could benefit from a more critical examination of the assumptions and simplifications made in the FLBN formalism. While the authors demonstrate the theoretical properties of the approach, it would be helpful to understand the practical implications and potential issues that may arise when applying FLBNs to complex, real-world problems.

Further research into the robustness of FLBNs to noisy or incomplete data, as well as their ability to capture higher-order interactions and nonlinear dependencies, could also strengthen the practical relevance of the proposed framework.

Conclusion

The introduction of "Functional Lifted Bayesian Networks" (FLBNs) represents an important advancement in statistical relational AI by providing a formalism that explicitly incorporates continuous dependencies on relative frequencies. The paper's rigorous treatment of the asymptotic behavior of FLBNs and its implications for parameter estimation are particularly noteworthy.

While the theoretical properties of FLBNs are well-established, further research is needed to fully understand the practical implications and limitations of the approach. Exploring the computational complexity, scalability, and performance of FLBNs in real-world applications, as well as investigating their robustness to various data challenges, could help strengthen the impact of this work and its potential contributions to the field of statistical relational AI.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Probabilities of the third type: Statistical Relational Learning and Reasoning with Relative Frequencies

Felix Weitkamper

Dependencies on the relative frequency of a state in the domain are common when modelling probabilistic dependencies on relational data. For instance, the likelihood of a school closure during an epidemic might depend on the proportion of infected pupils exceeding a threshold. Often, rather than depending on discrete thresholds, dependencies are continuous: for instance, the likelihood of any one mosquito bite transmitting an illness depends on the proportion of carrier mosquitoes. Current approaches usually only consider probabilities over possible worlds rather than over domain elements themselves. An exception are the recently introduced lifted Bayesian networks for conditional probability logic, which express discrete dependencies on probabilistic data. We introduce functional lifted Bayesian networks, a formalism that explicitly incorporates continuous dependencies on relative frequencies into statistical relational artificial intelligence, and compare and contrast them with lifted Bayesian networks for conditional probability logic. Incorporating relative frequencies is not only beneficial to modelling; it also provides a more rigorous approach to learning problems where training and test or application domains have different sizes. To this end, we provide a representation of the asymptotic probability distributions induced by functional lifted Bayesian networks on domains of increasing sizes. Since that representation has well-understood scaling behaviour across domain sizes, it can be used to estimate parameters for a large domain consistently from randomly sampled subpopulations. Furthermore, we show that in parametric families of FLBN, convergence is uniform in the parameters, which ensures a meaningful dependence of the asymptotic probabilities on the parameters of the model.

8/21/2024

What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

Akshay Paruchuri, Jake Garrison, Shun Liao, John Hernandez, Jacob Sunshine, Tim Althoff, Xin Liu, Daniel McDuff

Language models (LM) are capable of remarkably complex linguistic tasks; however, numerical reasoning is an area in which they frequently struggle. An important but rarely evaluated form of reasoning is understanding probability distributions. In this paper, we focus on evaluating the probabilistic reasoning capabilities of LMs using idealized and real-world statistical distributions. We perform a systematic evaluation of state-of-the-art LMs on three tasks: estimating percentiles, drawing samples, and calculating probabilities. We evaluate three ways to provide context to LMs 1) anchoring examples from within a distribution or family of distributions, 2) real-world context, 3) summary statistics on which to base a Normal approximation. Models can make inferences about distributions, and can be further aided by the incorporation of real-world context, example shots and simplified assumptions, even if these assumptions are incorrect or misspecified. To conduct this work, we developed a comprehensive benchmark distribution dataset with associated question-answer pairs that we will release publicly.

6/19/2024

Generalization bounds for learning under graph-dependence: A survey

Rui-Ray Zhang, Massih-Reza Amini

Traditional statistical learning theory relies on the assumption that data are identically and independently distributed (i.i.d.). However, this assumption often does not hold in many real-life applications. In this survey, we explore learning scenarios where examples are dependent and their dependence relationship is described by a dependency graph, a commonly utilized model in probability and combinatorics. We collect various graph-dependent concentration bounds, which are then used to derive Rademacher complexity and stability generalization bounds for learning from graph-dependent data. We illustrate this paradigm through practical learning tasks and provide some research directions for future work. To our knowledge, this survey is the first of this kind on this subject.

4/1/2024

On the Independence Assumption in Neurosymbolic Learning

Emile van Krieken, Pasquale Minervini, Edoardo M. Ponti, Antonio Vergari

State-of-the-art neurosymbolic learning systems use probabilistic reasoning to guide neural networks towards predictions that conform to logical constraints over symbols. Many such systems assume that the probabilities of the considered symbols are conditionally independent given the input to simplify learning and reasoning. We study and criticise this assumption, highlighting how it can hinder optimisation and prevent uncertainty quantification. We prove that loss functions bias conditionally independent neural networks to become overconfident in their predictions. As a result, they are unable to represent uncertainty over multiple valid options. Furthermore, we prove that these loss functions are difficult to optimise: they are non-convex, and their minima are usually highly disconnected. Our theoretical analysis gives the foundation for replacing the conditional independence assumption and designing more expressive neurosymbolic probabilistic models.

6/10/2024