Hierarchic Flows to Estimate and Sample High-dimensional Probabilities

2405.03468

Published 5/7/2024 by Etienne Lempereur, St'ephane Mallat

Hierarchic Flows to Estimate and Sample High-dimensional Probabilities

Abstract

Finding low-dimensional interpretable models of complex physical fields such as turbulence remains an open question, 80 years after the pioneer work of Kolmogorov. Estimating high-dimensional probability distributions from data samples suffers from an optimization and an approximation curse of dimensionality. It may be avoided by following a hierarchic probability flow from coarse to fine scales. This inverse renormalization group is defined by conditional probabilities across scales, renormalized in a wavelet basis. For a $varphi^4$ scalar potential, sampling these hierarchic models avoids the critical slowing down at the phase transition. An outstanding issue is to also approximate non-Gaussian fields having long-range interactions in space and across scales. We introduce low-dimensional models with robust multiscale approximations of high order polynomial energies. They are calculated with a second wavelet transform, which defines interactions over two hierarchies of scales. We estimate and sample these wavelet scattering models to generate 2D vorticity fields of turbulence, and images of dark matter densities.

Create account to get full access

Overview

• This paper introduces a novel approach called "Hierarchic Flows" for estimating and sampling high-dimensional probability distributions.

• The method leverages the hierarchical structure of complex systems to build a flexible and efficient model for representing and manipulating high-dimensional probabilities.

• The key idea is to decompose the high-dimensional probability distribution into a hierarchy of lower-dimensional conditional distributions, which can be learned and sampled more effectively.

• This allows the approach to overcome the challenges of traditional methods for high-dimensional probability estimation and sampling, which often struggle with the curse of dimensionality.

Plain English Explanation

Estimating and sampling high-dimensional probability distributions is a fundamental challenge in many areas of science and engineering. Imagine you have a complex system with hundreds or thousands of interacting variables - things like the weather, the stock market, or the human brain. Understanding the joint probability distribution of all these variables is crucial for tasks like forecasting, decision-making, and generating realistic synthetic data.

However, traditional probability models struggle to capture the full complexity of high-dimensional systems. As the number of variables grows, the number of parameters required to represent the joint probability distribution explodes, making it extremely difficult to estimate and sample from the distribution effectively.

The key insight of this paper is that many high-dimensional systems have an inherent hierarchical structure. For example, in the weather system, there are large-scale patterns like high and low pressure systems, which then influence smaller-scale phenomena like cloud formation and precipitation. By decomposing the high-dimensional probability distribution into a hierarchy of lower-dimensional conditional distributions, the Hierarchic Flows approach can model and sample from these complex probability distributions much more efficiently.

The hierarchical structure allows the model to capture the essential features of the high-dimensional system without getting bogged down in the curse of dimensionality. And by using powerful normalizing flow techniques, the authors can learn flexible, expressive models for the conditional probability distributions at each level of the hierarchy.

Technical Explanation

The key technical innovation of the Hierarchic Flows approach is the decomposition of the high-dimensional probability distribution into a hierarchy of lower-dimensional conditional distributions. Specifically, the authors propose modeling the joint probability distribution p(x) as a product of conditional distributions:

p(x) = p(x_1) * p(x_2 | x_1) * p(x_3 | x_1, x_2) * ... * p(x_n | x_1, x_2, ..., x_{n-1})

where x = (x_1, x_2, ..., x_n) is the high-dimensional random variable.

By learning flexible models for each of these conditional distributions using normalizing flows, the authors can efficiently represent and sample from the overall high-dimensional probability distribution. The hierarchical structure allows the model to capture the essential features of the system without being overwhelmed by the curse of dimensionality.

The authors demonstrate the effectiveness of their approach on a variety of high-dimensional probability estimation and sampling tasks, including network reconstruction and multivariate self-similarity analysis. The results show significant improvements over traditional methods, particularly in terms of sample quality and computational efficiency.

Critical Analysis

The Hierarchic Flows approach is a promising and innovative solution to the challenging problem of high-dimensional probability estimation and sampling. By leveraging the inherent hierarchical structure of complex systems, the method is able to overcome the limitations of traditional models and achieve impressive performance on a range of tasks.

That said, the paper does not explore the full scope of potential limitations and caveats. For example, the performance of the method likely depends on the extent to which the true high-dimensional distribution can be well-approximated by the proposed hierarchical decomposition. In cases where the underlying structure is more complex or irregular, the method may struggle to capture the essential features.

Additionally, the paper does not provide a rigorous theoretical analysis of the conditions under which the Hierarchic Flows approach is guaranteed to outperform alternative methods. While the empirical results are compelling, a deeper theoretical understanding of the method's strengths and weaknesses would be valuable for guiding its application in practice.

Overall, the Hierarchic Flows approach represents a significant advance in the field of high-dimensional probability modeling, and the ideas introduced in this paper are likely to spur further research and innovation. By continuing to push the boundaries of what is possible with high-dimensional data, researchers can unlock new possibilities in a wide range of scientific and engineering domains.

Conclusion

The Hierarchic Flows approach introduced in this paper offers a powerful and flexible solution for estimating and sampling high-dimensional probability distributions. By decomposing the complex joint distribution into a hierarchy of more manageable conditional distributions, the method is able to overcome the challenges posed by the curse of dimensionality and deliver state-of-the-art performance on a range of tasks.

The key insights of the paper – leveraging hierarchical structure, using normalizing flows for expressive modeling, and employing efficient sampling techniques – have the potential to significantly impact a wide range of fields, from weather forecasting and financial modeling to neuroscience and materials science. As researchers continue to explore the boundaries of high-dimensional data analysis, the Hierarchic Flows approach is poised to play an increasingly important role in unlocking new discoveries and driving innovation.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Generative Assignment Flows for Representing and Learning Joint Distributions of Discrete Data

Bastian Boll, Daniel Gonzalez-Alvarado, Stefania Petra, Christoph Schnorr

We introduce a novel generative model for the representation of joint probability distributions of a possibly large number of discrete random variables. The approach uses measure transport by randomized assignment flows on the statistical submanifold of factorizing distributions, which also enables to sample efficiently from the target distribution and to assess the likelihood of unseen data points. The embedding of the flow via the Segre map in the meta-simplex of all discrete joint distributions ensures that any target distribution can be represented in principle, whose complexity in practice only depends on the parametrization of the affinity function of the dynamical assignment flow system. Our model can be trained in a simulation-free manner without integration by conditional Riemannian flow matching, using the training data encoded as geodesics in closed-form with respect to the e-connection of information geometry. By projecting high-dimensional flow matching in the meta-simplex of joint distributions to the submanifold of factorizing distributions, our approach has strong motivation from first principles of modeling coupled discrete variables. Numerical experiments devoted to distributions of structured image labelings demonstrate the applicability to large-scale problems, which may include discrete distributions in other application areas. Performance measures show that our approach scales better with the increasing number of classes than recent related work.

6/10/2024

stat.ML cs.LG

🤿

Deep learning probability flows and entropy production rates in active matter

Nicholas M. Boffi, Eric Vanden-Eijnden

Active matter systems, from self-propelled colloids to motile bacteria, are characterized by the conversion of free energy into useful work at the microscopic scale. They involve physics beyond the reach of equilibrium statistical mechanics, and a persistent challenge has been to understand the nature of their nonequilibrium states. The entropy production rate and the probability current provide quantitative ways to do so by measuring the breakdown of time-reversal symmetry. Yet, their efficient computation has remained elusive, as they depend on the system's unknown and high-dimensional probability density. Here, building upon recent advances in generative modeling, we develop a deep learning framework to estimate the score of this density. We show that the score, together with the microscopic equations of motion, gives access to the entropy production rate, the probability current, and their decomposition into local contributions from individual particles. To represent the score, we introduce a novel, spatially-local transformer network architecture that learns high-order interactions between particles while respecting their underlying permutation symmetry. We demonstrate the broad utility and scalability of the method by applying it to several high-dimensional systems of active particles undergoing motility-induced phase separation (MIPS). We show that a single network trained on a system of 4096 particles at one packing fraction can generalize to other regions of the phase diagram, including systems with as many as 32768 particles. We use this observation to quantify the spatial structure of the departure from equilibrium in MIPS as a function of the number of particles and the packing fraction.

6/18/2024

cs.LG cs.NA

🎯

Generalised Diffusion Probabilistic Scale-Spaces

Pascal Peter

Diffusion probabilistic models excel at sampling new images from learned distributions. Originally motivated by drift-diffusion concepts from physics, they apply image perturbations such as noise and blur in a forward process that results in a tractable probability distribution. A corresponding learned reverse process generates images and can be conditioned on side information, which leads to a wide variety of practical applications. Most of the research focus currently lies on practice-oriented extensions. In contrast, the theoretical background remains largely unexplored, in particular the relations to drift-diffusion. In order to shed light on these connections to classical image filtering, we propose a generalised scale-space theory for diffusion probabilistic models. Moreover, we show conceptual and empirical connections to diffusion and osmosis filters.

6/7/2024

eess.IV cs.CV cs.LG

🏷️

Conditional Normalizing Flows for Active Learning of Coarse-Grained Molecular Representations

Henrik Schopmans, Pascal Friederich

Efficient sampling of the Boltzmann distribution of molecular systems is a long-standing challenge. Recently, instead of generating long molecular dynamics simulations, generative machine learning methods such as normalizing flows have been used to learn the Boltzmann distribution directly, without samples. However, this approach is susceptible to mode collapse and thus often does not explore the full configurational space. In this work, we address this challenge by separating the problem into two levels, the fine-grained and coarse-grained degrees of freedom. A normalizing flow conditioned on the coarse-grained space yields a probabilistic connection between the two levels. To explore the configurational space, we employ coarse-grained simulations with active learning which allows us to update the flow and make all-atom potential energy evaluations only when necessary. Using alanine dipeptide as an example, we show that our methods obtain a speedup to molecular dynamics simulations of approximately 15.9 to 216.2 compared to the speedup of 4.5 of the current state-of-the-art machine learning approach.

5/27/2024

cs.LG cs.AI