The NFLikelihood: an unsupervised DNNLikelihood from Normalizing Flows

arXiv:2309.09743

Published 5/17/2024 by Humberto Reyes-Gonzalez, Riccardo Torre

Abstract

We propose the NFLikelihood, an unsupervised version, based on Normalizing Flows, of the DNNLikelihood proposed in Ref.[1]. We show, through realistic examples, how Autoregressive Flows, based on affine and rational quadratic spline bijectors, are able to learn complicated high-dimensional Likelihoods arising in High Energy Physics (HEP) analyses. We focus on a toy LHC analysis example already considered in the literature and on two Effective Field Theory fits of flavor and electroweak observables, whose samples have been obtained through the HEPFit code. We discuss advantages and disadvantages of the unsupervised approach with respect to the supervised one and discuss possible interplays of the two.

Overview

The paper proposes a new unsupervised version of the DNNLikelihood method, called NFLikelihood, which uses Normalizing Flows to learn complicated high-dimensional Likelihoods in High Energy Physics (HEP) analyses.
The authors show that Autoregressive Flows, based on affine and rational quadratic spline bijectors, can effectively learn these high-dimensional Likelihoods.
The paper focuses on a toy LHC analysis example and two Effective Field Theory fits of flavor and electroweak observables.
It discusses the advantages and disadvantages of the unsupervised approach compared to the supervised one, and the potential interplay between the two.

Plain English Explanation

The paper introduces a new unsupervised method called NFLikelihood that uses Normalizing Flows to learn complex, high-dimensional probability distributions in High Energy Physics (HEP) research. These probability distributions, known as Likelihoods, are important for analyzing data from particle physics experiments like the Large Hadron Collider (LHC).

The authors show that a specific type of Normalizing Flow, called Autoregressive Flows, can effectively learn these complicated Likelihoods. Autoregressive Flows use affine and rational quadratic spline transformations to model the data. The researchers test their method on a toy LHC analysis example and two real-world problems from Effective Field Theory, which is a framework for studying the fundamental particles and forces in nature.

Compared to the previously proposed DNNLikelihood method, which uses supervised machine learning, the new NFLikelihood approach is unsupervised. This means it learns the Likelihood directly from samples drawn from it, without needing the value of the Likelihood at each sample as a training label. The paper discusses the pros and cons of the unsupervised approach and how it might work together with the supervised method.

Technical Explanation

The paper introduces a new unsupervised method called NFLikelihood that uses Normalizing Flows to learn complex, high-dimensional Likelihoods in High Energy Physics (HEP) analyses. A Likelihood measures how probable the observed data are under a given choice of model parameters; in these analyses it is a function of many parameters that can be sampled and then learned as a density.

The authors show that Autoregressive Flows, a specific type of Normalizing Flow, can effectively learn these complicated Likelihoods. Autoregressive Flows chain invertible transformations (bijectors), here either affine or rational quadratic spline, to map the data to a simple, easy-to-sample distribution such as a Gaussian. Because each transformation is invertible and has a tractable Jacobian, the model provides an exact density, which allows it to capture the complex structure of the Likelihoods.
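To make the mapping to a Gaussian concrete, here is a minimal NumPy sketch of a single affine autoregressive layer. It is an illustration under simplifying assumptions, not the architecture used in the paper: the conditioner is a masked linear map rather than a neural network, and all function names (`make_masked_params`, `forward`, `log_prob`, `sample`) are hypothetical. The log-density follows from the change-of-variables rule, log p(x) = log N(z) + log|det dz/dx|.

```python
import numpy as np

def make_masked_params(dim, rng):
    """Random strictly lower-triangular weights, so output i depends only on inputs j < i."""
    mask = np.tril(np.ones((dim, dim)), k=-1)
    w_shift = rng.normal(size=(dim, dim)) * mask
    w_log_scale = 0.1 * rng.normal(size=(dim, dim)) * mask
    return w_shift, w_log_scale

def forward(x, w_shift, w_log_scale):
    """Map data x of shape (n, dim) to latent z; also return log|det dz/dx| per row."""
    shift = x @ w_shift.T            # shift_i depends on x_j with j < i only
    log_scale = x @ w_log_scale.T    # same autoregressive structure
    z = (x - shift) * np.exp(-log_scale)
    log_det = -log_scale.sum(axis=1)  # Jacobian is triangular: sum of diagonal logs
    return z, log_det

def log_prob(x, w_shift, w_log_scale):
    """Exact log-density of x under the flow with a standard normal base."""
    z, log_det = forward(x, w_shift, w_log_scale)
    dim = x.shape[1]
    log_base = -0.5 * (z ** 2).sum(axis=1) - 0.5 * dim * np.log(2 * np.pi)
    return log_base + log_det

def sample(n, w_shift, w_log_scale, rng):
    """Invert the flow one dimension at a time, starting from Gaussian noise."""
    dim = w_shift.shape[0]
    z = rng.normal(size=(n, dim))
    x = np.zeros_like(z)
    for i in range(dim):
        shift_i = x @ w_shift[i]          # uses only already-filled columns j < i
        log_scale_i = x @ w_log_scale[i]
        x[:, i] = z[:, i] * np.exp(log_scale_i) + shift_i
    return x, z

rng = np.random.default_rng(0)
params = make_masked_params(4, rng)
x, z = sample(5, *params, rng)
z_back, _ = forward(x, *params)  # round trip: should recover z
```

Density evaluation takes a single pass, while sampling needs one step per dimension; this asymmetry is typical of masked autoregressive layers of this kind.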

The paper evaluates the NFLikelihood method on three different examples: a toy LHC analysis, and two Effective Field Theory fits of flavor and electroweak observables. These examples represent realistic, high-dimensional Likelihoods that arise in HEP research.

The authors compare the unsupervised NFLikelihood approach to the previously proposed DNNLikelihood method, which uses supervised machine learning. They discuss the advantages and disadvantages of the two approaches and suggest that they could potentially be combined to further improve the modeling of Likelihoods in HEP analyses.
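The difference between the two training setups can be sketched with a one-dimensional Gaussian toy model. This is an assumption-laden illustration: mean squared error stands in as a generic regression loss (the loss actually used for the DNNLikelihood in Ref.[1] may differ), and every function name here is hypothetical.

```python
import numpy as np

def gaussian_log_density(x, mu, sigma):
    """Log-density of a 1D Gaussian, used here as a stand-in model."""
    return -0.5 * ((x - mu) / sigma) ** 2 - np.log(sigma) - 0.5 * np.log(2 * np.pi)

def supervised_loss(predicted_log_like, true_log_like):
    """Regression: requires the true log-Likelihood value at every sample."""
    return np.mean((predicted_log_like - true_log_like) ** 2)

def unsupervised_loss(model_log_density):
    """Density estimation: requires only the samples (negative mean log-density)."""
    return -np.mean(model_log_density)

rng = np.random.default_rng(0)
samples = rng.normal(loc=1.0, scale=2.0, size=10_000)

# Labels are available only in the supervised setting.
true_log_like = gaussian_log_density(samples, 1.0, 2.0)

good_params = (1.0, 2.0)   # matches the distribution the samples came from
bad_params = (0.0, 1.0)    # wrong mean and width

sup_good = supervised_loss(gaussian_log_density(samples, *good_params), true_log_like)
unsup_good = unsupervised_loss(gaussian_log_density(samples, *good_params))
unsup_bad = unsupervised_loss(gaussian_log_density(samples, *bad_params))
```

Both losses favor the correct parameters, but only the supervised one needs `true_log_like`; the unsupervised loss is computed from the samples alone.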

Critical Analysis

The paper presents a promising unsupervised approach for modeling complex Likelihoods in High Energy Physics, which could have important implications for data analysis in fields like particle physics and cosmology. The use of Normalizing Flows, specifically Autoregressive Flows, appears to be an effective way to capture the high-dimensional structure of these probability distributions.

However, the paper does not provide a thorough comparison of the NFLikelihood method to other unsupervised techniques, such as Variational Autoencoders or Flow-Matching Latent Space Transformers. It would be helpful to understand how the proposed method performs relative to these other unsupervised approaches, especially in terms of accuracy, computational efficiency, and scalability to high-dimensional problems.

Additionally, the paper could benefit from a more in-depth discussion of the potential limitations and challenges of the NFLikelihood method. For example, it would be interesting to explore how the method performs when faced with multi-modal or highly non-linear Likelihoods, and whether there are any stability or convergence issues that could arise during the training process.

Overall, the paper presents an interesting and potentially valuable contribution to the field of Likelihood modeling in High Energy Physics. By demonstrating the effectiveness of Normalizing Flows for this task, the authors have opened up new avenues for further research and development in this area.

Conclusion

The paper introduces a new unsupervised method called NFLikelihood that uses Normalizing Flows to learn complex, high-dimensional Likelihoods in High Energy Physics (HEP) analyses. The authors show that Autoregressive Flows, based on affine and rational quadratic spline bijectors, can effectively capture the structure of these probability distributions, which are crucial for analyzing data from particle physics experiments.

The NFLikelihood approach offers an alternative to the previously proposed DNNLikelihood method, which uses supervised machine learning. The paper discusses the advantages and disadvantages of the unsupervised approach and suggests that it could potentially be combined with the supervised method to further improve Likelihood modeling in HEP research.

Overall, the paper presents an interesting and promising contribution to the field of Likelihood modeling, with potential implications for data analysis in particle physics, cosmology, and other areas of science that deal with complex, high-dimensional probability distributions.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Kernelised Normalising Flows

Eshant English, Matthias Kirchler, Christoph Lippert

Normalising Flows are non-parametric statistical models characterised by their dual capabilities of density estimation and generation. This duality requires an inherently invertible architecture. However, the requirement of invertibility imposes constraints on their expressiveness, necessitating a large number of parameters and innovative architectural designs to achieve good results. Whilst flow-based models predominantly rely on neural-network-based transformations for expressive designs, alternative transformation methods have received limited attention. In this work, we present Ferumal flow, a novel kernelised normalising flow paradigm that integrates kernels into the framework. Our results demonstrate that a kernelised flow can yield competitive or superior results compared to neural network-based flows whilst maintaining parameter efficiency. Kernelised flows excel especially in the low-data regime, enabling flexible non-parametric density estimation in applications with sparse data availability.

6/28/2024

stat.ML cs.LG

On the Universality of Coupling-based Normalizing Flows

Felix Draxler, Stefan Wahl, Christoph Schnörr, Ullrich Köthe

We present a novel theoretical framework for understanding the expressive power of normalizing flows. Despite their prevalence in scientific applications, a comprehensive understanding of flows remains elusive due to their restricted architectures. Existing theorems fall short as they require the use of arbitrarily ill-conditioned neural networks, limiting practical applicability. We propose a distributional universality theorem for well-conditioned coupling-based normalizing flows such as RealNVP. In addition, we show that volume-preserving normalizing flows are not universal, what distribution they learn instead, and how to fix their expressivity. Our results support the general wisdom that affine and related couplings are expressive and in general outperform volume-preserving flows, bridging a gap between empirical results and theoretical understanding.

6/6/2024

cs.LG stat.ML

Conditional Normalizing Flows for Active Learning of Coarse-Grained Molecular Representations

Henrik Schopmans, Pascal Friederich

Efficient sampling of the Boltzmann distribution of molecular systems is a long-standing challenge. Recently, instead of generating long molecular dynamics simulations, generative machine learning methods such as normalizing flows have been used to learn the Boltzmann distribution directly, without samples. However, this approach is susceptible to mode collapse and thus often does not explore the full configurational space. In this work, we address this challenge by separating the problem into two levels, the fine-grained and coarse-grained degrees of freedom. A normalizing flow conditioned on the coarse-grained space yields a probabilistic connection between the two levels. To explore the configurational space, we employ coarse-grained simulations with active learning which allows us to update the flow and make all-atom potential energy evaluations only when necessary. Using alanine dipeptide as an example, we show that our methods obtain a speedup to molecular dynamics simulations of approximately 15.9 to 216.2 compared to the speedup of 4.5 of the current state-of-the-art machine learning approach.

5/27/2024

cs.LG cs.AI

Hierarchic Flows to Estimate and Sample High-dimensional Probabilities

Etienne Lempereur, Stéphane Mallat

Finding low-dimensional interpretable models of complex physical fields such as turbulence remains an open question, 80 years after the pioneer work of Kolmogorov. Estimating high-dimensional probability distributions from data samples suffers from an optimization and an approximation curse of dimensionality. It may be avoided by following a hierarchic probability flow from coarse to fine scales. This inverse renormalization group is defined by conditional probabilities across scales, renormalized in a wavelet basis. For a $\varphi^4$ scalar potential, sampling these hierarchic models avoids the critical slowing down at the phase transition. An outstanding issue is to also approximate non-Gaussian fields having long-range interactions in space and across scales. We introduce low-dimensional models with robust multiscale approximations of high order polynomial energies. They are calculated with a second wavelet transform, which defines interactions over two hierarchies of scales. We estimate and sample these wavelet scattering models to generate 2D vorticity fields of turbulence, and images of dark matter densities.

5/7/2024

stat.ML cs.LG