From Biased to Unbiased Dynamics: An Infinitesimal Generator Approach

2406.09028

Published 6/14/2024 by Timoth'ee Devergne, Vladimir Kostic, Michele Parrinello, Massimiliano Pontil

From Biased to Unbiased Dynamics: An Infinitesimal Generator Approach

Abstract

We investigate learning the eigenfunctions of evolution operators for time-reversal invariant stochastic processes, a prime example being the Langevin equation used in molecular dynamics. Many physical or chemical processes described by this equation involve transitions between metastable states separated by high potential barriers that can hardly be crossed during a simulation. To overcome this bottleneck, data are collected via biased simulations that explore the state space more rapidly. We propose a framework for learning from biased simulations rooted in the infinitesimal generator of the process and the associated resolvent operator. We contrast our approach to more common ones based on the transfer operator, showing that it can provably learn the spectral properties of the unbiased system from biased data. In experiments, we highlight the advantages of our method over transfer operator approaches and recent developments based on generator learning, demonstrating its effectiveness in estimating eigenfunctions and eigenvalues. Importantly, we show that even with datasets containing only a few relevant transitions due to sub-optimal biasing, our approach recovers relevant information about the transition mechanism.

Create account to get full access

Introduction

This paper presents a novel approach for learning the dynamics of a stochastic diffusion process from data, with the goal of removing bias and recovering the true underlying dynamics. The authors propose using the infinitesimal generator as a key tool to achieve this.

Learning dynamical systems from data

The researchers explain that many real-world systems can be modeled as stochastic diffusion processes, but the observed data may be biased due to factors like measurement error or external influences. This can lead to inaccurate estimates of the true dynamics. The paper introduces a framework for learning the infinitesimal generator from biased data and then using it to recover the unbiased dynamics.

Removing bias from dynamical systems

The key insight is that the infinitesimal generator, which characterizes the local dynamics of the diffusion process, can be used to identify and remove the bias. The authors show how to estimate the infinitesimal generator from biased data and then use it to reconstruct the true unbiased dynamics. This allows them to accurately model the underlying system without the confounding effects of measurement bias.

Implications and applications

The proposed method has important implications for a wide range of fields that rely on modeling dynamical systems, such as nonequilibrium physics, machine learning, and control theory. By recovering the true dynamics, researchers can gain deeper insights into the underlying processes and make more accurate predictions.

Conclusion

This paper presents a powerful framework for learning dynamical systems from biased data by leveraging the infinitesimal generator. The ability to remove measurement bias and recover the true underlying dynamics has far-reaching implications across many scientific and engineering disciplines.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🔄

Learning the Infinitesimal Generator of Stochastic Diffusion Processes

Vladimir R. Kostic, Karim Lounici, Helene Halconruy, Timothee Devergne, Massimiliano Pontil

We address data-driven learning of the infinitesimal generator of stochastic diffusion processes, essential for understanding numerical simulations of natural and physical systems. The unbounded nature of the generator poses significant challenges, rendering conventional analysis techniques for Hilbert-Schmidt operators ineffective. To overcome this, we introduce a novel framework based on the energy functional for these stochastic processes. Our approach integrates physical priors through an energy-based risk metric in both full and partial knowledge settings. We evaluate the statistical performance of a reduced-rank estimator in reproducing kernel Hilbert spaces (RKHS) in the partial knowledge setting. Notably, our approach provides learning bounds independent of the state space dimension and ensures non-spurious spectral estimation. Additionally, we elucidate how the distortion between the intrinsic energy-induced metric of the stochastic diffusion and the RKHS metric used for generator estimation impacts the spectral learning bounds.

5/22/2024

stat.ML cs.LG

Nonequilbrium physics of generative diffusion models

Zhendong Yu, Haiping Huang

Generative diffusion models apply the concept of Langevin dynamics in physics to machine leaning, attracting a lot of interest from industrial application, but a complete picture about inherent mechanisms is still lacking. In this paper, we provide a transparent physics analysis of the diffusion models, deriving the fluctuation theorem, entropy production, Franz-Parisi potential to understand the intrinsic phase transitions discovered recently. Our analysis is rooted in non-equlibrium physics and concepts from equilibrium physics, i.e., treating both forward and backward dynamics as a Langevin dynamics, and treating the reverse diffusion generative process as a statistical inference, where the time-dependent state variables serve as quenched disorder studied in spin glass theory. This unified principle is expected to guide machine learning practitioners to design better algorithms and theoretical physicists to link the machine learning to non-equilibrium thermodynamics.

5/21/2024

cs.LG

🤔

Towards Understanding Inductive Bias in Transformers: A View From Infinity

Itay Lavie, Guy Gur-Ari, Zohar Ringel

We study inductive bias in Transformers in the infinitely over-parameterized Gaussian process limit and argue transformers tend to be biased towards more permutation symmetric functions in sequence space. We show that the representation theory of the symmetric group can be used to give quantitative analytical predictions when the dataset is symmetric to permutations between tokens. We present a simplified transformer block and solve the model at the limit, including accurate predictions for the learning curves and network outputs. We show that in common setups, one can derive tight bounds in the form of a scaling law for the learnability as a function of the context length. Finally, we argue WikiText dataset, does indeed possess a degree of permutation symmetry.

5/29/2024

cs.LG stat.ML

Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Training

Anchit Jain, Rozhin Nobahari, Aristide Baratin, Stefano Sarao Mannelli

Machine learning systems often acquire biases by leveraging undesired features in the data, impacting accuracy variably across different sub-populations. Current understanding of bias formation mostly focuses on the initial and final stages of learning, leaving a gap in knowledge regarding the transient dynamics. To address this gap, this paper explores the evolution of bias in a teacher-student setup modeling different data sub-populations with a Gaussian-mixture model. We provide an analytical description of the stochastic gradient descent dynamics of a linear classifier in this setting, which we prove to be exact in high dimension. Notably, our analysis reveals how different properties of sub-populations influence bias at different timescales, showing a shifting preference of the classifier during training. Applying our findings to fairness and robustness, we delineate how and when heterogeneous data and spurious features can generate and amplify bias. We empirically validate our results in more complex scenarios by training deeper networks on synthetic and real datasets, including CIFAR10, MNIST, and CelebA.

5/29/2024

cs.LG stat.ML