Towards Model-Agnostic Posterior Approximation for Fast and Accurate Variational Autoencoders

2403.08941

Published 6/14/2024 by Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez

Towards Model-Agnostic Posterior Approximation for Fast and Accurate Variational Autoencoders

Abstract

Inference for Variational Autoencoders (VAEs) consists of learning two models: (1) a generative model, which transforms a simple distribution over a latent space into the distribution over observed data, and (2) an inference model, which approximates the posterior of the latent codes given data. The two components are learned jointly via a lower bound to the generative model's log marginal likelihood. In early phases of joint training, the inference model poorly approximates the latent code posteriors. Recent work showed that this leads optimization to get stuck in local optima, negatively impacting the learned generative model. As such, recent work suggests ensuring a high-quality inference model via iterative training: maximizing the objective function relative to the inference model before every update to the generative model. Unfortunately, iterative training is inefficient, requiring heuristic criteria for reverting from iterative to joint training for speed. Here, we suggest an inference method that trains the generative and inference models independently. It approximates the posterior of the true model a priori; fixing this posterior approximation, we then maximize the lower bound relative to only the generative model. By conventional wisdom, this approach should rely on the true prior and likelihood of the true model to approximate its posterior (which are unknown). However, we show that we can compute a deterministic, model-agnostic posterior approximation (MAPA) of the true model's posterior. We then use MAPA to develop a proof-of-concept inference method. We present preliminary results on low-dimensional synthetic data that (1) MAPA captures the trend of the true posterior, and (2) our MAPA-based inference performs better density estimation with less computation than baselines. Lastly, we present a roadmap for scaling the MAPA-based inference method to high-dimensional data.

Create account to get full access

Overview

This paper introduces a new method for approximating the posterior distribution in Variational Autoencoders (VAEs) that is model-agnostic and can be applied to a wide range of VAE architectures.
The proposed method, called Epanechnikov Variational Autoencoder (EVAE), aims to achieve fast and accurate posterior approximation by leveraging a flexible kernel-based density estimator.
The authors demonstrate the effectiveness of EVAE on various VAE models, including the Poisson Variational Autoencoder (PoVAE) and the Learning to Solve Bayesian Inverse Problems framework, showing improved performance over existing approaches.

Plain English Explanation

Variational Autoencoders (VAEs) are a type of machine learning model that can generate new data by learning the underlying patterns in a dataset. However, a key challenge in training VAEs is accurately approximating the posterior distribution, which represents the probability of the latent variables given the observed data.

The paper introduces a new method called the Epanechnikov Variational Autoencoder (EVAE) that aims to address this challenge. EVAE uses a flexible kernel-based density estimator to approximate the posterior distribution, which allows it to work with a wide range of VAE architectures without requiring significant modifications.

The authors demonstrate that EVAE can achieve faster and more accurate posterior approximation compared to existing methods, particularly on VAE models like the Poisson Variational Autoencoder (PoVAE) and the Learning to Solve Bayesian Inverse Problems framework. This improved performance can lead to better-quality generated data and more reliable inferences from VAE models.

Technical Explanation

The paper introduces a new approach for approximating the posterior distribution in Variational Autoencoders (VAEs) called the Epanechnikov Variational Autoencoder (EVAE). The key idea behind EVAE is to use a flexible kernel-based density estimator to model the posterior, which can be applied to a wide range of VAE architectures without requiring significant modifications.

Specifically, the authors propose using the Epanechnikov kernel function as the basis for the density estimator. This kernel function has several desirable properties, such as being differentiable and having a compact support, which make it well-suited for use in VAE training. The EVAE framework then optimizes the parameters of the kernel-based posterior approximation jointly with the VAE encoder and decoder networks.

The authors evaluate the performance of EVAE on various VAE models, including the Poisson Variational Autoencoder (PoVAE) and the Learning to Solve Bayesian Inverse Problems framework. The results show that EVAE can achieve faster convergence and higher accuracy in posterior approximation compared to existing methods, leading to improved performance on downstream tasks such as data generation and inference.

Critical Analysis

The paper presents a promising approach for addressing the challenge of posterior approximation in VAEs, which is a crucial step in training these models effectively. The authors' decision to use a flexible kernel-based density estimator, specifically the Epanechnikov kernel, appears well-justified based on the empirical results and the desirable properties of the kernel function.

One potential limitation of the EVAE method is that it may require more computational resources than some other posterior approximation techniques, as the kernel-based density estimator can be more computationally expensive to evaluate. However, the authors argue that the improved accuracy and convergence speed of EVAE can offset these additional computational costs in many practical applications.

Another area for further research could be exploring the performance of EVAE on a wider range of VAE architectures and datasets, including more complex or high-dimensional problems. While the authors demonstrate the effectiveness of EVAE on several VAE models, there may be additional insights to be gained by applying the method to a broader set of scenarios.

Overall, the Epanechnikov Variational Autoencoder (EVAE) presented in this paper represents a valuable contribution to the field of VAE research, offering a model-agnostic approach to posterior approximation that can lead to faster and more accurate training of these powerful generative models.

Conclusion

This paper introduces a novel method for approximating the posterior distribution in Variational Autoencoders (VAEs) called the Epanechnikov Variational Autoencoder (EVAE). EVAE leverages a flexible kernel-based density estimator to model the posterior, which allows it to be applied to a wide range of VAE architectures without significant modifications.

The authors demonstrate that EVAE can achieve faster convergence and higher accuracy in posterior approximation compared to existing methods, particularly on VAE models like the Poisson Variational Autoencoder (PoVAE) and the Learning to Solve Bayesian Inverse Problems framework. This improved performance can lead to better-quality generated data and more reliable inferences from VAE models, making EVAE a valuable contribution to the field of generative modeling.

The paper also highlights the potential for further research, such as exploring the performance of EVAE on a wider range of VAE architectures and datasets, and investigating ways to address any potential computational overhead associated with the kernel-based density estimator. Overall, the Epanechnikov Variational Autoencoder represents a promising step forward in addressing the challenge of posterior approximation in Variational Autoencoders.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🔍

Learning multi-modal generative models with permutation-invariant encoders and tighter variational bounds

Marcel Hirt, Domenico Campolo, Victoria Leong, Juan-Pablo Ortega

Devising deep latent variable models for multi-modal data has been a long-standing theme in machine learning research. Multi-modal Variational Autoencoders (VAEs) have been a popular generative model class that learns latent representations that jointly explain multiple modalities. Various objective functions for such models have been suggested, often motivated as lower bounds on the multi-modal data log-likelihood or from information-theoretic considerations. To encode latent variables from different modality subsets, Product-of-Experts (PoE) or Mixture-of-Experts (MoE) aggregation schemes have been routinely used and shown to yield different trade-offs, for instance, regarding their generative quality or consistency across multiple modalities. In this work, we consider a variational bound that can tightly approximate the data log-likelihood. We develop more flexible aggregation schemes that generalize PoE or MoE approaches by combining encoded features from different modalities based on permutation-invariant neural networks. Our numerical experiments illustrate trade-offs for multi-modal variational bounds and various aggregation schemes. We show that tighter variational bounds and more flexible aggregation models can become beneficial when one wants to approximate the true joint distribution over observed modalities and latent variables in identifiable models.

4/22/2024

stat.ML cs.LG

🔎

Poisson Variational Autoencoder

Hadi Vafaii, Dekel Galor, Jacob L. Yates

Variational autoencoders (VAE) employ Bayesian inference to interpret sensory inputs, mirroring processes that occur in primate vision across both ventral (Higgins et al., 2021) and dorsal (Vafaii et al., 2023) pathways. Despite their success, traditional VAEs rely on continuous latent variables, which deviates sharply from the discrete nature of biological neurons. Here, we developed the Poisson VAE (P-VAE), a novel architecture that combines principles of predictive coding with a VAE that encodes inputs into discrete spike counts. Combining Poisson-distributed latent variables with predictive coding introduces a metabolic cost term in the model loss function, suggesting a relationship with sparse coding which we verify empirically. Additionally, we analyze the geometry of learned representations, contrasting the P-VAE to alternative VAE models. We find that the P-VAEencodes its inputs in relatively higher dimensions, facilitating linear separability of categories in a downstream classification task with a much better (5x) sample efficiency. Our work provides an interpretable computational framework to study brain-like sensory processing and paves the way for a deeper understanding of perception as an inferential process.

5/24/2024

cs.LG cs.AI

New!Improving Variational Autoencoder Estimation from Incomplete Data with Mixture Variational Families

Vaidotas Simkus, Michael U. Gutmann

We consider the task of estimating variational autoencoders (VAEs) when the training data is incomplete. We show that missing data increases the complexity of the model's posterior distribution over the latent variables compared to the fully-observed case. The increased complexity may adversely affect the fit of the model due to a mismatch between the variational and model posterior distributions. We introduce two strategies based on (i) finite variational-mixture and (ii) imputation-based variational-mixture distributions to address the increased posterior complexity. Through a comprehensive evaluation of the proposed approaches, we show that variational mixtures are effective at improving the accuracy of VAE estimation from incomplete data.

6/28/2024

cs.LG stat.ML

🤯

Learning to solve Bayesian inverse problems: An amortized variational inference approach using Gaussian and Flow guides

Sharmila Karumuri, Ilias Bilionis

Inverse problems, i.e., estimating parameters of physical models from experimental data, are ubiquitous in science and engineering. The Bayesian formulation is the gold standard because it alleviates ill-posedness issues and quantifies epistemic uncertainty. Since analytical posteriors are not typically available, one resorts to Markov chain Monte Carlo sampling or approximate variational inference. However, inference needs to be rerun from scratch for each new set of data. This drawback limits the applicability of the Bayesian formulation to real-time settings, e.g., health monitoring of engineered systems, and medical diagnosis. The objective of this paper is to develop a methodology that enables real-time inference by learning the Bayesian inverse map, i.e., the map from data to posteriors. Our approach is as follows. We parameterize the posterior distribution as a function of data. This work outlines two distinct approaches to do this. The first method involves parameterizing the posterior using an amortized full-rank Gaussian guide, implemented through neural networks. The second method utilizes a Conditional Normalizing Flow guide, employing conditional invertible neural networks for cases where the target posterior is arbitrarily complex. In both approaches, we learn the network parameters by amortized variational inference which involves maximizing the expectation of evidence lower bound over all possible datasets compatible with the model. We demonstrate our approach by solving a set of benchmark problems from science and engineering. Our results show that the posterior estimates of our approach are in agreement with the corresponding ground truth obtained by Markov chain Monte Carlo. Once trained, our approach provides the posterior distribution for a given observation just at the cost of a forward pass of the neural network.

5/28/2024

stat.ML cs.LG