Manifold Learning by Mixture Models of VAEs for Inverse Problems

2303.15244

Published 6/13/2024 by Giovanni S. Alberti, Johannes Hertrich, Matteo Santacesaria, Silvia Sciutto

📉

Abstract

Representing a manifold of very high-dimensional data with generative models has been shown to be computationally efficient in practice. However, this requires that the data manifold admits a global parameterization. In order to represent manifolds of arbitrary topology, we propose to learn a mixture model of variational autoencoders. Here, every encoder-decoder pair represents one chart of a manifold. We propose a loss function for maximum likelihood estimation of the model weights and choose an architecture that provides us the analytical expression of the charts and of their inverses. Once the manifold is learned, we use it for solving inverse problems by minimizing a data fidelity term restricted to the learned manifold. To solve the arising minimization problem we propose a Riemannian gradient descent algorithm on the learned manifold. We demonstrate the performance of our method for low-dimensional toy examples as well as for deblurring and electrical impedance tomography on certain image manifolds.

Create account to get full access

Overview

The paper proposes a new approach to represent high-dimensional data manifolds using a mixture of variational autoencoders (VAEs).
The method learns a collection of "charts" that can collectively describe a manifold of arbitrary topology, allowing for efficient representation and optimization on the manifold.
The paper demonstrates the effectiveness of this approach on toy examples as well as real-world inverse problems like image deblurring and electrical impedance tomography.

Plain English Explanation

Representing high-dimensional data can be computationally challenging, but learning multi-modal generative models can make this more efficient. However, this typically requires that the data manifold has a global parameterization, which may not always be the case.

To handle manifolds with arbitrary topology, the researchers propose using a mixture of variational autoencoders (VAEs). Each encoder-decoder pair in the mixture represents a "chart" of the manifold, allowing the model to collectively describe complex shapes.

The method learns the model parameters through maximum likelihood estimation and provides analytical expressions for the charts and their inverses. This enables solving inverse problems by optimizing on the learned manifold, using a Riemannian gradient descent algorithm.

The researchers demonstrate the approach on toy examples as well as real-world tasks like image deblurring and electrical impedance tomography, showing improvements over previous methods.

Technical Explanation

The key idea is to represent a high-dimensional data manifold as a mixture of VAEs, where each encoder-decoder pair defines a "chart" of the manifold. This allows the model to capture manifolds with arbitrary topology, going beyond the typical assumption of a global parameterization.

The researchers propose a loss function for maximum likelihood estimation of the model weights, and design an architecture that provides analytical expressions for the charts and their inverses. This enables efficient optimization on the learned manifold, which is crucial for solving inverse problems.

Specifically, the researchers formulate the inverse problem as minimizing a data fidelity term restricted to the learned manifold. They solve this using a Riemannian gradient descent algorithm, taking into account the geometry of the learned manifold.

The method is evaluated on both low-dimensional toy examples and real-world applications like deblurring and electrical impedance tomography. The results demonstrate the effectiveness of the approach in generative modeling of manifolds and solving inverse problems.

Critical Analysis

The paper presents a novel and promising approach for representing high-dimensional data manifolds. The use of a mixture of VAEs allows the model to capture complex topologies, addressing a key limitation of previous methods.

However, the paper does not provide a thorough analysis of the method's limitations or potential issues. For example, it is unclear how the number of VAE components in the mixture is chosen, and how this affects the model's performance and scalability to very high-dimensional data.

Additionally, the paper focuses on specific inverse problems, but does not discuss the generalizability of the approach to other types of inverse problems or more diverse datasets. Further research is needed to understand the broader applicability and robustness of the method.

Nonetheless, the core idea of learning a manifold representation through a mixture of generative models is intriguing and could have significant implications for efficient optimization and generative modeling of complex data.

Conclusion

The proposed method for learning a manifold representation using a mixture of VAEs is a promising approach for efficiently modeling and optimizing on high-dimensional data manifolds. By capturing complex topologies, the technique enables solving inverse problems through optimization restricted to the learned manifold.

While the paper demonstrates the effectiveness of the approach on select examples, further research is needed to fully understand the method's limitations and broader applicability. Nonetheless, this work represents an interesting advancement in the field of generative modeling and optimization on manifolds, with potential applications in various domains.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🔍

Learning multi-modal generative models with permutation-invariant encoders and tighter variational bounds

Marcel Hirt, Domenico Campolo, Victoria Leong, Juan-Pablo Ortega

Devising deep latent variable models for multi-modal data has been a long-standing theme in machine learning research. Multi-modal Variational Autoencoders (VAEs) have been a popular generative model class that learns latent representations that jointly explain multiple modalities. Various objective functions for such models have been suggested, often motivated as lower bounds on the multi-modal data log-likelihood or from information-theoretic considerations. To encode latent variables from different modality subsets, Product-of-Experts (PoE) or Mixture-of-Experts (MoE) aggregation schemes have been routinely used and shown to yield different trade-offs, for instance, regarding their generative quality or consistency across multiple modalities. In this work, we consider a variational bound that can tightly approximate the data log-likelihood. We develop more flexible aggregation schemes that generalize PoE or MoE approaches by combining encoded features from different modalities based on permutation-invariant neural networks. Our numerical experiments illustrate trade-offs for multi-modal variational bounds and various aggregation schemes. We show that tighter variational bounds and more flexible aggregation models can become beneficial when one wants to approximate the true joint distribution over observed modalities and latent variables in identifiable models.

4/22/2024

stat.ML cs.LG

🖼️

Improving Diffusion Models for Inverse Problems using Manifold Constraints

Hyungjin Chung, Byeongsu Sim, Dohoon Ryu, Jong Chul Ye

Recently, diffusion models have been used to solve various inverse problems in an unsupervised manner with appropriate modifications to the sampling process. However, the current solvers, which recursively apply a reverse diffusion step followed by a projection-based measurement consistency step, often produce suboptimal results. By studying the generative sampling path, here we show that current solvers throw the sample path off the data manifold, and hence the error accumulates. To address this, we propose an additional correction term inspired by the manifold constraint, which can be used synergistically with the previous solvers to make the iterations close to the manifold. The proposed manifold constraint is straightforward to implement within a few lines of code, yet boosts the performance by a surprisingly large margin. With extensive experiments, we show that our method is superior to the previous methods both theoretically and empirically, producing promising results in many applications such as image inpainting, colorization, and sparse-view computed tomography. Code available https://github.com/HJ-harry/MCG_diffusion

5/21/2024

cs.LG cs.AI cs.CV stat.ML

📊

Inferring Manifolds From Noisy Data Using Gaussian Processes

David B Dunson, Nan Wu

In analyzing complex datasets, it is often of interest to infer lower dimensional structure underlying the higher dimensional observations. As a flexible class of nonlinear structures, it is common to focus on Riemannian manifolds. Most existing manifold learning algorithms replace the original data with lower dimensional coordinates without providing an estimate of the manifold in the observation space or using the manifold to denoise the original data. This article proposes a new methodology for addressing these problems, allowing interpolation of the estimated manifold between fitted data points. The proposed approach is motivated by novel theoretical properties of local covariance matrices constructed from noisy samples on a manifold. Our results enable us to turn a global manifold reconstruction problem into a local regression problem, allowing application of Gaussian processes for probabilistic manifold reconstruction. In addition to theory justifying the algorithm, we provide simulated and real data examples to illustrate the performance.

5/28/2024

stat.ML cs.LG

📉

Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes

Jaehyeong Jo, Sung Ju Hwang

Learning the distribution of data on Riemannian manifolds is crucial for modeling data from non-Euclidean space, which is required by many applications in diverse scientific fields. Yet, existing generative models on manifolds suffer from expensive divergence computation or rely on approximations of heat kernel. These limitations restrict their applicability to simple geometries and hinder scalability to high dimensions. In this work, we introduce the Riemannian Diffusion Mixture, a principled framework for building a generative diffusion process on manifolds. Instead of following the denoising approach of previous diffusion models, we construct a diffusion process using a mixture of bridge processes derived on general manifolds without requiring heat kernel estimations. We develop a geometric understanding of the mixture process, deriving the drift as a weighted mean of tangent directions to the data points that guides the process toward the data distribution. We further propose a scalable training objective for learning the mixture process that readily applies to general manifolds. Our method achieves superior performance on diverse manifolds with dramatically reduced number of in-training simulation steps for general manifolds.

6/4/2024

cs.LG stat.ML