ENOT: Expectile Regularization for Fast and Accurate Training of Neural Optimal Transport

Read original: arXiv:2403.03777 - Published 7/4/2024 by Nazar Buzun, Maksim Bobrin, Dmitry V. Dylov


Overview

Presents a new training procedure for Neural Optimal Transport (NOT) that can accurately and efficiently estimate the optimal transportation plan
Addresses the main bottleneck of existing NOT solvers: the computationally intensive process of finding a near-exact approximation of the conjugate operator
Proposes a new loss function based on expectile regularization to enforce binding conditions on the learning of dual potentials, eliminating the need for extensive fine-tuning

Plain English Explanation

This paper introduces a new training approach for Neural Optimal Transport (NOT) that estimates the optimal transportation plan between two sets of data more accurately and efficiently. The key challenge with existing NOT solvers is finding a close approximation of the conjugate operator: they either solve a non-convex max-min optimization problem or run computationally intensive fine-tuning on an initial approximate prediction.

The researchers resolve this issue by proposing a new loss function based on expectile regularization. This regularization constrains how the dual potentials are learned and provides an upper bound estimate over the distribution of possible conjugate potentials. As a result, the learning process becomes more stable, and the need for additional fine-tuning is eliminated.

The proposed method, called Expectile-Regularised Neural Optimal Transport (ENOT), outperforms previous state-of-the-art approaches on standard Wasserstein-2 benchmark tasks, with up to a 3-fold improvement in quality and up to a 10-fold improvement in runtime. The researchers also demonstrate ENOT on image generation tasks with different cost functions, showing that the algorithm is robust to the choice of cost.

Technical Explanation

The main contribution of this paper is the introduction of a new training procedure for Neural Optimal Transport (NOT), which aims to accurately and efficiently estimate the optimal transportation plan between two probability distributions.
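
For orientation, the standard Kantorovich formulation that this kind of solver targets can be written as below. This is textbook optimal transport notation rather than anything quoted from the paper: the symbols \alpha and \beta (the two distributions), \pi (a transport plan with those marginals), c (the cost), and f, g (the dual potentials) are assumptions introduced here, and the equalities hold under the usual regularity conditions on the cost.

```latex
\[
\begin{aligned}
\mathrm{OT}_c(\alpha, \beta)
  &= \inf_{\pi \in \Pi(\alpha, \beta)} \int c(x, y)\, d\pi(x, y) \\
  &= \sup_{f(x) + g(y) \le c(x, y)} \int f\, d\alpha + \int g\, d\beta \\
  &= \sup_{g} \int g^{c}\, d\alpha + \int g\, d\beta,
  \qquad g^{c}(x) = \inf_{y} \bigl[ c(x, y) - g(y) \bigr].
\end{aligned}
\]
```

The last line is where the conjugate operator enters: once g is fixed, the best admissible f is the c-transform g^c, so the dual problem can be stated over a single potential.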

The key bottleneck in existing NOT solvers is the process of finding a near-exact approximation of the conjugate operator (i.e., the c-transform). This is typically done either by optimizing over non-convex max-min objectives or through computationally intensive fine-tuning of the initial approximated prediction.
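
To make the conjugate operator concrete, here is a minimal sketch of the c-transform, g^c(x) = min over y of [c(x, y) - g(y)], evaluated by brute force over a small finite set of target points. It is an illustration only, not the paper's method: the helper names `squared_cost` and `c_transform`, the quadratic cost, and the toy data are all assumptions made for this example. In a neural solver the minimization runs over a continuous space, which is why it has to be approximated.

```python
# Illustrative sketch only: brute-force c-transform on a finite target set.
# squared_cost and c_transform are hypothetical helpers, not a library API.

def squared_cost(x, y):
    """Quadratic cost c(x, y) = 0.5 * ||x - y||^2 for points given as tuples."""
    return 0.5 * sum((xi - yi) ** 2 for xi, yi in zip(x, y))


def c_transform(g_values, ys, x, cost=squared_cost):
    """Return (g^c(x), minimizing y), where g^c(x) = min_j [c(x, y_j) - g(y_j)]."""
    candidates = ((cost(x, y) - g, y) for g, y in zip(g_values, ys))
    return min(candidates, key=lambda pair: pair[0])


# Toy data: three target points and arbitrary potential values g(y_j).
ys = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0)]
g_values = [0.0, 0.25, 1.0]
x = (1.0, 1.0)

value, y_star = c_transform(g_values, ys, x)
print("g^c(x) =", value, "attained at y =", y_star)
```

By construction, `value + g(y_j) <= c(x, y_j)` for every target point, with equality at the minimizer. This is the dual feasibility constraint that an exact conjugate satisfies tightly and that an approximate one can violate.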

To address these issues, the researchers propose a new loss function based on expectile regularization. This regularization technique enforces binding conditions on the learning of the dual potentials, providing an upper bound estimate over the distribution of possible conjugate potentials. As a result, the learning process becomes more stable, and the need for additional fine-tuning is eliminated.
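
The generic expectile idea can be sketched as follows. The expectile loss is an asymmetric squared loss, |tau - 1[u < 0]| * u^2, and the value m that minimizes its mean over residuals y - m is called the tau-expectile. At tau = 0.5 this is the ordinary mean, and as tau approaches 1 it moves toward the maximum of the sample, which is one way to read the "upper bound estimate" wording above. This is not the ENOT loss itself, whose exact form is given in the paper; the function names and sample values below are assumptions for illustration.

```python
# Illustrative sketch only: the generic expectile loss and the tau-expectile
# of a finite sample. This is not the ENOT training objective.

def expectile_loss(u, tau):
    """Asymmetric squared loss |tau - 1[u < 0]| * u^2."""
    weight = tau if u >= 0 else 1.0 - tau
    return weight * u * u


def expectile(samples, tau, iters=200):
    """Return the m minimizing the mean of expectile_loss(y - m, tau).

    Iterates the first-order condition m = sum(w_i * y_i) / sum(w_i),
    with w_i = tau if y_i >= m else 1 - tau.
    """
    m = sum(samples) / len(samples)
    for _ in range(iters):
        weights = [tau if y >= m else 1.0 - tau for y in samples]
        m = sum(w * y for w, y in zip(weights, samples)) / sum(weights)
    return m


samples = [0.0, 1.0, 2.0, 3.0, 10.0]
for tau in (0.5, 0.9, 0.99):
    print(tau, expectile(samples, tau))
```

Raising tau penalizes underestimates more heavily than overestimates, so the minimizer drifts toward the largest sample without ever requiring an explicit max or min search.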

The proposed method, called Expectile-Regularised Neural Optimal Transport (ENOT), outperforms previous state-of-the-art approaches on established Wasserstein-2 benchmark tasks by a large margin, with up to a 3-fold improvement in quality and up to a 10-fold improvement in runtime. The researchers also showcase the performance of ENOT on various tasks, such as image generation, demonstrating the robustness of the proposed algorithm to different cost functions.

Critical Analysis

The paper presents a compelling solution to the main bottleneck in existing NOT solvers, which is the challenging process of approximating the conjugate operator. The proposed expectile regularization technique appears to be a theoretically sound and practically effective approach, as evidenced by the significant performance improvements on benchmark tasks.

However, the paper does not address potential limitations or areas for further research. For example, it would be interesting to see how the ENOT algorithm performs on more complex or high-dimensional tasks, such as quantum applications of optimal transport or domain transfer problems. Additionally, the authors could explore the sensitivity of the method to hyperparameter choices or the impact of different cost functions on the overall performance.

It would also be valuable for the authors to provide a more detailed discussion of the theoretical underpinnings of the expectile regularization and how it relates to or differs from other regularization techniques used in the field of optimal transport.

Conclusion

This paper presents a novel training procedure for Neural Optimal Transport (NOT) called Expectile-Regularised Neural Optimal Transport (ENOT), which addresses the main computational bottleneck of existing NOT solvers. By introducing a new loss function based on expectile regularization, the researchers are able to significantly improve the accuracy and efficiency of the optimal transportation plan estimation process, outperforming previous state-of-the-art approaches.

The performance gains demonstrated by ENOT on established benchmarks and its robustness to different cost functions suggest that this approach could have a substantial impact on a wide range of applications that rely on optimal transport, from image generation to quantum physics. While the paper does not explore potential limitations or areas for further research, the core contributions represent an important advancement in the field of neural optimal transport.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!


Related Papers


ENOT: Expectile Regularization for Fast and Accurate Training of Neural Optimal Transport

Nazar Buzun, Maksim Bobrin, Dmitry V. Dylov

We present a new approach for Neural Optimal Transport (NOT) training procedure, capable of accurately and efficiently estimating optimal transportation plan via specific regularization on dual Kantorovich potentials. The main bottleneck of existing NOT solvers is associated with the procedure of finding a near-exact approximation of the conjugate operator (i.e., the c-transform), which is done either by optimizing over non-convex max-min objectives or by the computationally intensive fine-tuning of the initial approximated prediction. We resolve both issues by proposing a new, theoretically justified loss in the form of expectile regularisation which enforces binding conditions on the learning process of dual potentials. Such a regularization provides the upper bound estimation over the distribution of possible conjugate potentials and makes the learning stable, completely eliminating the need for additional extensive fine-tuning. Proposed method, called Expectile-Regularised Neural Optimal Transport (ENOT), outperforms previous state-of-the-art approaches on the established Wasserstein-2 benchmark tasks by a large margin (up to a 3-fold improvement in quality and up to a 10-fold improvement in runtime). Moreover, we showcase performance of ENOT for varying cost functions on different tasks such as image generation, showing robustness of proposed algorithm. OTT-JAX library includes our implementation of ENOT algorithm https://ott-jax.readthedocs.io/en/latest/tutorials/ENOT.html

7/4/2024


Sparse Domain Transfer via Elastic Net Regularization

Jingwei Zhang, Farzan Farnia

Transportation of samples across different domains is a central task in several machine learning problems. A sensible requirement for domain transfer tasks in computer vision and language domains is the sparsity of the transportation map, i.e., the transfer algorithm aims to modify the least number of input features while transporting samples across the source and target domains. In this work, we propose Elastic Net Optimal Transport (ENOT) to address the sparse distribution transfer problem. The ENOT framework utilizes the $L_1$-norm and $L_2$-norm regularization mechanisms to find a sparse and stable transportation map between the source and target domains. To compute the ENOT transport map, we consider the dual formulation of the ENOT optimization task and prove that the sparsified gradient of the optimal potential function in the ENOT's dual representation provides the ENOT transport map. Furthermore, we demonstrate the application of the ENOT framework to perform feature selection for sparse domain transfer. We present the numerical results of applying ENOT to several domain transfer problems for synthetic Gaussian mixtures and real image and text data. Our empirical results indicate the success of the ENOT framework in identifying a sparse domain transport map.

5/14/2024

Progressive Entropic Optimal Transport Solvers

Parnian Kassraie, Aram-Alexandre Pooladian, Michal Klein, James Thornton, Jonathan Niles-Weed, Marco Cuturi

Optimal transport (OT) has profoundly impacted machine learning by providing theoretical and computational tools to realign datasets. In this context, given two large point clouds of sizes $n$ and $m$ in $\mathbb{R}^d$, entropic OT (EOT) solvers have emerged as the most reliable tool to either solve the Kantorovich problem and output a $n\times m$ coupling matrix, or to solve the Monge problem and learn a vector-valued push-forward map. While the robustness of EOT couplings/maps makes them a go-to choice in practical applications, EOT solvers remain difficult to tune because of a small but influential set of hyperparameters, notably the omnipresent entropic regularization strength $\varepsilon$. Setting $\varepsilon$ can be difficult, as it simultaneously impacts various performance metrics, such as compute speed, statistical performance, generalization, and bias. In this work, we propose a new class of EOT solvers (ProgOT), that can estimate both plans and transport maps. We take advantage of several opportunities to optimize the computation of EOT solutions by dividing mass displacement using a time discretization, borrowing inspiration from dynamic OT formulations, and conquering each of these steps using EOT with properly scheduled parameters. We provide experimental evidence demonstrating that ProgOT is a faster and more robust alternative to standard solvers when computing couplings at large scales, even outperforming neural network-based approaches. We also prove statistical consistency of our approach for estimating optimal transport maps.

9/18/2024


Efficient Neural Network Approaches for Conditional Optimal Transport with Applications in Bayesian Inference

Zheyu Oliver Wang, Ricardo Baptista, Youssef Marzouk, Lars Ruthotto, Deepanshu Verma

We present two neural network approaches that approximate the solutions of static and dynamic conditional optimal transport (COT) problems. Both approaches enable conditional sampling and conditional density estimation, which are core tasks in Bayesian inference, particularly in the simulation-based (likelihood-free) setting. Our methods represent the target conditional distributions as transformations of a tractable reference distribution and, therefore, fall into the framework of measure transport. Although many measure transport approaches model the transformation as COT maps, obtaining the map is computationally challenging, even in moderate dimensions. To improve scalability, our numerical algorithms use neural networks to parameterize COT maps and further exploit the structure of the COT problem. Our static approach approximates the map as the gradient of a partially input-convex neural network. It uses a novel numerical implementation to increase computational efficiency compared to state-of-the-art alternatives. Our dynamic approach approximates the conditional optimal transport via the flow map of a regularized neural ODE; compared to the static approach, it is slower to train but offers more modeling choices and can lead to faster sampling. We demonstrate both algorithms numerically, comparing them with competing state-of-the-art approaches, using benchmark datasets and simulation-based Bayesian inverse problems.

7/22/2024