Distributional Preference Alignment of LLMs via Optimal Transport

Read original: arXiv:2406.05882 - Published 6/11/2024 by Igor Melnyk, Youssef Mroueh, Brian Belgodere, Mattia Rigotti, Apoorva Nitsure, Mikhail Yurochkin, Kristjan Greenewald, Jiri Navratil, Jerret Ross

Overview

This paper presents Alignment via Optimal Transport (AOT), a method for aligning large language models (LLMs) with preferences at the level of distributions rather than individual samples.
The authors observe that current alignment techniques rely on pairwise human preferences at the sample level, which does not imply alignment at the distributional level.
AOT works on unpaired preference data: it makes the reward distribution of positive samples dominate the reward distribution of negative samples in the first-order stochastic sense, and casts a convex relaxation of that condition as an optimal transport problem.

Plain English Explanation

Large language models (LLMs) are powerful AI systems that can generate human-like text on a wide range of topics. However, the outputs of these models may not always align with the preferences and values of the people using them. The authors of this paper have developed a technique to help "steer" an LLM so that, across the whole distribution of its outputs, preferred responses score higher than dispreferred ones.

The key idea is to express preferences in terms of first-order stochastic dominance. One distribution of scores dominates another if, at every quantile, its score is at least as high. The authors then use an optimization framework called "optimal transport" to measure how far the model is from satisfying this condition and to penalize the shortfall during training.
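
To make the idea concrete, here is a minimal sketch of checking first-order stochastic dominance between two equally sized samples of scores. The function name `dominates` and the toy scores are illustrative assumptions, not taken from the paper. The check relies on the standard fact that, for empirical samples of equal size, dominance amounts to comparing the sorted values position by position.

```python
def dominates(scores_a, scores_b):
    """Return True if sample A first-order stochastically dominates sample B.

    Assumes both samples have the same size, so the i-th sorted value of
    each sample corresponds to the same empirical quantile.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("this sketch assumes equally sized samples")
    # Compare quantile by quantile: A must be at least as high everywhere.
    return all(a >= b for a, b in zip(sorted(scores_a), sorted(scores_b)))


# Hypothetical scores for illustration only.
preferred = [0.9, 0.4, 0.7]
dispreferred = [0.3, 0.6, 0.1]
print(dominates(preferred, dispreferred))  # True
print(dominates(dispreferred, preferred))  # False
```

Note that the comparison is between distributions, not pairs: one preferred score (0.4) is lower than one dispreferred score (0.6), yet the preferred sample still dominates once both are sorted.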

The LLM is then fine-tuned with this objective, which lets it generate outputs that are more in line with the stated preferences without being retrained from scratch. Because the condition compares whole distributions, the preference data does not need to come in matched pairs of good and bad responses.

Technical Explanation

The authors start from first-order stochastic dominance: the reward distribution of the positive (preferred) samples should dominate the reward distribution of the negative (dispreferred) samples. Because this condition is stated on distributions rather than on individual comparisons, it applies to unpaired preference data.
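
For reference, the standard definition of first-order stochastic dominance can be written as follows. The symbols are notation chosen here and may differ from the paper's: $U$ and $V$ denote the rewards of a positive and a negative sample, $F$ a cumulative distribution function, and $Q$ a quantile function.

```latex
U \succeq_{\mathrm{FSD}} V
\iff F_U(t) \le F_V(t) \quad \text{for all } t \in \mathbb{R}
\iff Q_U(p) \ge Q_V(p) \quad \text{for all } p \in (0, 1).
```

In words, the dominating variable is less likely to fall below any threshold $t$, which is the same as having a quantile at least as high at every level $p$.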

They then introduce a convex relaxation of this dominance condition and cast it as an optimal transport problem with a smooth and convex cost. Because the rewards are one-dimensional and the cost is convex, the problem has a closed-form solution on empirical measures: sort both sets of rewards and match them in order. The resulting objective penalizes violations of dominance and is used to fine-tune the LLM.
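
In one dimension with a convex cost, the optimal transport plan between two equally sized empirical samples matches sorted values in order, so such an objective can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `fsd_violation`, the `margin` parameter, and the squared-hinge cost are hypothetical choices standing in for the paper's smooth convex cost.

```python
def fsd_violation(pos_rewards, neg_rewards, margin=0.0):
    """Average penalty for violating first-order dominance of pos over neg.

    Assumptions (not from the paper): equal sample sizes and a squared-hinge
    cost, which is convex and differentiable.
    """
    if not pos_rewards or len(pos_rewards) != len(neg_rewards):
        raise ValueError("this sketch assumes non-empty, equally sized samples")
    # Sorting aligns the i-th empirical quantiles of the two samples; for a
    # convex cost in one dimension this in-order matching is the optimal plan.
    pos_sorted = sorted(pos_rewards)
    neg_sorted = sorted(neg_rewards)
    total = 0.0
    for p, n in zip(pos_sorted, neg_sorted):
        # Penalize only quantiles where the positive reward fails to exceed
        # the negative reward by at least `margin`.
        shortfall = margin - (p - n)
        total += max(0.0, shortfall) ** 2
    return total / len(pos_sorted)


# Hypothetical rewards for illustration only.
print(fsd_violation([3.0, 1.0, 2.0], [0.0, 1.0, 0.5]))  # 0.0 (no violation)
print(fsd_violation([0.0, 1.0, 0.5], [3.0, 1.0, 2.0]))  # positive penalty
```

In an actual fine-tuning loop the rewards would be computed from the model and the penalty differentiated with respect to its parameters; that part is omitted here.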

The authors also analyze the sample complexity of AOT through the dual of the optimal transport problem and show that it converges at the parametric rate. Empirically, they report that across a diverse set of alignment datasets and LLMs, AOT yields state-of-the-art models in the 7B family when evaluated with Open LLM Benchmarks and AlpacaEval.

Critical Analysis

One potential limitation of this approach is that it assumes the user's preferences can be accurately captured by a first-order stochastic dominance relationship. In some cases, the user's preferences may be more complex and require a more sophisticated preference model.

Additionally, although the one-dimensional optimal transport problem has a closed-form solution via sorting, the overall fine-tuning procedure can still be computationally expensive for large-scale LLMs. The authors mention that they use entropic regularization to speed up the optimization, but the reported results are for 7B-family models, and further research may be needed to make the technique scalable to very large models.

It would also be interesting to see how this approach performs in deployed applications, where the user's preferences may be more nuanced and context-dependent. The reported evaluation relies on benchmark suites (Open LLM Benchmarks and AlpacaEval), so further research is needed to understand the practical implications and limitations of this technique.

Conclusion

This paper presents Alignment via Optimal Transport (AOT), an approach for aligning large language models with preferences at the distributional level. By requiring the reward distribution of positive samples to stochastically dominate that of negative samples in the first order, and by penalizing violations of that condition through an optimal transport objective, the authors obtain a fine-tuning technique that can help steer LLM outputs to better match user values and priorities.

While there are some potential limitations and areas for further research, this work represents an important step towards developing more user-centric and ethically aligned language models. As LLMs become increasingly prevalent in various applications, techniques like this will be crucial for ensuring that the outputs of these models are well aligned with the needs and preferences of the people using them.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Distributional Preference Alignment of LLMs via Optimal Transport

Igor Melnyk, Youssef Mroueh, Brian Belgodere, Mattia Rigotti, Apoorva Nitsure, Mikhail Yurochkin, Kristjan Greenewald, Jiri Navratil, Jerret Ross

Current LLM alignment techniques use pairwise human preferences at a sample level, and as such, they do not imply an alignment on the distributional level. We propose in this paper Alignment via Optimal Transport (AOT), a novel method for distributional preference alignment of LLMs. AOT aligns LLMs on unpaired preference data by making the reward distribution of the positive samples stochastically dominant in the first order on the distribution of negative samples. We introduce a convex relaxation of this first-order stochastic dominance and cast it as an optimal transport problem with a smooth and convex cost. Thanks to the one-dimensional nature of the resulting optimal transport problem and the convexity of the cost, it has a closed-form solution via sorting on empirical measures. We fine-tune LLMs with this AOT objective, which enables alignment by penalizing the violation of the stochastic dominance of the reward distribution of the positive samples on the reward distribution of the negative samples. We analyze the sample complexity of AOT by considering the dual of the OT problem and show that it converges at the parametric rate. Empirically, we show on a diverse set of alignment datasets and LLMs that AOT leads to state-of-the-art models in the 7B family of models when evaluated with Open LLM Benchmarks and AlpacaEval.

6/11/2024

Linear Optimal Partial Transport Embedding

Yikun Bai, Ivan Medri, Rocio Diaz Martin, Rana Muhammad Shahroz Khan, Soheil Kolouri

Optimal transport (OT) has gained popularity due to its various applications in fields such as machine learning, statistics, and signal processing. However, the balanced mass requirement limits its performance in practical problems. To address these limitations, variants of the OT problem, including unbalanced OT, Optimal partial transport (OPT), and Hellinger Kantorovich (HK), have been proposed. In this paper, we propose the Linear optimal partial transport (LOPT) embedding, which extends the (local) linearization technique on OT and HK to the OPT problem. The proposed embedding allows for faster computation of OPT distance between pairs of positive measures. Besides our theoretical contributions, we demonstrate the LOPT embedding technique in point-cloud interpolation and PCA analysis.

4/24/2024

Automatic Outlier Rectification via Optimal Transport

Jose Blanchet, Jiajin Li, Markus Pelger, Greg Zanotti

In this paper, we propose a novel conceptual framework to detect outliers using optimal transport with a concave cost function. Conventional outlier detection approaches typically use a two-stage procedure: first, outliers are detected and removed, and then estimation is performed on the cleaned data. However, this approach does not inform outlier removal with the estimation task, leaving room for improvement. To address this limitation, we propose an automatic outlier rectification mechanism that integrates rectification and estimation within a joint optimization framework. We take the first step to utilize the optimal transport distance with a concave cost function to construct a rectification set in the space of probability distributions. Then, we select the best distribution within the rectification set to perform the estimation task. Notably, the concave cost function we introduced in this paper is the key to making our estimator effectively identify the outlier during the optimization process. We demonstrate the effectiveness of our approach over conventional approaches in simulations and empirical analyses for mean estimation, least absolute regression, and the fitting of option implied volatility surfaces.

7/12/2024

Submodular Framework for Structured-Sparse Optimal Transport

Piyushi Manupriya, Pratik Jawanpuria, Karthik S. Gurumoorthy, SakethaNath Jagarlapudi, Bamdev Mishra

Unbalanced optimal transport (UOT) has recently gained much attention due to its flexible framework for handling un-normalized measures and its robustness properties. In this work, we explore learning (structured) sparse transport plans in the UOT setting, i.e., transport plans have an upper bound on the number of non-sparse entries in each column (structured sparse pattern) or in the whole plan (general sparse pattern). We propose novel sparsity-constrained UOT formulations building on the recently explored maximum mean discrepancy based UOT. We show that the proposed optimization problem is equivalent to the maximization of a weakly submodular function over a uniform matroid or a partition matroid. We develop efficient gradient-based discrete greedy algorithms and provide the corresponding theoretical guarantees. Empirically, we observe that our proposed greedy algorithms select a diverse support set and we illustrate the efficacy of the proposed approach in various applications.

6/10/2024