CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs

Read original: arXiv:2406.03158 - Published 6/6/2024 by Shuang Ao, Stefan Rueger, Advaith Siddharthan

CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs

Overview

This paper proposes a novel approach called CSS (Contrastive Semantic Similarity) for quantifying the uncertainty of large language models (LLMs).
The key idea is to leverage the semantic similarity between the model's output and a set of contrasting reference texts to estimate the model's confidence in its predictions.
The authors demonstrate the effectiveness of CSS on various tasks, including natural language inference, question-answering, and language generation, and show that it outperforms existing uncertainty quantification methods.

Plain English Explanation

The paper introduces a new way to measure how confident a large language model (LLM) is in its predictions. Large language models are AI systems that can generate human-like text, answer questions, and perform other language-related tasks. However, these models can sometimes be uncertain or unsure about their outputs, and it's important to be able to quantify this uncertainty.

The researchers' approach, called Contrastive Semantic Similarity (CSS), works by comparing the model's output to a set of "contrasting" reference texts. These reference texts are carefully chosen to represent alternative ways of expressing the same information. By measuring the semantic similarity between the model's output and these contrasting references, the researchers can estimate how confident the model is in its prediction.

For example, if the model generates a text that is very similar to one of the contrasting references, it suggests the model is highly confident in its output. But if the model's output is equally similar to multiple contrasting references, it indicates the model is more uncertain.

The researchers show that this CSS approach outperforms existing methods for quantifying uncertainty in LLMs, across a variety of language tasks like understanding natural language, answering questions, and generating text. This suggests CSS could be a valuable tool for helping users understand when to trust the outputs of these powerful language models.

Technical Explanation

The key innovation of the CSS approach is its use of contrastive semantic similarity to quantify uncertainty. The authors first obtain a set of reference texts that represent alternative ways of expressing the same information as the model's output. They then measure the semantic similarity between the model's output and each of these reference texts using a neural language model.

The intuition is that if the model's output is highly similar to one of the reference texts, it indicates the model is confident in its prediction. But if the output is equally similar to multiple references, it suggests the model is more uncertain.

To formalize this idea, the authors define a "contrastive semantic similarity" (CSS) score that captures this relative similarity. Specifically, the CSS score is calculated as the ratio of the maximum semantic similarity between the model's output and any reference text, divided by the average semantic similarity across all references.

The authors demonstrate the effectiveness of CSS on a range of language tasks, including natural language inference, question-answering, and text generation. They show that CSS outperforms existing uncertainty quantification methods, both in terms of accurately identifying the model's confidence and in downstream applications that rely on uncertainty estimates.

Critical Analysis

The CSS approach presented in this paper is a promising step towards more reliable and interpretable uncertainty quantification for large language models. By leveraging semantic similarity in a contrastive way, the method provides a principled way to assess the model's confidence that goes beyond simple output probabilities.

However, the authors acknowledge some limitations of their work. The reliance on carefully curated reference texts may limit the scalability and generalization of the approach, as generating high-quality contrasting references could be challenging for arbitrary input-output pairs.

Additionally, the authors note that the CSS score may be sensitive to the choice of language model used for computing semantic similarity. It would be valuable to explore the robustness of the approach to different similarity models and understand how the performance might vary across diverse language tasks and datasets.

Finally, while the paper demonstrates the effectiveness of CSS on a range of benchmarks, it would be helpful to see more real-world case studies or user studies to understand how the uncertainty estimates provided by CSS could benefit practitioners in practical applications of large language models.

Overall, the CSS approach represents an interesting and innovative contribution to the field of uncertainty quantification for large language models. Further research to address the limitations and explore the broader implications of this work could help advance the development of more reliable and trustworthy AI systems.

Conclusion

This paper presents a novel approach called Contrastive Semantic Similarity (CSS) for quantifying the uncertainty of large language models (LLMs). The key idea is to leverage the semantic similarity between the model's output and a set of carefully chosen contrasting reference texts to estimate the model's confidence in its predictions.

The authors demonstrate that the CSS approach outperforms existing uncertainty quantification methods across a variety of language tasks, including natural language inference, question-answering, and text generation. This suggests that CSS could be a valuable tool for helping users understand when to trust the outputs of powerful language models, which is an important consideration as these models become more widely deployed.

While the paper identifies some limitations of the CSS approach, such as the reliance on curated reference texts, the overall contribution represents a significant step forward in the field of uncertainty quantification for LLMs. Further research to address these challenges and explore the broader implications of this work could lead to even more reliable and trustworthy AI systems in the future.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs

Shuang Ao, Stefan Rueger, Advaith Siddharthan

Despite the impressive capability of large language models (LLMs), knowing when to trust their generations remains an open challenge. The recent literature on uncertainty quantification of natural language generation (NLG) utilises a conventional natural language inference (NLI) classifier to measure the semantic dispersion of LLMs responses. These studies employ logits of NLI classifier for semantic clustering to estimate uncertainty. However, logits represent the probability of the predicted class and barely contain feature information for potential clustering. Alternatively, CLIP (Contrastive Language-Image Pre-training) performs impressively in extracting image-text pair features and measuring their similarity. To extend its usability, we propose Contrastive Semantic Similarity, the CLIP-based feature extraction module to obtain similarity features for measuring uncertainty for text pairs. We apply this method to selective NLG, which detects and rejects unreliable generations for better trustworthiness of LLMs. We conduct extensive experiments with three LLMs on several benchmark question-answering datasets with comprehensive evaluation metrics. Results show that our proposed method performs better in estimating reliable responses of LLMs than comparable baselines. Results show that our proposed method performs better in estimating reliable responses of LLMs than comparable baselines. The code are available at url{https://github.com/AoShuang92/css_uq_llms}.

6/6/2024

💬

Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models

Zhen Lin, Shubhendu Trivedi, Jimeng Sun

Large language models (LLMs) specializing in natural language generation (NLG) have recently started exhibiting promising capabilities across a variety of domains. However, gauging the trustworthiness of responses generated by LLMs remains an open challenge, with limited research on uncertainty quantification (UQ) for NLG. Furthermore, existing literature typically assumes white-box access to language models, which is becoming unrealistic either due to the closed-source nature of the latest LLMs or computational constraints. In this work, we investigate UQ in NLG for *black-box* LLMs. We first differentiate *uncertainty* vs *confidence*: the former refers to the ``dispersion'' of the potential predictions for a fixed input, and the latter refers to the confidence on a particular prediction/generation. We then propose and compare several confidence/uncertainty measures, applying them to *selective NLG* where unreliable results could either be ignored or yielded for further assessment. Experiments were carried out with several popular LLMs on question-answering datasets (for evaluation purposes). Results reveal that a simple measure for the semantic dispersion can be a reliable predictor of the quality of LLM responses, providing valuable insights for practitioners on uncertainty management when adopting LLMs. The code to replicate our experiments is available at https://github.com/zlin7/UQ-NLG.

5/21/2024

CLUE: Concept-Level Uncertainty Estimation for Large Language Models

Yu-Hsiang Wang, Andrew Bai, Che-Ping Tsai, Cho-Jui Hsieh

Large Language Models (LLMs) have demonstrated remarkable proficiency in various natural language generation (NLG) tasks. Previous studies suggest that LLMs' generation process involves uncertainty. However, existing approaches to uncertainty estimation mainly focus on sequence-level uncertainty, overlooking individual pieces of information within sequences. These methods fall short in separately assessing the uncertainty of each component in a sequence. In response, we propose a novel framework for Concept-Level Uncertainty Estimation (CLUE) for LLMs. We leverage LLMs to convert output sequences into concept-level representations, breaking down sequences into individual concepts and measuring the uncertainty of each concept separately. We conduct experiments to demonstrate that CLUE can provide more interpretable uncertainty estimation results compared with sentence-level uncertainty, and could be a useful tool for various tasks such as hallucination detection and story generation.

9/6/2024

💬

Semantic Density: Uncertainty Quantification in Semantic Space for Large Language Models

Xin Qiu, Risto Miikkulainen

With the widespread application of Large Language Models (LLMs) to various domains, concerns regarding the trustworthiness of LLMs in safety-critical scenarios have been raised, due to their unpredictable tendency to hallucinate and generate misinformation. Existing LLMs do not have an inherent functionality to provide the users with an uncertainty metric for each response it generates, making it difficult to evaluate trustworthiness. Although a number of works aim to develop uncertainty quantification methods for LLMs, they have fundamental limitations, such as being restricted to classification tasks, requiring additional training and data, considering only lexical instead of semantic information, and being prompt-wise but not response-wise. A new framework is proposed in this paper to address these issues. Semantic density extracts uncertainty information for each response from a probability distribution perspective in semantic space. It has no restriction on task types and is off-the-shelf for new models and tasks. Experiments on seven state-of-the-art LLMs, including the latest Llama 3 and Mixtral-8x22B models, on four free-form question-answering benchmarks demonstrate the superior performance and robustness of semantic density compared to prior approaches.

5/28/2024