On Large Language Models' Hallucination with Regard to Known Facts

2403.20009

Published 4/1/2024 by Che Jiang, Biqing Qi, Xiangyu Hong, Dayuan Fu, Yang Cheng, Fandong Meng, Mo Yu, Bowen Zhou, Jie Zhou

cs.CL cs.LG

On Large Language Models' Hallucination with Regard to Known Facts

Abstract

Large language models are successful in answering factoid questions but are also prone to hallucination.We investigate the phenomenon of LLMs possessing correct answer knowledge yet still hallucinating from the perspective of inference dynamics, an area not previously covered in studies on hallucinations.We are able to conduct this analysis via two key ideas.First, we identify the factual questions that query the same triplet knowledge but result in different answers. The difference between the model behaviors on the correct and incorrect outputs hence suggests the patterns when hallucinations happen. Second, to measure the pattern, we utilize mappings from the residual streams to vocabulary space. We reveal the different dynamics of the output token probabilities along the depths of layers between the correct and hallucinated cases. In hallucinated cases, the output token's information rarely demonstrates abrupt increases and consistent superiority in the later stages of the model. Leveraging the dynamic curve as a feature, we build a classifier capable of accurately detecting hallucinatory predictions with an 88% success rate. Our study shed light on understanding the reasons for LLMs' hallucinations on their known facts, and more importantly, on accurately predicting when they are hallucinating.

Get summaries of the top AI research delivered straight to your inbox:

Overview

The paper explores the tendency of large language models (LLMs) to generate responses that contradict known facts, a phenomenon known as "hallucination".
The researchers designed experiments to assess the prevalence and characteristics of hallucination in LLMs.
The findings provide insights into the limitations of current LLMs and the need for improved techniques to ensure the reliability and trustworthiness of these models.

Plain English Explanation

Large language models (LLMs) are powerful artificial intelligence systems that can generate human-like text on a wide range of topics. However, these models can sometimes produce responses that don't align with established facts or real-world knowledge. This is known as "hallucination" - the model essentially makes up information that appears plausible but is not actually true.

The researchers in this study wanted to better understand the hallucination problem. They designed a series of experiments to assess how often LLMs exhibit hallucination, what types of information they are most likely to get wrong, and what factors might contribute to this issue.

Overall, the findings suggest that hallucination is a significant challenge for current LLMs. The models frequently generated responses that contradicted well-known facts, especially when asked about specific details or obscure information. This raises concerns about the reliability and trustworthiness of these powerful AI systems, as users may mistakenly believe the fabricated information is accurate.

Technical Explanation

The paper explores the phenomenon of hallucination in large language models (LLMs), where the models generate responses that contradict known facts. The researchers conducted a series of experiments to assess the prevalence and characteristics of hallucination in LLMs.

The experimental setup involved posing a variety of questions to several prominent LLMs, including GPT-3, BART, and T5. The questions covered a range of topics and varying levels of difficulty, from common knowledge to more obscure facts. The researchers then analyzed the model responses to identify instances of hallucination, where the generated text contradicted established information.

The results showed that hallucination is a significant issue for current LLMs, with the models frequently producing incorrect or fabricated responses, especially for questions requiring specific details or knowledge of less common facts. The researchers also found that hallucination was more prevalent in responses to complex or open-ended questions, suggesting that the models struggle to maintain coherence and grounding in reality when tasked with generating longer, more involved text.

Critical Analysis

The paper provides valuable insights into the limitations of current LLMs and the need for further research to address the hallucination problem. The experimental design and analysis were rigorous, and the findings corroborate previous concerns about the reliability of these models when it comes to factual information.

However, the paper does not delve deeply into the potential causes of hallucination or propose concrete solutions. While the researchers acknowledge the need for improved model architectures and training techniques to mitigate hallucination, they do not offer specific recommendations or directions for future work.

Additionally, the paper focuses solely on the hallucination issue and does not consider other potential pitfalls or unintended behaviors that may arise with the increasing deployment of LLMs in various applications. Further research is needed to explore the broader implications and challenges associated with the use of these powerful AI systems.

Conclusion

This paper sheds light on the significant challenge of hallucination in large language models, where the models generate responses that contradict known facts. The findings underscore the need for continued research and development to improve the reliability and trustworthiness of these powerful AI systems.

As LLMs become more widely adopted, it is crucial to address the hallucination problem and ensure that these models can provide accurate and trustworthy information. Addressing this issue will be essential for the safe and responsible deployment of LLMs in a wide range of applications, from customer service to medical diagnosis and beyond.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

A Survey on Hallucination in Large Vision-Language Models

Hanchao Liu, Wenyuan Xue, Yifei Chen, Dapeng Chen, Xiutian Zhao, Ke Wang, Liping Hou, Rongjun Li, Wei Peng

Recent development of Large Vision-Language Models (LVLMs) has attracted growing attention within the AI landscape for its practical implementation potential. However, ``hallucination'', or more specifically, the misalignment between factual visual content and corresponding textual generation, poses a significant challenge of utilizing LVLMs. In this comprehensive survey, we dissect LVLM-related hallucinations in an attempt to establish an overview and facilitate future mitigation. Our scrutiny starts with a clarification of the concept of hallucinations in LVLMs, presenting a variety of hallucination symptoms and highlighting the unique challenges inherent in LVLM hallucinations. Subsequently, we outline the benchmarks and methodologies tailored specifically for evaluating hallucinations unique to LVLMs. Additionally, we delve into an investigation of the root causes of these hallucinations, encompassing insights from the training data and model components. We also critically review existing methods for mitigating hallucinations. The open questions and future directions pertaining to hallucinations within LVLMs are discussed to conclude this survey.

5/7/2024

cs.CV cs.CL cs.LG

💬

Hallucination of Multimodal Large Language Models: A Survey

Zechen Bai, Pichao Wang, Tianjun Xiao, Tong He, Zongbo Han, Zheng Zhang, Mike Zheng Shou

This survey presents a comprehensive analysis of the phenomenon of hallucination in multimodal large language models (MLLMs), also known as Large Vision-Language Models (LVLMs), which have demonstrated significant advancements and remarkable abilities in multimodal tasks. Despite these promising developments, MLLMs often generate outputs that are inconsistent with the visual content, a challenge known as hallucination, which poses substantial obstacles to their practical deployment and raises concerns regarding their reliability in real-world applications. This problem has attracted increasing attention, prompting efforts to detect and mitigate such inaccuracies. We review recent advances in identifying, evaluating, and mitigating these hallucinations, offering a detailed overview of the underlying causes, evaluation benchmarks, metrics, and strategies developed to address this issue. Additionally, we analyze the current challenges and limitations, formulating open questions that delineate potential pathways for future research. By drawing the granular classification and landscapes of hallucination causes, evaluation benchmarks, and mitigation methods, this survey aims to deepen the understanding of hallucinations in MLLMs and inspire further advancements in the field. Through our thorough and in-depth review, we contribute to the ongoing dialogue on enhancing the robustness and reliability of MLLMs, providing valuable insights and resources for researchers and practitioners alike. Resources are available at: https://github.com/showlab/Awesome-MLLM-Hallucination.

4/30/2024

cs.CV

Don't Believe Everything You Read: Enhancing Summarization Interpretability through Automatic Identification of Hallucinations in Large Language Models

Priyesh Vakharia, Devavrat Joshi, Meenal Chavan, Dhananjay Sonawane, Bhrigu Garg, Parsa Mazaheri

Large Language Models (LLMs) are adept at text manipulation -- tasks such as machine translation and text summarization. However, these models can also be prone to hallucination, which can be detrimental to the faithfulness of any answers that the model provides. Recent works in combating hallucinations in LLMs deal with identifying hallucinated sentences and categorizing the different ways in which models hallucinate. This paper takes a deep dive into LLM behavior with respect to hallucinations, defines a token-level approach to identifying different kinds of hallucinations, and further utilizes this token-level tagging to improve the interpretability and faithfulness of LLMs in dialogue summarization tasks. Through this, the paper presents a new, enhanced dataset and a new training paradigm.

4/4/2024

cs.CL cs.AI

💬

Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval

Mengjia Niu, Hao Li, Jie Shi, Hamed Haddadi, Fan Mo

Large language models (LLMs) have demonstrated remarkable capabilities across various domains, although their susceptibility to hallucination poses significant challenges for their deployment in critical areas such as healthcare. To address this issue, retrieving relevant facts from knowledge graphs (KGs) is considered a promising method. Existing KG-augmented approaches tend to be resource-intensive, requiring multiple rounds of retrieval and verification for each factoid, which impedes their application in real-world scenarios. In this study, we propose Self-Refinement-Enhanced Knowledge Graph Retrieval (Re-KGR) to augment the factuality of LLMs' responses with less retrieval efforts in the medical field. Our approach leverages the attribution of next-token predictive probability distributions across different tokens, and various model layers to primarily identify tokens with a high potential for hallucination, reducing verification rounds by refining knowledge triples associated with these tokens. Moreover, we rectify inaccurate content using retrieved knowledge in the post-processing stage, which improves the truthfulness of generated responses. Experimental results on a medical dataset demonstrate that our approach can enhance the factual capability of LLMs across various foundational models as evidenced by the highest scores on truthfulness.

5/13/2024

cs.CL cs.LG