Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models

Read original: arXiv:2407.16470 - Published 7/26/2024 by Kenza Benkirane, Laura Gongas, Shahar Pelles, Naomi Fuchs, Joshua Darmon, Pontus Stenetorp, David Ifeoluwa Adelani, Eduardo S'anchez

Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models

Overview

This paper explores using large language models (LLMs) to detect hallucination in machine translation (MT) for both low and high resource languages.
Hallucination refers to the generation of text that is not supported by the input, which is a common issue in MT systems.
The authors propose a novel approach that leverages the capabilities of LLMs to identify hallucinated content in MT output.

Plain English Explanation

Machine translation (MT) is the process of automatically converting text from one language to another. However, MT systems can sometimes generate text that is not actually supported by the original input, a problem known as hallucination. This can lead to inaccurate or even nonsensical translations.

In this paper, the researchers investigated using large language models (LLMs) - powerful AI systems trained on vast amounts of text data - to detect hallucination in MT output. Their approach works for both low resource languages (with limited training data) and high resource languages (with abundant data).

The key idea is to use the LLM to analyze the MT output and identify any text that seems inconsistent or unsupported by the original input. This can help catch and correct hallucinated content, improving the overall accuracy and reliability of the MT system.

By leveraging the impressive language understanding capabilities of LLMs, the researchers were able to develop a novel technique for enhancing the quality and trustworthiness of machine translation, which has important implications for a wide range of real-world applications that rely on accurate language translation.

Technical Explanation

The paper presents a novel approach for detecting hallucination in machine translation (MT) output using large language models (LLMs). The authors first establish the importance of this problem, as hallucination - the generation of text that is not supported by the input - is a common issue in MT systems that can lead to inaccurate or nonsensical translations.

To address this, the researchers propose using LLMs, which are powerful AI models trained on vast amounts of text data, to analyze MT output and identify hallucinated content. Specifically, they fine-tune the LLMs on a task of classifying MT output as either hallucinated or non-hallucinated. This allows the LLMs to develop an understanding of what constitutes hallucinated text.

The authors evaluate their approach on both low resource languages (with limited training data) and high resource languages (with abundant data). Their experiments show that the LLM-based hallucination detection system is effective across this range of language scenarios, outperforming other baseline approaches.

The paper also discusses potential limitations and areas for further research, such as the need to further improve the robustness of the hallucination detection system and explore its applicability to other language tasks beyond MT.

Overall, this work demonstrates the promising potential of LLMs to enhance the quality and reliability of machine translation by identifying and mitigating the issue of hallucination. The techniques presented here could have important implications for a variety of real-world applications that rely on accurate language translation.

Critical Analysis

The paper presents a well-designed and thorough investigation into using large language models (LLMs) for detecting hallucination in machine translation (MT) output. The researchers have identified a relevant and important problem, and their proposed approach leverages the impressive language understanding capabilities of LLMs in a novel way.

One potential limitation noted in the paper is the need to further improve the robustness of the hallucination detection system, as the authors acknowledge that it may not be able to catch all instances of hallucination, especially for more subtle or complex cases. Additionally, the paper does not explore the applicability of the techniques beyond the MT domain, such as to other language-related tasks.

Another area for further research could be investigating the interpretability and explainability of the LLM-based hallucination detection system. Understanding the specific reasons and mechanisms by which the LLM identifies hallucinated content could lead to additional insights and improvements.

Despite these minor caveats, the paper represents an important contribution to the field of machine translation and language modeling. The authors have demonstrated the value of applying large language models to the critical problem of hallucination, with promising results that could have significant real-world impact.

Conclusion

This paper presents a novel approach for detecting hallucination in machine translation (MT) output using large language models (LLMs). The researchers have shown that LLMs can be effectively fine-tuned to identify hallucinated content in MT, with the technique working well for both low and high resource languages.

The ability to reliably detect hallucination is a crucial step towards improving the accuracy and trustworthiness of machine translation systems, which have important applications in fields like healthcare, business, and international communication. By leveraging the impressive language understanding capabilities of LLMs, this work represents an important advancement in enhancing the quality and reliability of automated language translation.

While the paper identifies some areas for further research and refinement, the overall approach and findings demonstrate the significant potential of applying large language models to the problem of hallucination detection. This work lays the groundwork for future developments that could have a transformative impact on the field of machine translation and beyond.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models

Kenza Benkirane, Laura Gongas, Shahar Pelles, Naomi Fuchs, Joshua Darmon, Pontus Stenetorp, David Ifeoluwa Adelani, Eduardo S'anchez

Recent advancements in massively multilingual machine translation systems have significantly enhanced translation accuracy; however, even the best performing systems still generate hallucinations, severely impacting user trust. Detecting hallucinations in Machine Translation (MT) remains a critical challenge, particularly since existing methods excel with High-Resource Languages (HRLs) but exhibit substantial limitations when applied to Low-Resource Languages (LRLs). This paper evaluates hallucination detection approaches using Large Language Models (LLMs) and semantic similarity within massively multilingual embeddings. Our study spans 16 language directions, covering HRLs, LRLs, with diverse scripts. We find that the choice of model is essential for performance. On average, for HRLs, Llama3-70B outperforms the previous state of the art by as much as 0.16 MCC (Matthews Correlation Coefficient). However, for LRLs we observe that Claude Sonnet outperforms other LLMs on average by 0.03 MCC. The key takeaway from our study is that LLMs can achieve performance comparable or even better than previously proposed models, despite not being explicitly trained for any machine translation task. However, their advantage is less significant for LRLs.

7/26/2024

InterrogateLLM: Zero-Resource Hallucination Detection in LLM-Generated Answers

Yakir Yehuda, Itzik Malkiel, Oren Barkan, Jonathan Weill, Royi Ronen, Noam Koenigstein

Despite the many advances of Large Language Models (LLMs) and their unprecedented rapid evolution, their impact and integration into every facet of our daily lives is limited due to various reasons. One critical factor hindering their widespread adoption is the occurrence of hallucinations, where LLMs invent answers that sound realistic, yet drift away from factual truth. In this paper, we present a novel method for detecting hallucinations in large language models, which tackles a critical issue in the adoption of these models in various real-world scenarios. Through extensive evaluations across multiple datasets and LLMs, including Llama-2, we study the hallucination levels of various recent LLMs and demonstrate the effectiveness of our method to automatically detect them. Notably, we observe up to 87% hallucinations for Llama-2 in a specific experiment, where our method achieves a Balanced Accuracy of 81%, all without relying on external knowledge.

8/20/2024

Mitigating Multilingual Hallucination in Large Vision-Language Models

Xiaoye Qu, Mingyang Song, Wei Wei, Jianfeng Dong, Yu Cheng

While Large Vision-Language Models (LVLMs) have exhibited remarkable capabilities across a wide range of tasks, they suffer from hallucination problems, where models generate plausible yet incorrect answers given the input image-query pair. This hallucination phenomenon is even more severe when querying the image in non-English languages, while existing methods for mitigating hallucinations in LVLMs only consider the English scenarios. In this paper, we make the first attempt to mitigate this important multilingual hallucination in LVLMs. With thorough experiment analysis, we found that multilingual hallucination in LVLMs is a systemic problem that could arise from deficiencies in multilingual capabilities or inadequate multimodal abilities. To this end, we propose a two-stage Multilingual Hallucination Removal (MHR) framework for LVLMs, aiming to improve resistance to hallucination for both high-resource and low-resource languages. Instead of relying on the intricate manual annotations of multilingual resources, we fully leverage the inherent capabilities of the LVLM and propose a novel cross-lingual alignment method, which generates multiple responses for each image-query input and then identifies the hallucination-aware pairs for each language. These data pairs are finally used for direct preference optimization to prompt the LVLMs to favor non-hallucinating responses. Experimental results show that our MHR achieves a substantial reduction in hallucination generation for LVLMs. Notably, on our extended multilingual POPE benchmark, our framework delivers an average increase of 19.0% in accuracy across 13 different languages. Our code and model weights are available at https://github.com/ssmisya/MHR

8/2/2024

Don't Believe Everything You Read: Enhancing Summarization Interpretability through Automatic Identification of Hallucinations in Large Language Models

Priyesh Vakharia, Devavrat Joshi, Meenal Chavan, Dhananjay Sonawane, Bhrigu Garg, Parsa Mazaheri

Large Language Models (LLMs) are adept at text manipulation -- tasks such as machine translation and text summarization. However, these models can also be prone to hallucination, which can be detrimental to the faithfulness of any answers that the model provides. Recent works in combating hallucinations in LLMs deal with identifying hallucinated sentences and categorizing the different ways in which models hallucinate. This paper takes a deep dive into LLM behavior with respect to hallucinations, defines a token-level approach to identifying different kinds of hallucinations, and further utilizes this token-level tagging to improve the interpretability and faithfulness of LLMs in dialogue summarization tasks. Through this, the paper presents a new, enhanced dataset and a new training paradigm.

4/4/2024