Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering

Read original: arXiv:2404.14741 - Published 4/24/2024 by Yao Xu, Shizhu He, Jiabei Chen, Zihao Wang, Yangqiu Song, Hanghang Tong, Kang Liu, Jun Zhao

📉

Overview

The paper addresses the issue of insufficient knowledge and tendency to generate hallucination in Large Language Models (LLMs).
Numerous studies have tried to integrate LLMs with Knowledge Graphs (KGs) to address this problem.
However, these methods are evaluated on conventional Knowledge Graph Question Answering (KGQA) with complete KGs, where the factual triples involved in each question are entirely covered by the given KG.
In real-world scenarios, KGs are often incomplete, and the paper proposes leveraging LLMs for Question Answering under Incomplete Knowledge Graph (IKGQA) conditions.
The paper introduces a training-free method called Generate-on-Graph (GoG) that can generate new factual triples while exploring on KGs.

Plain English Explanation

Large Language Models (LLMs) like GPT-3 are powerful tools that can understand and generate human-like text. However, they often lack sufficient knowledge and can sometimes produce hallucinated or inaccurate information. To address this, researchers have tried integrating LLMs with Knowledge Graphs (KGs) - structured databases of facts and relationships.

In the typical setup for evaluating these methods, the KG contains all the information needed to answer the questions being asked. This makes the task relatively straightforward, as the LLM can simply explore the KG to find the relevant facts and provide the answer.

But in the real world, KGs are often incomplete - they don't have all the knowledge required to answer every question. This is where the paper's research comes in. The authors propose a new method called Generate-on-Graph (GoG) that allows LLMs to not only explore the KG, but also <a href="https://aimodels.fyi/papers/arxiv/logic-query-thoughts-guiding-large-language-models">generate new factual information</a> to supplement the incomplete KG and answer questions more effectively.

Technical Explanation

The paper proposes a method called Generate-on-Graph (GoG) to handle Question Answering under Incomplete Knowledge Graph (IKGQA) conditions. Unlike previous approaches that treat LLMs as agents to simply explore the KG, GoG also treats the LLM as a knowledge source to <a href="https://aimodels.fyi/papers/arxiv/reasoning-efficient-knowledge-pathsknowledge-graph-guides-large">generate new facts</a> based on the explored subgraph and the LLM's own inherent knowledge.

The key steps of the GoG method are:

Selecting the most relevant subgraph from the incomplete KG based on the question.
Generating new factual triples using the LLM to supplement the subgraph.
Answering the question by exploring the augmented subgraph.

Experimental results on two datasets show that GoG can solve IKGQA to a certain extent, while <a href="https://aimodels.fyi/papers/arxiv/counter-intuitive-large-language-models-can-better">almost all previous methods cannot perform well</a> on this task.

Critical Analysis

The paper presents a promising approach to integrating LLMs with incomplete KGs, but there are a few potential limitations and areas for further research:

The paper does not provide a detailed analysis of the types of facts the LLM is able to generate and how accurate or reliable these generated facts are. More research is needed to understand the strengths and weaknesses of the LLM's knowledge generation capabilities.
The experiments are conducted on a limited set of datasets, and it would be valuable to evaluate the GoG method on a wider range of IKGQA benchmarks to assess its generalizability.
The paper does not discuss potential biases or safety issues that may arise from allowing LLMs to freely generate new factual information, which could be an important consideration for real-world applications. <a href="https://aimodels.fyi/papers/arxiv/curiousllm-elevating-multi-document-qa-reasoning-infused">Further research is needed</a> to address these concerns.

Overall, the GoG method represents an interesting step forward in leveraging the complementary strengths of LLMs and KGs, and the paper's insights could contribute to the development of more robust and versatile question answering systems.

Conclusion

This paper presents a novel approach called Generate-on-Graph (GoG) that aims to address the limitations of Large Language Models (LLMs) and Knowledge Graphs (KGs) when it comes to question answering in the real world, where KGs are often incomplete.

The key idea of GoG is to not only treat the LLM as an agent to explore the KG, but also as a source of knowledge that can <a href="https://aimodels.fyi/papers/arxiv/cross-data-knowledge-graph-construction-llm-enabled">generate new factual information</a> to supplement the incomplete KG. This allows the system to better integrate internal and external knowledge sources to answer questions more effectively.

The paper's experimental results demonstrate the potential of this approach, suggesting that LLMs can play a valuable role in enhancing knowledge-based question answering systems, particularly in scenarios where the available knowledge is incomplete. While further research is needed to address some of the limitations, the GoG method represents an important step forward in the ongoing effort to create more robust and capable AI-powered question answering systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📉

Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question Answering

Yao Xu, Shizhu He, Jiabei Chen, Zihao Wang, Yangqiu Song, Hanghang Tong, Kang Liu, Jun Zhao

To address the issue of insufficient knowledge and the tendency to generate hallucination in Large Language Models (LLMs), numerous studies have endeavored to integrate LLMs with Knowledge Graphs (KGs). However, all these methods are evaluated on conventional Knowledge Graph Question Answering (KGQA) with complete KGs, where the factual triples involved in each question are entirely covered by the given KG. In this situation, LLM mainly acts as an agent to find answer entities by exploring the KG, rather than effectively integrating internal and external knowledge sources. However, in real-world scenarios, KGs are often incomplete to cover all the knowledge required to answer questions. To simulate real-world scenarios and evaluate the ability of LLMs to integrate internal and external knowledge, in this paper, we propose leveraging LLMs for QA under Incomplete Knowledge Graph (IKGQA), where the given KG doesn't include all the factual triples involved in each question. To handle IKGQA, we propose a training-free method called Generate-on-Graph (GoG) that can generate new factual triples while exploring on KGs. Specifically, we propose a selecting-generating-answering framework, which not only treat the LLM as an agent to explore on KGs, but also treat it as a KG to generate new facts based on the explored subgraph and its inherent knowledge. Experimental results on two datasets demonstrate that our GoG can solve IKGQA to a certain extent, while almost all previous methods cannot perform well on IKGQA.

4/24/2024

💬

Logic Query of Thoughts: Guiding Large Language Models to Answer Complex Logic Queries with Knowledge Graphs

Lihui Liu, Zihao Wang, Ruizhong Qiu, Yikun Ban, Eunice Chan, Yangqiu Song, Jingrui He, Hanghang Tong

Despite the superb performance in many tasks, large language models (LLMs) bear the risk of generating hallucination or even wrong answers when confronted with tasks that demand the accuracy of knowledge. The issue becomes even more noticeable when addressing logic queries that require multiple logic reasoning steps. On the other hand, knowledge graph (KG) based question answering methods are capable of accurately identifying the correct answers with the help of knowledge graph, yet its accuracy could quickly deteriorate when the knowledge graph itself is sparse and incomplete. It remains a critical challenge on how to integrate knowledge graph reasoning with LLMs in a mutually beneficial way so as to mitigate both the hallucination problem of LLMs as well as the incompleteness issue of knowledge graphs. In this paper, we propose 'Logic-Query-of-Thoughts' (LGOT) which is the first of its kind to combine LLMs with knowledge graph based logic query reasoning. LGOT seamlessly combines knowledge graph reasoning and LLMs, effectively breaking down complex logic queries into easy to answer subquestions. Through the utilization of both knowledge graph reasoning and LLMs, it successfully derives answers for each subquestion. By aggregating these results and selecting the highest quality candidate answers for each step, LGOT achieves accurate results to complex questions. Our experimental findings demonstrate substantial performance enhancements, with up to 20% improvement over ChatGPT.

4/16/2024

🔍

Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering

Larissa Pusch, Tim O. F. Conrad

Advancements in natural language processing have revolutionized the way we can interact with digital information systems, such as databases, making them more accessible. However, challenges persist, especially when accuracy is critical, as in the biomedical domain. A key issue is the hallucination problem, where models generate information unsupported by the underlying data, potentially leading to dangerous misinformation. This paper presents a novel approach designed to bridge this gap by combining Large Language Models (LLM) and Knowledge Graphs (KG) to improve the accuracy and reliability of question-answering systems, on the example of a biomedical KG. Built on the LangChain framework, our method incorporates a query checker that ensures the syntactical and semantic validity of LLM-generated queries, which are then used to extract information from a Knowledge Graph, substantially reducing errors like hallucinations. We evaluated the overall performance using a new benchmark dataset of 50 biomedical questions, testing several LLMs, including GPT-4 Turbo and llama3:70b. Our results indicate that while GPT-4 Turbo outperforms other models in generating accurate queries, open-source models like llama3:70b show promise with appropriate prompt engineering. To make this approach accessible, a user-friendly web-based interface has been developed, allowing users to input natural language queries, view generated and corrected Cypher queries, and verify the resulting paths for accuracy. Overall, this hybrid approach effectively addresses common issues such as data gaps and hallucinations, offering a reliable and intuitive solution for question answering systems. The source code for generating the results of this paper and for the user-interface can be found in our Git repository: https://git.zib.de/lpusch/cyphergenkg-gui

9/9/2024

Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering

Yuqi Wang, Boran Jiang, Yi Luo, Dawei He, Peng Cheng, Liangcai Gao

Large language models (LLMs), such as GPT3.5, GPT4 and LLAMA2 perform surprisingly well and outperform human experts on many tasks. However, in many domain-specific evaluations, these LLMs often suffer from hallucination problems due to insufficient training of relevant corpus. Furthermore, fine-tuning large models may face problems such as the LLMs are not open source or the construction of high-quality domain instruction is difficult. Therefore, structured knowledge databases such as knowledge graph can better provide domain back- ground knowledge for LLMs and make full use of the reasoning and analysis capabilities of LLMs. In some previous works, LLM was called multiple times to determine whether the current triplet was suitable for inclusion in the subgraph when retrieving subgraphs through a question. Especially for the question that require a multi-hop reasoning path, frequent calls to LLM will consume a lot of computing power. Moreover, when choosing the reasoning path, LLM will be called once for each step, and if one of the steps is selected incorrectly, it will lead to the accumulation of errors in the following steps. In this paper, we integrated and optimized a pipeline for selecting reasoning paths from KG based on LLM, which can reduce the dependency on LLM. In addition, we propose a simple and effective subgraph retrieval method based on chain of thought (CoT) and page rank which can returns the paths most likely to contain the answer. We conduct experiments on three datasets: GenMedGPT-5k [14], WebQuestions [2], and CMCQA [21]. Finally, RoK can demonstrate that using fewer LLM calls can achieve the same results as previous SOTAs models.

4/17/2024