Multi-hop Question Answering over Knowledge Graphs using Large Language Models

2404.19234

Published 5/1/2024 by Abir Chakraborty

💬

Abstract

Knowledge graphs (KGs) are large datasets with specific structures representing large knowledge bases (KB) where each node represents a key entity and relations amongst them are typed edges. Natural language queries formed to extract information from a KB entail starting from specific nodes and reasoning over multiple edges of the corresponding KG to arrive at the correct set of answer nodes. Traditional approaches of question answering on KG are based on (a) semantic parsing (SP), where a logical form (e.g., S-expression, SPARQL query, etc.) is generated using node and edge embeddings and then reasoning over these representations or tuning language models to generate the final answer directly, or (b) information-retrieval based that works by extracting entities and relations sequentially. In this work, we evaluate the capability of (LLMs) to answer questions over KG that involve multiple hops. We show that depending upon the size and nature of the KG we need different approaches to extract and feed the relevant information to an LLM since every LLM comes with a fixed context window. We evaluate our approach on six KGs with and without the availability of example-specific sub-graphs and show that both the IR and SP-based methods can be adopted by LLMs resulting in an extremely competitive performance.

Create account to get full access

Overview

Knowledge graphs (KGs) are large datasets that represent complex information, with entities as nodes and relationships between them as edges.
Extracting information from KGs using natural language queries requires reasoning over multiple connections in the graph, which can be challenging.
Traditional approaches to this problem involve either semantic parsing to generate logical forms or information retrieval to extract relevant entities and relations.
This research examines the capability of large language models (LLMs) to answer questions that require multi-hop reasoning over KGs.

Plain English Explanation

Knowledge graphs are like giant maps of information, where each node represents an important thing (like a person, place, or concept) and the lines connecting them show how those things are related. When you ask a question that requires understanding those connections, it can be tricky to find the right answer.

Reasoning-Efficient Knowledge Paths: Knowledge Graph Guides Large Language Models to Answer Complex Questions shows that large language models, which are very good at understanding and generating human-like text, can also be used to answer questions that involve following multiple steps through a knowledge graph.

The researchers explored different ways of feeding the information from the knowledge graph into the language model, since the model has a limit on how much context it can consider at once. They found that both the traditional approaches of Information Retrieval and Semantic Parsing can work well, depending on the size and nature of the knowledge graph.

This is an important step in enhancing question answering systems that use enterprise knowledge bases, as it shows that large language models can be effectively applied to this challenging problem.

Technical Explanation

The paper evaluates the ability of large language models (LLMs) to answer questions that require reasoning over multiple steps in a knowledge graph (KG). Traditional approaches to this task involve either semantic parsing, where a logical form is generated and then reasoned over, or information retrieval, where relevant entities and relations are extracted sequentially.

The researchers explore how to effectively feed KG information to LLMs, as the models have a fixed context window that may limit their ability to consider all the relevant connections. They evaluate their approaches on six different KGs, both with and without access to example-specific subgraphs.

The results show that both the semantic parsing and information retrieval-based methods can be adopted by LLMs, leading to highly competitive performance. The specific approach that works best depends on the size and nature of the KG being used.

Critical Analysis

The paper presents a thorough evaluation of using LLMs for multi-hop reasoning over knowledge graphs, and the findings provide useful insights for enhancing question answering systems that leverage enterprise knowledge bases.

One potential limitation is that the experiments were conducted on a relatively small set of six knowledge graphs. While the authors demonstrate the effectiveness of their approaches across this diverse set of KGs, further validation on a larger and more comprehensive set of benchmarks would help strengthen the generalizability of the findings.

Additionally, the paper does not delve into the interpretability or explainability of the LLM-based reasoning process. Understanding how the models arrive at their answers and exposing the underlying logic could be an important area for future research, especially in mission-critical applications where transparency is crucial.

Overall, this work represents a valuable contribution to the field of knowledge-driven natural language processing, and the techniques described could have far-reaching implications for cross-data knowledge graph construction enabled by large language models.

Conclusion

This research demonstrates the ability of large language models to effectively reason over knowledge graphs and answer complex questions that require understanding multiple connections in the graph. The findings suggest that both semantic parsing and information retrieval-based approaches can be successfully adopted by LLMs, with the optimal method depending on the characteristics of the specific knowledge graph.

These insights could lead to significant advancements in the field of question answering systems that leverage enterprise knowledge bases, providing users with more accurate and informative responses to their queries. As large language models continue to evolve, the techniques described in this paper will likely become increasingly important for unlocking the full potential of knowledge-driven natural language processing.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

🧠

HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs

Pranoy Panda, Ankush Agarwal, Chaitanya Devaguptapu, Manohar Kaul, Prathosh A P

Given unstructured text, Large Language Models (LLMs) are adept at answering simple (single-hop) questions. However, as the complexity of the questions increase, the performance of LLMs degrade. We believe this is due to the overhead associated with understanding the complex question followed by filtering and aggregating unstructured information in the raw text. Recent methods try to reduce this burden by integrating structured knowledge triples into the raw text, aiming to provide a structured overview that simplifies information processing. However, this simplistic approach is query-agnostic and the extracted facts are ambiguous as they lack context. To address these drawbacks and to enable LLMs to answer complex (multi-hop) questions with ease, we propose to use a knowledge graph (KG) that is context-aware and is distilled to contain query-relevant information. The use of our compressed distilled KG as input to the LLM results in our method utilizing up to $67%$ fewer tokens to represent the query relevant information present in the supporting documents, compared to the state-of-the-art (SoTA) method. Our experiments show consistent improvements over the SoTA across several metrics (EM, F1, BERTScore, and Human Eval) on two popular benchmark datasets (HotpotQA and MuSiQue).

6/11/2024

cs.CL

Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering

Yuqi Wang, Boran Jiang, Yi Luo, Dawei He, Peng Cheng, Liangcai Gao

Large language models (LLMs), such as GPT3.5, GPT4 and LLAMA2 perform surprisingly well and outperform human experts on many tasks. However, in many domain-specific evaluations, these LLMs often suffer from hallucination problems due to insufficient training of relevant corpus. Furthermore, fine-tuning large models may face problems such as the LLMs are not open source or the construction of high-quality domain instruction is difficult. Therefore, structured knowledge databases such as knowledge graph can better provide domain back- ground knowledge for LLMs and make full use of the reasoning and analysis capabilities of LLMs. In some previous works, LLM was called multiple times to determine whether the current triplet was suitable for inclusion in the subgraph when retrieving subgraphs through a question. Especially for the question that require a multi-hop reasoning path, frequent calls to LLM will consume a lot of computing power. Moreover, when choosing the reasoning path, LLM will be called once for each step, and if one of the steps is selected incorrectly, it will lead to the accumulation of errors in the following steps. In this paper, we integrated and optimized a pipeline for selecting reasoning paths from KG based on LLM, which can reduce the dependency on LLM. In addition, we propose a simple and effective subgraph retrieval method based on chain of thought (CoT) and page rank which can returns the paths most likely to contain the answer. We conduct experiments on three datasets: GenMedGPT-5k [14], WebQuestions [2], and CMCQA [21]. Finally, RoK can demonstrate that using fewer LLM calls can achieve the same results as previous SOTAs models.

4/17/2024

cs.CL cs.AI cs.IR

Counter-intuitive: Large Language Models Can Better Understand Knowledge Graphs Than We Thought

Xinbang Dai, Yuncheng Hua, Tongtong Wu, Yang Sheng, Qiu Ji, Guilin Qi

As the parameter scale of large language models (LLMs) grows, jointly training knowledge graph (KG) embeddings with model parameters to enhance LLM capabilities becomes increasingly costly. Consequently, the community has shown interest in developing prompt strategies that effectively integrate KG information into LLMs. However, the format for incorporating KGs into LLMs lacks standardization; for instance, KGs can be transformed into linearized triples or natural language (NL) text. Current prompting methods often rely on a trial-and-error approach, leaving researchers with an incomplete understanding of which KG input format best facilitates LLM comprehension of KG content. To elucidate this, we design a series of experiments to explore LLMs' understanding of different KG input formats within the context of prompt engineering. Our analysis examines both literal and attention distribution levels. Through extensive experiments, we indicate a counter-intuitive phenomenon: when addressing fact-related questions, unordered linearized triples are more effective for LLMs' understanding of KGs compared to fluent NL text. Furthermore, noisy, incomplete, or marginally relevant subgraphs can still enhance LLM performance. Finally, different LLMs have distinct preferences for different formats of organizing unordered triples.

6/18/2024

cs.CL cs.AI

Knowledge Graph-Enhanced Large Language Models via Path Selection

Haochen Liu, Song Wang, Yaochen Zhu, Yushun Dong, Jundong Li

Large Language Models (LLMs) have shown unprecedented performance in various real-world applications. However, they are known to generate factually inaccurate outputs, a.k.a. the hallucination problem. In recent years, incorporating external knowledge extracted from Knowledge Graphs (KGs) has become a promising strategy to improve the factual accuracy of LLM-generated outputs. Nevertheless, most existing explorations rely on LLMs themselves to perform KG knowledge extraction, which is highly inflexible as LLMs can only provide binary judgment on whether a certain knowledge (e.g., a knowledge path in KG) should be used. In addition, LLMs tend to pick only knowledge with direct semantic relationship with the input text, while potentially useful knowledge with indirect semantics can be ignored. In this work, we propose a principled framework KELP with three stages to handle the above problems. Specifically, KELP is able to achieve finer granularity of flexible knowledge extraction by generating scores for knowledge paths with input texts via latent semantic matching. Meanwhile, knowledge paths with indirect semantic relationships with the input text can also be considered via trained encoding between the selected paths in KG and the input text. Experiments on real-world datasets validate the effectiveness of KELP.

6/21/2024

cs.CL cs.AI