Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue!

2405.11706

Published 5/21/2024 by Dean Allemang, Juan Sequeda

Abstract

There is increasing evidence that question-answering (QA) systems with Large Language Models (LLMs), which employ a knowledge graph/semantic representation of an enterprise SQL database (i.e. Text-to-SPARQL), achieve higher accuracy compared to systems that answer questions directly on SQL databases (i.e. Text-to-SQL). Our previous benchmark research showed that by using a knowledge graph, the accuracy improved from 16% to 54%. The question remains: how can we further improve the accuracy and reduce the error rate? Building on the observations of our previous research where the inaccurate LLM-generated SPARQL queries followed incorrect paths, we present an approach that consists of 1) Ontology-based Query Check (OBQC): detects errors by leveraging the ontology of the knowledge graph to check if the LLM-generated SPARQL query matches the semantics of the ontology and 2) LLM Repair: uses the error explanations with an LLM to repair the SPARQL query. Using the chat with the data benchmark, our primary finding is that our approach increases the overall accuracy to 72% including an additional 8% of "I don't know" unknown results. Thus, the overall error rate is 20%. These results provide further evidence that investing in knowledge graphs, namely the ontology, provides higher accuracy for LLM-powered question answering systems.
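
The percentages in the abstract are consistent only if accurate answers, "I don't know" results, and errors are read as three disjoint outcomes that together cover every benchmark question. That reading is an assumption, and the variable names below are illustrative. A minimal sketch of the arithmetic:

```python
# Outcome shares quoted in the abstract, in percent of benchmark questions.
accurate = 72  # questions answered correctly
unknown = 8    # questions answered with "I don't know"

# Assumption: the three outcomes are disjoint and cover all questions,
# so the error rate is whatever remains.
error = 100 - accurate - unknown

print(f"accurate={accurate}%, unknown={unknown}%, error={error}%")
```

Under this reading, returning "I don't know" moves a question out of the error bucket without counting it as a correct answer.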

Overview

This paper explores how an ontology, a formal representation of the knowledge in a domain, can improve the accuracy of large language model (LLM) question answering over enterprise data.
The researchers build on their earlier observation that inaccurate LLM-generated SPARQL queries followed incorrect paths, and they use the ontology of the knowledge graph to detect such errors.
The paper presents a two-part approach, Ontology-based Query Check (OBQC) followed by LLM Repair, and evaluates it on the chat with the data benchmark, reporting 72% overall accuracy, an additional 8% of "I don't know" results, and a 20% error rate.

Plain English Explanation

Large language models (LLMs) have made impressive strides in natural language processing, but they can still struggle with certain types of questions, particularly those that require a deeper understanding of the underlying concepts and relationships. This is where ontologies can lend a helping hand.

Ontologies are like structured databases of knowledge, where different entities (like people, places, or ideas) are organized and their relationships to each other are clearly defined. By checking the queries an LLM writes against this ontological information, the researchers aim to catch queries that misuse those relationships and have the LLM correct them, leading to more accurate and relevant answers.

Imagine you're trying to answer a question about the relationship between a specific person and a historical event. An LLM might struggle to connect the dots, but if it had access to an ontology that clearly showed how the person and event were related, it could provide a much more informed and accurate response.

The researchers in this paper have developed an approach that checks each LLM-generated query against the ontology and, when the check finds a problem, passes an explanation of the error back to the LLM so it can repair the query. Tested on the chat with the data benchmark, the approach raised overall accuracy to 72%, up from the 54% their earlier knowledge-graph work had reached, highlighting the potential of using structured knowledge to enhance the capabilities of these powerful language models.

Technical Explanation

The paper proposes a two-step approach that uses the ontology of a knowledge graph to improve the accuracy of LLM-based question answering over an enterprise SQL database. The approach operates on the SPARQL query that the LLM generates (Text-to-SPARQL) and is motivated by the authors' earlier observation that inaccurate LLM-generated queries followed incorrect paths.

The proposed method consists of two main components:

Ontology-based Query Check (OBQC): Detects errors by using the ontology of the knowledge graph to check whether the LLM-generated SPARQL query matches the semantics of the ontology.
LLM Repair: Passes the error explanations produced by the check to an LLM, which uses them to repair the SPARQL query.

The researchers evaluate their approach on the chat with the data benchmark. Overall accuracy rises to 72%, with an additional 8% of questions returning an "I don't know" result, which leaves an overall error rate of 20%. Their previous benchmark research had shown accuracy improving from 16% to 54% when a knowledge graph was used.
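
The check-then-repair idea from the abstract can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation. The toy insurance ontology, the tuple representation of triple patterns, the domain/range rule, the retry limit of three, and the `fake_llm_repair` stand-in for a real LLM call are all hypothetical. Subclass reasoning and SPARQL parsing are left out.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A triple pattern from a query body: (subject, property, object).
Triple = Tuple[str, str, str]

# Hypothetical toy ontology: property -> (domain class, range class).
ONTOLOGY: Dict[str, Tuple[str, str]] = {
    ":soldByAgent": (":Policy", ":Agent"),
    ":hasPolicyHolder": (":Policy", ":PolicyHolder"),
}


def check_query(patterns: List[Triple]) -> List[str]:
    """Return one explanation per domain/range violation found."""
    # Variable types declared with `?x a :Class` patterns.
    types = {s: o for s, p, o in patterns if p == "a"}
    errors: List[str] = []
    for s, p, o in patterns:
        if p == "a":
            continue
        if p not in ONTOLOGY:
            errors.append(f"{p} is not defined in the ontology")
            continue
        domain, rng = ONTOLOGY[p]
        if s in types and types[s] != domain:
            errors.append(f"{s} is typed {types[s]} but {p} has domain {domain}")
        if o in types and types[o] != rng:
            errors.append(f"{o} is typed {types[o]} but {p} has range {rng}")
    return errors


def check_and_repair(
    patterns: List[Triple],
    repair: Callable[[List[Triple], List[str]], List[Triple]],
    max_rounds: int = 3,  # arbitrary limit chosen for this sketch
) -> Optional[List[Triple]]:
    """Repair until the check passes; None stands for "I don't know"."""
    for _ in range(max_rounds):
        errors = check_query(patterns)
        if not errors:
            return patterns
        patterns = repair(patterns, errors)
    return patterns if not check_query(patterns) else None


def fake_llm_repair(patterns: List[Triple], errors: List[str]) -> List[Triple]:
    # Stand-in for an LLM call: flips subject and object of :soldByAgent.
    return [(o, p, s) if p == ":soldByAgent" else (s, p, o) for s, p, o in patterns]


# The property is used in the wrong direction here.
bad_query = [
    ("?agent", "a", ":Agent"),
    ("?policy", "a", ":Policy"),
    ("?agent", ":soldByAgent", "?policy"),
]
print(check_query(bad_query))
print(check_and_repair(bad_query, fake_llm_repair))
```

In a real system the `repair` callable would send the query and the error explanations to an LLM. Returning `None` when no attempt passes the check corresponds to giving an "I don't know" answer instead of a possibly wrong one.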

Critical Analysis

The researchers have presented a compelling approach for improving LLM accuracy in question-answering tasks by leveraging ontological information. However, the paper does not address several potential limitations and areas for further research:

Scalability and Generalization: While the results on the evaluated benchmarks are promising, it's unclear how well the proposed method would scale to more complex, real-world scenarios with large, diverse knowledge bases. Further research is needed to assess the model's ability to generalize to a wide range of domains and question types.
Ontology Construction and Maintenance: The paper assumes the availability of high-quality ontologies, but the process of constructing and maintaining such knowledge bases can be challenging and resource-intensive. Exploring more automated or semi-automated approaches to ontology generation could enhance the practicality of the proposed solution.
Interpretability and Explainability: The integration of ontological knowledge into the LLM's reasoning process may introduce additional complexity, making it more challenging to understand and explain the model's decision-making. Investigating ways to improve the interpretability of the ontology-enhanced LLM could be valuable for building trust and transparency in the system.
Potential Biases and Limitations: As with any knowledge-based approach, the ontologies used may reflect the biases and limitations of their creators. It would be important to analyze the impact of these biases on the LLM's performance and explore methods to mitigate them.

Despite these potential areas for improvement, the researchers' work demonstrates the promising potential of leveraging structured knowledge to enhance the capabilities of large language models, particularly in the realm of question answering. Further advancements in this direction could lead to significant improvements in the reliability and trustworthiness of LLM-based applications.

Conclusion

This paper presents an approach that uses the ontology of a knowledge graph to check and repair LLM-generated SPARQL queries, improving accuracy in question-answering tasks. By checking each query against the ontology's description of concepts and their relationships, and feeding the resulting error explanations back to the LLM, the researchers can detect and correct queries that do not match the semantics of the ontology.

The proposed method, which combines Ontology-based Query Check (OBQC) with LLM Repair, has been evaluated on the chat with the data benchmark, where it reaches 72% overall accuracy with a 20% error rate. This work highlights the potential of using ontologies to enhance the capabilities of large language models, which could have significant implications for a wide range of natural language processing applications, from conversational AI to knowledge-intensive question answering.

As the field of AI continues to evolve, the integration of structured knowledge into language models like the one presented in this paper could be a crucial step towards developing more reliable, trustworthy, and context-aware language understanding systems that can better serve the needs of users and society.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs

Pranoy Panda, Ankush Agarwal, Chaitanya Devaguptapu, Manohar Kaul, Prathosh A P

Given unstructured text, Large Language Models (LLMs) are adept at answering simple (single-hop) questions. However, as the complexity of the questions increases, the performance of LLMs degrades. We believe this is due to the overhead associated with understanding the complex question followed by filtering and aggregating unstructured information in the raw text. Recent methods try to reduce this burden by integrating structured knowledge triples into the raw text, aiming to provide a structured overview that simplifies information processing. However, this simplistic approach is query-agnostic and the extracted facts are ambiguous as they lack context. To address these drawbacks and to enable LLMs to answer complex (multi-hop) questions with ease, we propose to use a knowledge graph (KG) that is context-aware and is distilled to contain query-relevant information. The use of our compressed distilled KG as input to the LLM results in our method utilizing up to 67% fewer tokens to represent the query relevant information present in the supporting documents, compared to the state-of-the-art (SoTA) method. Our experiments show consistent improvements over the SoTA across several metrics (EM, F1, BERTScore, and Human Eval) on two popular benchmark datasets (HotpotQA and MuSiQue).

6/11/2024

cs.CL

Reasoning on Efficient Knowledge Paths: Knowledge Graph Guides Large Language Model for Domain Question Answering

Yuqi Wang, Boran Jiang, Yi Luo, Dawei He, Peng Cheng, Liangcai Gao

Large language models (LLMs), such as GPT3.5, GPT4 and LLAMA2 perform surprisingly well and outperform human experts on many tasks. However, in many domain-specific evaluations, these LLMs often suffer from hallucination problems due to insufficient training of relevant corpus. Furthermore, fine-tuning large models may face problems such as the LLMs not being open source or the construction of high-quality domain instruction being difficult. Therefore, structured knowledge databases such as knowledge graphs can better provide domain background knowledge for LLMs and make full use of the reasoning and analysis capabilities of LLMs. In some previous works, the LLM was called multiple times to determine whether the current triplet was suitable for inclusion in the subgraph when retrieving subgraphs through a question. Especially for questions that require a multi-hop reasoning path, frequent calls to the LLM will consume a lot of computing power. Moreover, when choosing the reasoning path, the LLM will be called once for each step, and if one of the steps is selected incorrectly, it will lead to the accumulation of errors in the following steps. In this paper, we integrated and optimized a pipeline for selecting reasoning paths from KG based on LLM, which can reduce the dependency on LLM. In addition, we propose a simple and effective subgraph retrieval method based on chain of thought (CoT) and page rank which can return the paths most likely to contain the answer. We conduct experiments on three datasets: GenMedGPT-5k [14], WebQuestions [2], and CMCQA [21]. Finally, RoK can demonstrate that using fewer LLM calls can achieve the same results as previous SOTA models.

4/17/2024

cs.CL cs.AI cs.IR

Counter-intuitive: Large Language Models Can Better Understand Knowledge Graphs Than We Thought

Xinbang Dai, Yuncheng Hua, Tongtong Wu, Yang Sheng, Qiu Ji, Guilin Qi

As the parameter scale of large language models (LLMs) grows, jointly training knowledge graph (KG) embeddings with model parameters to enhance LLM capabilities becomes increasingly costly. Consequently, the community has shown interest in developing prompt strategies that effectively integrate KG information into LLMs. However, the format for incorporating KGs into LLMs lacks standardization; for instance, KGs can be transformed into linearized triples or natural language (NL) text. Current prompting methods often rely on a trial-and-error approach, leaving researchers with an incomplete understanding of which KG input format best facilitates LLM comprehension of KG content. To elucidate this, we design a series of experiments to explore LLMs' understanding of different KG input formats within the context of prompt engineering. Our analysis examines both literal and attention distribution levels. Through extensive experiments, we indicate a counter-intuitive phenomenon: when addressing fact-related questions, unordered linearized triples are more effective for LLMs' understanding of KGs compared to fluent NL text. Furthermore, noisy, incomplete, or marginally relevant subgraphs can still enhance LLM performance. Finally, different LLMs have distinct preferences for different formats of organizing unordered triples.

6/18/2024

cs.CL cs.AI

LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering

Harry Li, Gabriel Appleby, Ashley Suh

We present LinkQ, a system that leverages a large language model (LLM) to facilitate knowledge graph (KG) query construction through natural language question-answering. Traditional approaches often require detailed knowledge of complex graph querying languages, limiting the ability for users -- even experts -- to acquire valuable insights from KG data. LinkQ simplifies this process by first interpreting a user's question, then converting it into a well-formed KG query. By using the LLM to construct a query instead of directly answering the user's question, LinkQ guards against the LLM hallucinating or generating false, erroneous information. By integrating an LLM into LinkQ, users are able to conduct both exploratory and confirmatory data analysis, with the LLM helping to iteratively refine open-ended questions into precise ones. To demonstrate the efficacy of LinkQ, we conducted a qualitative study with five KG practitioners and distill their feedback. Our results indicate that practitioners find LinkQ effective for KG question-answering, and desire future LLM-assisted systems for the exploratory analysis of graph databases.

6/12/2024

cs.CL cs.AI cs.LG