Journal of Frontiers of Computer Science and Technology

• Science Researches •     Next Articles

Overview of Knowledge Graph Question Answering Enhanced by Large Language Models

FENG Tuoyu,  LI Weiping,  GUO Qinglang, WANG Gangliang, ZHANG Yusong, QIAO Zijian   

  1. 1. School of Software & Microelectronics, Peking University, Beijing 100091, China
    2. China Academy of Electronics and Information Technology, Beijing 100041, China

大语言模型增强的知识图谱问答研究进展综述

冯拓宇,李伟平,郭庆浪,王刚亮,张雨松,乔子剑   

  1. 1.北京大学 软件与微电子学院,北京 100091
    2.中国电子科技集团有限公司电子科学研究院,北京 100041

Abstract: Knowledge Graph Question Answering (KGQA) is a technology that retrieves relevant answers from a knowledge graph by processing natural language questions posed by users. Early KGQA technologies were limited by the size of the knowledge graphs, computational power, and natural language processing capabilities, resulting in lower accuracy. In recent years, with advancements in artificial intelligence, particularly the development of Large Language Models (LLMs), KGQA technology has seen significant improvements. LLMs such as GPT-3 have been widely applied to enhance the performance of KGQA. To better study and learn about enhanced KGQA techniques, various methods using LLMs for KGQA have been summarized. First, the relevant knowledge of LLMs and KGQA is summarized, including the technical principles and training methods of LLMs, as well as the basic concepts of knowledge graphs, question answering, and KGQA. Second, existing methods of enhancing KGQA with LLMs are reviewed from two dimensions: semantic parsing and information retrieval, analyzing the problems these methods address and their limitations. Additionally, related resources and evaluation methods for KGQA enhanced by LLMs are collected and organized, and the performance of existing methods is summarized. Finally, the limitations of current methods are analyzed, and future research directions are proposed.

Key words: large language model, knowledge graph question answering, semantic parsing, information retrieval

摘要: 知识图谱问答(Knowledge Graph Question Answering, KGQA)是一种通过处理用户提出的自然语言问题,从知识图谱中获取相关答案的技术。早期的知识图谱问答技术受到知识图谱规模、计算能力以及自然语言处理能力的限制,其准确率较低。近年来,随着人工智能技术的进步,特别是大语言模型(Large Language Models, LLMs)的发展,知识图谱问答技术得到显著提升。大语言模型如GPT-3等已经被广泛应用于增强知识图谱问答的性能。为了更好地研究学习增强知识图谱问答的技术,对现有的各种大语言模型增强的知识图谱问答方法进行了总结归纳。首先,总结大语言模型和知识图谱问答的相关知识,即大语言模型的技术原理、训练方法以及知识图谱、问答和知识图谱问答的基本概念。其次,从语义解析和信息检索两个维度,综述大语言模型增强知识图谱问答的现有方法,分析方法所解决的问题及其局限性。此外,收集整理大语言模型增强知识图谱问答的相关资源和评测方法并对现有方法的性能表现进行总结。最后,针对现有方法存在的局限性,分析并提出出未来的重点研究方向。

关键词: 大语言模型, 知识图谱问答, 语义解析, 信息检索