Journal of Frontiers of Computer Science and Technology

• Science Researches •

SbSER: A Step-by-Step Enhanced Reasoning Framework for Large Language Model With External Subgraph Generation

FENG Tuoyu, WANG Gangliang, QIAO Zijian, LI Weiping, ZHANG Yusong, GUO Qinglang

  1. School of Software & Microelectronics, Peking University, Beijing 100091, China
    2. China Academy of Electronics and Information Technology, Beijing 100041, China
    3. School of Telecommunications Engineering, Xidian University, Xi’an 710126, China

Abstract: Large Language Models (LLMs) have achieved remarkable success across a wide range of tasks since their inception, particularly in machine translation, text generation, and question answering, and their applications have rapidly expanded to more complex tasks. However, despite their strong performance in many areas, LLMs still face significant challenges in tasks that require deep reasoning and logical deduction. This is mainly because LLMs are trained on large volumes of textual data that rarely cover specialized knowledge across all domains comprehensively. As a result, LLMs tend to produce "hallucinations" when handling domain-specific problems, outputting answers that are inaccurate or factually incorrect. This issue can be mitigated by incorporating external knowledge graphs (KGs) to assist the reasoning process of LLMs. This paper presents SbSER, a step-by-step enhanced reasoning framework for LLMs with external subgraph generation. First, it guides the LLM to perform accurate semantic parsing by generating clear subgraph schemas, converting questions into logical retrieval statements. Second, it imports knowledge triples into a graph database to enable precise knowledge retrieval. Finally, it produces the enhanced reasoning answer by combining two reasoning modes: direct retrieval reasoning and joint retrieval reasoning. Experimental results demonstrate that SbSER achieves significant improvements across multiple datasets. Building on these results, this study aims to provide a useful reference for future research on integrating KGs with LLMs, thereby enhancing the ability of LLMs to solve complex problems.
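
To make the three-stage workflow described in the abstract concrete, the following Python sketch shows how such a pipeline could be wired together. It is an illustration under stated assumptions, not the authors' implementation: the function names (generate_subgraph_schema, parse_to_query, answer), the abstract llm callable, the in-memory TripleStore standing in for the graph database, and the (head, relation) pattern standing in for a logical retrieval statement are all hypothetical placeholders for the paper's schema-guided parsing, graph-database retrieval, and direct/joint retrieval reasoning steps.

# Minimal, hypothetical sketch of an SbSER-style pipeline (not the authors' code).
# An LLM client is abstracted as a callable `llm(prompt) -> str`; the graph
# database is replaced by an in-memory triple store for illustration.

from typing import Callable, List, Optional, Tuple

Triple = Tuple[str, str, str]  # (head entity, relation, tail entity)


class TripleStore:
    """Stand-in for a graph database holding knowledge triples."""

    def __init__(self, triples: List[Triple]):
        self.triples = triples

    def query(self, head: Optional[str] = None, relation: Optional[str] = None) -> List[Triple]:
        # Precise retrieval: match on whichever fields the logical query fixes.
        return [t for t in self.triples
                if (head is None or t[0] == head)
                and (relation is None or t[1] == relation)]


def generate_subgraph_schema(llm: Callable[[str], str], question: str) -> str:
    # Step 1a: ask the LLM for a clear subgraph schema (entity and relation
    # types) relevant to the question, which constrains the later parse.
    return llm(f"List the entity and relation types needed to answer: {question}")


def parse_to_query(llm: Callable[[str], str], question: str, schema: str) -> Tuple[str, str]:
    # Step 1b: schema-guided semantic parsing of the question into a logical
    # retrieval statement; here simplified to a (head, relation) pattern.
    raw = llm(f"Schema: {schema}\nQuestion: {question}\nReturn 'head|relation'.")
    head, relation = raw.split("|", 1)
    return head.strip(), relation.strip()


def answer(llm: Callable[[str], str], store: TripleStore, question: str) -> str:
    schema = generate_subgraph_schema(llm, question)
    head, relation = parse_to_query(llm, question, schema)

    # Step 2: precise knowledge retrieval from the (mock) graph database.
    retrieved = store.query(head=head, relation=relation)

    # Step 3a: direct retrieval reasoning -- answer straight from a retrieved triple.
    if retrieved:
        return retrieved[0][2]

    # Step 3b: joint retrieval reasoning -- combine whatever partial evidence
    # exists about the head entity with the LLM's own knowledge.
    context = "; ".join(f"{h} {r} {t}" for h, r, t in store.query(head=head))
    return llm(f"Context: {context}\nQuestion: {question}\nAnswer concisely.")

In the framework itself, the logical retrieval statements would target an actual graph database loaded with the knowledge triples rather than the toy (head, relation) pattern used above; the sketch only illustrates how schema generation, retrieval, and the two reasoning modes fit together step by step.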

Key words: large language model, subgraph generation, step-by-step reasoning
