计算机科学与探索 ›› 2024, Vol. 18 ›› Issue (10): 2594-2615.DOI: 10.3778/j.issn.1673-9418.2407038

• 垂直领域大模型构建与应用专题 • 上一篇    下一篇

基于大语言模型的命名实体识别研究进展

梁佳,张丽萍,闫盛,赵宇博,张雅雯   

  1. 内蒙古师范大学 计算机科学技术学院,呼和浩特 010022
  • 出版日期:2024-10-01 发布日期:2024-09-29

Research Progress of Named Entity Recognition Based on Large Language Model

LIANG Jia, ZHANG Liping, YAN Sheng, ZHAO Yubo, ZHANG Yawen   

  1. College of Computer Science and Technology, Inner Mongolia Normal University, Hohhot 010022, China
  • Online:2024-10-01 Published:2024-09-29

摘要: 命名实体识别旨在从非结构化的文本中识别出命名实体及类型,是问答系统、机器翻译、知识图谱构建等自然语言处理技术中一项重要的基础任务。随着人工智能技术的发展,基于大语言模型的命名实体识别技术成为一大研究热点。对基于大语言模型的命名实体识别最新研究进展进行综述,概述大语言模型和命名实体识别的发展历程,简要介绍命名实体识别任务常用的数据集和评估方法,从基于规则和字典、基于统计机器学习和基于深度学习的命名实体识别方法这三方面对目前传统命名实体识别研究工作进行梳理。按照模型架构详细阐述不同大语言模型如何应用于不同领域的命名实体识别任务,并对存在的问题和改进的方向进行分析。总结当前基于大语言模型的命名实体识别任务面临的挑战,并展望未来的研究方向。

关键词: 大语言模型, 命名实体识别, 神经网络, 深度学习

Abstract: Named entity recognition aims to identify named entities and their types from unstructured text, which is an important basic task in natural language processing technologies such as question answering system, machine translation and knowledge graph. With the development of artificial intelligence, named entity recognition based on large language model has become a hot research topic. This paper reviews the latest research progress of named entity recognition based on large language model. Firstly, the development process of large language model and named entity recognition is summarized, and the commonly used datasets and evaluation methods for named entity recognition tasks are briefly introduced. This paper sorts out the traditional research work on named entity recognition from three aspects: rule-based and dictionary-based, statistical machine learning-based and deep learning-based. Secondly, how to apply different big language models to different fields of named entity recognition tasks is described in detail according to the model architecture, and the existing problems and improvement directions are analyzed. Finally, the challenges faced by named entity recognition tasks based on big language models are summarized, and future research directions are prospected.

Key words: large language model, named entity recognition, neural network, deep learning