Journal of Frontiers of Computer Science and Technology ›› 2024, Vol. 18 ›› Issue (6): 1613-1626.DOI: 10.3778/j.issn.1673-9418.2302029

• Artificial Intelligence·Pattern Recognition • Previous Articles     Next Articles

Nested Named Entity Recognition Combining Multi-modal and Multi-span Features

QIU Yunfei, XING Haoran, YU Zhilong, ZHANG Wenwen   

  1. 1. School of Software, Liaoning Technical University, Huludao, Liaoning 123105, China
    2. School of Business Administration, Liaoning Technical University, Huludao, Liaoning 123105, China
  • Online:2024-06-01 Published:2024-05-31

联合多模态与多跨度特征的嵌套命名实体识别

邱云飞,邢浩然,于智龙,张文文   

  1. 1. 辽宁工程技术大学 软件学院,辽宁 葫芦岛 123105
    2. 辽宁工程技术大学 工商管理学院,辽宁 葫芦岛 123105

Abstract: Nested named entity recognition (NNER) has become a research hotspot in information extraction because of its increasingly important practical significance. However, due to the shortage of corpus resources, limited exhaustive windows, missing span features, etc., NNER research in vertical field has made slow progress and there are problems of entity recognition errors or omissions. To solve these problems, a vertical field NNER model based on mineralogy and corpus awareness dictionary is proposed. Firstly, the point mutual information, word frequency inverse text frequency algorithm and attention mechanism are combined to automatically integrate the corpus awareness dictionary, and the anchor text knowledge is used to improve the training accuracy of the model. Secondly, from the shared perspective, three multi-modal information fusion strategies are designed to train the encoder to learn the extended vector representation of character, glyph and vocabulary. Through triple product operation and slicing attention mechanism, the private representations captured by the multi-layer perceptron are screened and integrated to narrow the spatial gap of heterogeneous features. Thirdly, the context association between spans is determined by a bottom-up hierarchical architecture, and the proposed span set is generated. The characteristics of target span and adjacent span, target span internal characterization, target span boundary, etc. are obtained by double affine mechanism and linear classifier. Finally, the corresponding entity type label is assigned to the target span. Experimental results on six datasets show that compared with baseline model, the proposed method achieves significant performance improvement and can effectively improve the NNER task effect in low-resource scenarios.

Key words: nested named entity recognition, multi-modal, multi-task, distant supervision, mineralogy

摘要: 嵌套命名实体识别(NNER)因日趋重要的现实意义成为信息抽取的研究热点。但是,由于语料资源匮乏、穷举窗口受限以及跨度特征缺失等问题,面向垂直领域的NNER研究进展缓慢且存在实体识别错误或遗漏的问题。针对上述问题,提出一种以矿物学为研究背景,融合语料感知词典的垂直领域NNER模型。首先,结合点互信息、词频逆文本频率算法与注意力机制自动集成语料感知词典,同时扩展锚文本知识提升模型的训练精度。其次,从共享视角出发,设计三种多模态信息的融合策略,训练编码器学习字符、字形、词汇的扩展向量表示,通过三重积运算和切片注意力机制,筛选整合由多层感知机捕捉到的私有表征,缩小异质特征的空间差距。再次,以自底向上的层级架构确定跨度间的上下文关联,生成建议跨度集合,以双仿射机制和线性分类器获得目标跨度与相邻跨度、目标跨度内部表征、目标跨度边界等特征。最后,为目标跨度分配对应的实体类型标签。在六项数据集上的实验结果表明,相比于基线模型,提出的方法实现了显著的性能提升,能有效提升低资源场景下的NNER任务效果。

关键词: 嵌套命名实体识别, 多模态, 多任务, 远程监督, 矿物学