Journal of Frontiers of Computer Science and Technology ›› 2020, Vol. 14 ›› Issue (5): 740-748.DOI: 10.3778/j.issn.1673-9418.1906031

Previous Articles     Next Articles

Research on Named Entity Recognition Technology in Military Software Testing

HAN Xinxin, BEN Kerong, ZHANG Xian   

  1. College of Electronic Engineering, Navy University of Engineering, Wuhan 430033, China
  • Online:2020-05-01 Published:2020-05-08



  1. 海军工程大学 电子工程学院,武汉 430033


Named entity recognition is an important stage in the construction of knowledge graph. Based on the national military standard and software testing documents, the entity type classification and the data set construction and labeling are completed. In the field of software testing, aiming at the problem that the character and word joint entity recognition method has low recognition precision, the character level feature extraction method is improved, and the CWA-BiLSTM-CRF (character and word attention- bi-directional long short term memory-conditional random field) recognition framework is proposed. The framework consists of two parts: the first part constructs a pre-trained word fusion dictionary, inputs the words and characters together to the bi-directional long short term memory network for training, and adds attention mechanism to measure the semantic contribution of each character in the word to extract the character-level features; the second part, the character-level features and word vectors are spliced, input to the bi-directional long short term memory network for training, and then through the conditional random field to solve the problem of unreasonable sequence of label results, the entities in the text are identified. The experimental results are compared with 3 commonly used deep learning character-level feature extraction methods. Both accuracy and recall rates are improved, and the optimal F1 value is 88.93%. Experiments show that the improved method is suitable for the named entity recognition task in the military software testing field, which lays the foundation for the next construction of the knowledge graph.

Key words: software testing, knowledge graph, named entity recognition, bi-directional long short term memory (BiLSTM), conditional random field (CRF)



关键词: 软件测试, 知识图谱, 命名实体识别, 双向长短期记忆网络, 条件随机场