Journal of Frontiers of Computer Science and Technology ›› 2014, Vol. 8 ›› Issue (6): 694-703.DOI: 10.3778/j.issn.1673-9418.1311021

Previous Articles     Next Articles

Research on Traceability Recovery Between Documentation and Source Code Based on Software Structure

YANG Bingxian+, LIU Chao   

  1. School of Computer Science and Engineering, Beihang University, Beijing 100191, China
  • Online:2014-06-01 Published:2014-05-30

基于软件结构的文档与代码间可追踪性研究

杨丙贤+,刘  超   

  1. 北京航空航天大学 计算机学院,北京 100191

Abstract: Traceability recovery between documentation and source code plays an important role in program comprehension, software maintenance and other software activities. Most of researches on the traceability recovery are based on text similarity, but few researches take full advantage of structure information in documentation and code. This paper presents a method for traceability recovery between documentation and code by combining information retrieval with structure information. Through the analysis of structure information in documentation and code, this method improves the effect of preprocessing and optimizes the similarity calculation process. What’s more, it improves overall effectiveness. The experimental results show that compared with simple approaches based on information retrieval, the proposed method can increase the recall and precision and extract more traceability links.

Key words: software structure, information retrieval, traceability recovery

摘要: 正确建立软件文档与代码间的可追踪关系对程序理解、软件维护等非常重要。近年来,软件文档与代码间的可追踪性研究大多基于文本词汇相似度,没有充分利用软件文档和代码所蕴含的结构信息,针对这一问题提出了将软件结构信息与信息检索模型相结合进行文档与代码间可追踪性分析的方法。通过对文档和代码结构信息的分析,改善预处理效果,优化相似度计算过程,进而提高整体方法的有效性。实验结果表明,该方法比单纯基于信息检索的方法在查全率和查准率上都有所提高,而且能提取到更多的可追踪性链。

关键词: 软件结构, 信息检索, 可追踪性