计算机科学与探索 ›› 2009, Vol. 3 ›› Issue (4): 347-357.DOI: 10.3778/j.issn.1673-9418.2009.04.002

• 综述·探索 • 上一篇    下一篇

话题发现与追踪技术研究

张晓艳+,王 挺   

  1. 国防科技大学 计算机学院,长沙 410073
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2009-07-15 发布日期:2009-07-15
  • 通讯作者: 张晓艳

Research of Technologies on Topic Detection and Tracking

ZHANG Xiaoyan+, WANG Ting   

  1. College of Computer, National University of Defense Technology, Changsha 410073, China
  • Received:1900-01-01 Revised:1900-01-01 Online:2009-07-15 Published:2009-07-15
  • Contact: ZHANG Xiaoyan

摘要: 话题发现与追踪以新闻流为处理对象,采用基于事件的信息组织方式进行研究,一直是自然语言处理领域里的热点。该研究借鉴大量相关研究尤其是信息检索中的经典模型和方法,取得了很大成功。首先介绍了话题发现与追踪的主要研究内容、评价方法以及发展历史;然后对其多个研究内容提出一个统一研究框架,并对该框架中的关键技术进行了详细分析;最后指出该领域中的关键问题及难点,并对未来研究做出展望。

关键词: 话题发现与追踪, 统一研究框架, 表示模型

Abstract: Topic detection and tracking (TDT) is the research that addresses event-based organization of broadcast news. It has always been an issue in the field of natural language processing from the beginning. Remarkable successes have been made with classical models and methods which are borrowed from other related researches, especially information retrieval. TDT and its primary tasks, evaluation methods and development history are first introduced. Then, an integrated research framework is provided for TDT. The key technologies of the framework are analyzed. Finally, the key problems and the difficulties in TDT are proposed and the future work is looked forward.

Key words: topic detection and tracking(TDT), integrated research framework, representation model