Journal of Frontiers of Computer Science and Technology ›› 2010, Vol. 4 ›› Issue (8): 723-730. DOI: 10.3778/j.issn.1673-9418.2010.08.006

• Academic Research •

XML检索中的标签权重设置模型*

刘德喜1,2+, 万常选1,2, 刘喜平1,2, 焦贤沛1,2   

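  1. School of Information Technology, Jiangxi University of Finance & Economics, Nanchang 330013
  2. Jiangxi Key Lab of Data & Knowledge Engineering, Jiangxi University of Finance & Economics, Nanchang 330013
  • Received: 1900-01-01 Revised: 1900-01-01 Published: 2010-08-10 Released: 2010-08-10
  • Corresponding author: 刘德喜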

Tag Weighting Model for XML Retrieval*

LIU Dexi1,2+, WAN Changxuan1,2, LIU Xiping1,2, JIAO Xianpei1,2   

  1. School of Information Technology, Jiangxi University of Finance & Economics, Nanchang 330013, China
  2. Jiangxi Key Lab of Data & Knowledge Engineering, Jiangxi University of Finance & Economics, Nanchang 330013, China
  • Received: 1900-01-01 Revised: 1900-01-01 Online: 2010-08-10 Published: 2010-08-10
  • Contact: LIU Dexi

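Abstract: In XML retrieval, taking into account where a keyword occurs in a document helps improve retrieval effectiveness. A common approach assigns different weights to different tags in the document and sets a keyword's weight according to the tag of the node that contains it. However, most existing methods set tag weights manually, which is labor-intensive and highly subjective. This paper proposes measuring XML tag weights by topic generalization strength. Experimental results show that the method effectively improves the quality of XML retrieval.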

Key words: XML retrieval, tag weight, topic generalization strength

Abstract: In XML (Extensible Markup Language) retrieval, taking the position at which a term occurs into account helps improve retrieval performance. A common method assigns a weight to each tag in an XML document and integrates the tag weight into the term weighting model. However, most related work sets tag weights manually, which is subjective and labor-intensive. This paper proposes a tag weighting model based on topic generalization, in which tag weights are calculated automatically. Experimental results show that the model performs well in XML retrieval.

Key words: XML retrieval, tag weight, topic generalization
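
The common method mentioned in the abstract, weighting a term by the tag of the node that contains it, can be sketched as follows. This is only an illustration under stated assumptions: the tag names, the values in `TAG_WEIGHTS`, and the function names are hypothetical and hand-picked, which corresponds to the manual setting the abstract criticizes. The topic-generalization-based weights proposed in the paper are not specified in this abstract and are not reproduced here.

```python
import xml.etree.ElementTree as ET

# Hypothetical, hand-set tag weights (illustration only; not from the paper).
TAG_WEIGHTS = {"title": 3.0, "abstract": 2.0, "p": 1.0}
DEFAULT_WEIGHT = 1.0  # assumed weight for tags missing from TAG_WEIGHTS


def direct_text(node):
    """Text that belongs directly to `node`: its leading text plus the
    tail text that follows each of its children."""
    parts = [node.text or ""]
    parts.extend(child.tail or "" for child in node)
    return " ".join(parts)


def weighted_term_frequency(xml_text, term):
    """Sum the occurrences of `term`, each scaled by the weight of the
    tag of the element that directly contains it."""
    root = ET.fromstring(xml_text)
    term = term.lower()
    score = 0.0
    for node in root.iter():
        tokens = direct_text(node).lower().split()
        score += TAG_WEIGHTS.get(node.tag, DEFAULT_WEIGHT) * tokens.count(term)
    return score


DOC = (
    "<article>"
    "<title>XML retrieval</title>"
    "<abstract>tag weights for XML</abstract>"
    "<p>xml xml other</p>"
    "</article>"
)

# One hit in <title> (3.0), one in <abstract> (2.0), two in <p> (1.0 each).
print(weighted_term_frequency(DOC, "xml"))  # 7.0
```

With these assumed weights, a single occurrence in a title counts three times as much as one in a body paragraph. Replacing `TAG_WEIGHTS` with automatically computed values is the step the paper addresses.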
