XML关键词搜索结果的多样化

doi:10.3778/j.issn.1673-9418.2012.10.009

计算机科学与探索 ›› 2012, Vol. 6 ›› Issue (10): 935-947.DOI: 10.3778/j.issn.1673-9418.2012.10.009

XML关键词搜索结果的多样化

刘喜平1,2+，万常选1,2，刘德喜1,2

1. 江西财经大学信息管理学院，南昌 330013
2. 江西省高校数据与知识工程重点实验室，南昌 330013

出版日期:2012-10-01 发布日期:2012-09-28

Results Diversification for Keyword Search on XML Documents

LIU Xiping1,2+, WAN Changxuan1,2, LIU Dexi1,2

1. School of Information Technology, Jiangxi University of Finance and Economics, Nanchang 330013, China
2. Jiangxi College Key Laboratory of Data and Knowledge Engineering, Nanchang 330013, China

Online:2012-10-01 Published:2012-09-28

摘要/Abstract

摘要： 可扩展标记语言（extensible markup language，XML）数据的关键词搜索面临着搜索结果数量庞大，同质化严重和不易区分等问题，针对这些问题，提出了一种新的基于多样化的方法。首先从查询结果抽取原型以标识查询结果语义，然后根据结果原型的特点，定义了原型的兴趣度和原型之间的距离，在此基础上，实现了原型的多样化。进一步提出了一种XML关键词搜索结果组织方法，即按照原型聚集查询结果。这种组织方式能够解决上述问题。最后通过实验证明了所提方法的有效性。

关键词: 可扩展标记语言（XML）, 关键词搜索, 多样化

Abstract: Results of keyword search on extensible markup language (XML) documents are confronted with the problems of high volume, being homogenous in semantics and difficulty in differentiation. To solve these problems, this paper proposes a novel diversification-based method. It first defines the prototype of a search result to express the semantics of the result. Based on the characteristics of result prototype, it defines the interestingness of a prototype and the distance between prototypes. It then diversifies prototypes using these measures. The paper goes further to propose a new method to organize the search results of an XML keyword query, i.e., clustering the search results based on the diversified prototypes. The method can solve the above-mentioned problems. Experimental results verify that the methods are effective.

Key words: extensible markup language (XML), keyword search, diversification

刘喜平，万常选，刘德喜. XML关键词搜索结果的多样化[J]. 计算机科学与探索, 2012, 6(10): 935-947.

LIU Xiping, WAN Changxuan, LIU Dexi. Results Diversification for Keyword Search on XML Documents[J]. Journal of Frontiers of Computer Science and Technology, 2012, 6(10): 935-947.

[1]	李东，邓泽航，李祖立. 基于MapReduce的XML结构连接处理[J]. 计算机科学与探索, 2016, 10(8): 1080-1091.
[2]	范红杰，柳军飞，周鲁东，麻志毅. 多策略相似度整合的XML模式匹配方法[J]. 计算机科学与探索, 2016, 10(1): 14-24.
[3]	宋玉玲，王宁. 利用实体语义信息的关键字查询结果多样化[J]. 计算机科学与探索, 2014, 8(3): 266-274.
[4]	毕鑫，王国仁，赵相国，袁野，张盼. XML数据中Twig查询处理与优化技术研究综述[J]. 计算机科学与探索, 2013, 7(9): 769-782.
[5]	陆嘉俊，黄志球，王进，沈国华，柯昌博. 面向行为的Web服务组合隐私策略描述研究[J]. 计算机科学与探索, 2013, 7(7): 592-601.
[6]	廖湖声，李小青. XML树模式查询的描述语言及形式语义[J]. 计算机科学与探索, 2013, 7(5): 431-441.
[7]	周军锋，田姗姗，蓝国翔，陈子阳，郭景峰. TDCOL：列式存储的XML关键字查询处理策略[J]. 计算机科学与探索, 2012, 6(9): 829-843.
[8]	姜国华, 姜守旭, 王宏志, 李建中, 高宏. 标签劣质的XML数据上的查询处理 [J]. 计算机科学与探索, 2011, 5(8): 673-685.

XML关键词搜索结果的多样化

Results Diversification for Keyword Search on XML Documents

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 8

编辑推荐

Metrics