计算机科学与探索 ›› 2010, Vol. 4 ›› Issue (10): 918-926.DOI: 10.3778/j.issn.1673-9418.2010.10.006

• 学术研究 • 上一篇    下一篇

面向更新的扩展Dewey编码*

周军锋+;魏 蕊; 郭景峰

  

  1. 燕山大学 信息科学与工程学院, 河北 秦皇岛 066004
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2010-10-01 发布日期:2010-10-01
  • 通讯作者: 周军锋

Update-Oriented Extending Dewey Labeling Scheme*

ZHOU Junfeng+; WEI Rui;GUO Jingfeng

  

  1. School of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei 066004, China
  • Received:1900-01-01 Revised:1900-01-01 Online:2010-10-01 Published:2010-10-01
  • Contact: ZHOU Junfeng

摘要: 依赖于特定编码方案的高效查询处理算法是有效获取信息的必要手段, 扩展Dewey编码以其祖先名称可知性的特点, 在处理结构化查询时可显著减少需要扫描的元素数量, 加快查询处理的速度。针对扩展Dewey编码不支持更新和依赖于DTD的缺陷, 提出一种支持插入操作的动态扩展Dewey编码(DED), 可避免执行插入操作时对已有结点的重新编码操作; 提出一种支持DTD更新操作的动态有限状态转换器(DFST), 可避免由于导出DTD的变化所导致的编码失效问题。最后通过实验验证了该编码的有效性。

关键词: 可扩展标示语言, 扩展Dewey, 更新

Abstract: Efficient query processing algorithms that depend on a special labeling scheme are indispensable for extracting desired information from XML data. The extended Dewey (ED) labeling scheme has the feature that for a certain element, all its ancestors’ names can be derived from its labels, which greatly reduce the number of scanned elements. However, ED labeling scheme doesn’t support update operation, and the encoding and decoding opera-tions severely depend on a finite state transducer (FST) generated from the document type definition (DTD) schema, which makes it infeasible in the case where DTD is changed. This paper proposes a fully dynamic extended Dewey (DED) labeling scheme that can completely avoid the re-labeling operation when inserting new node, and also gives a dynamic FST (DFST) to avoid the invalidation problem of existing labels when the FST needs to be changed. The experimental results show the effectiveness of the DED labeling scheme according to different metrics.

Key words: extensible markup language (XML), extended Dewey (ED), update

中图分类号: