首页> 外文学位 >Querying XML data: Efficiency and security issues.
【24h】

Querying XML data: Efficiency and security issues.

机译:查询XML数据:效率和安全性问题。

获取原文
获取原文并翻译 | 示例

摘要

XML is emerging as a widely-used platform-independent data representation language. With increasing interests in XML data, techniques concerning XML evolve rapidly. In this thesis, we study two important issues when querying XML data, efficiency and security, which are essential to an XML searching engine.;We take into consideration ID/IDREF attributes, which are common in XML documents. Most related works model an XML document with ID/IDREF attributes as a graph. We retain a tree model, called extended XML tree, in which the IDREF attribute is regarded as an IDREF node, instead of an IDREF edge to the corresponding node. Based on this model, we propose a B+-tree style index (PIN-tree) integrating the essence of the path index and the inverted list approach. A query evaluation algorithm, PINE, based on the model and the indexing is proposed. PINE handles efficiently queries with/without IDREF requests, and IDREF requests can be stated explicitly or implicitly. We prove that PINE is near optimal for twig queries without IDREF requests under the assumption that the number of distinct tag paths to a label is limited. The assumption is proven to be reasonable by experiments. The complexity of PINE for queries with IDREF requests is also given.;The security of the XML data draws as much attention as the efficiency problem. In this thesis, we study a promising approach to store the accessibility information, which is based on the Compressed Accessibility Map (CAM). We make two advancements in this direction. (1) Previous work suggests that for each user group and each operation type, a different CAM is built. We observe that the performance and storage requirements can be further improved by combining multiple CAMs into an Integrated CAM (ICAM). We explore this possibility and propose an integration mechanism. (2) If the change in structure of the XML data is not frequent, we suggest an efficient lookup method, which can be applied to CAMS or ICAMs, with a much lower time complexity compared to the previous approach.;Experiments were conducted to show the effectiveness of our approaches.
机译:XML逐渐成为一种广泛使用的与平台无关的数据表示语言。随着对XML数据越来越感兴趣,与XML有关的技术迅速发展。本文研究了XML数据查询引擎在查询XML数据时需要考虑的两个重要问题,即效率和安全性。我们考虑了XML文档中常见的ID / IDREF属性。大多数相关的作品都将具有ID / IDREF属性的XML文档建模为图形。我们保留了一个树模型,称为扩展XML树,其中IDREF属性被视为IDREF节点,而不是对应节点的IDREF边缘。基于此模型,我们提出了结合路径索引和倒排列表方法本质的B +树样式索引(PIN树)。提出了基于模型和索引的查询评估算法PINE。 PINE可以有效处理带/不带IDREF请求的查询,并且IDREF请求可以显式或隐式声明。我们证明,在没有IDREF请求的情况下,PINE对于没有IDREF请求的小枝查询几乎是最佳的,前提是假设到标签的不同标签路径的数量是有限的。实验证明该假设是合理的。还给出了使用PREF进行IDREF请求查询的复杂性。XML数据的安全性与效率问题同样引起人们的关注。在本文中,我们研究了一种基于压缩可访问性图(CAM)的可访问性信息存储方法。我们在这个方向上取得了两个进步。 (1)先前的工作表明,对于每个用户组和每种操作类型,将构建一个不同的CAM。我们观察到,通过将多个CAM组合到集成CAM(ICAM)中,可以进一步提高性能和存储要求。我们探索这种可能性并提出一种整合机制。 (2)如果XML数据的结构变化不频繁,我们建议一种有效的查找方法,该方法可以应用于CAMS或ICAM,与以前的方法相比,其时间复杂度要低得多。我们方法的有效性。

著录项

  • 作者

    Jiang, Mingfei.;

  • 作者单位

    The Chinese University of Hong Kong (Hong Kong).;

  • 授予单位 The Chinese University of Hong Kong (Hong Kong).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2006
  • 页码 143 p.
  • 总页数 143
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号