【24h】

An Efficient Path Index for Querying Semi-structured Data

机译:查询半结构化数据的有效路径索引

获取原文
获取原文并翻译 | 示例

摘要

The richness of semi-structured data allows data of varied and inconsistent structures to be stored in a single database. Such data can be represented as a graph, and queries can be constructed using path expressions, which describe traversals through the graph. Instead of providing optimal performance for a limited range of path expressions, we propose a mechanism which is shown to have consistent and high performance for path expressions of any complexity, including those with descendant operators (path wildcards). We further detail mechanisms which employ our index to perform more complex processing, such as evaluating both path expressions containing links and entire (sub) queries containing path based predicates. Performance is shown to be independent of the number of terms in the path expression(s), even where these expressions contain wildcards. Experiments show that our index is faster than conventional methods by up to two orders of magnitude for certain query types, is compact, and scales well.
机译:半结构化数据的丰富性允许将结构不同且不一致的数据存储在单个数据库中。这样的数据可以表示为图形,并且可以使用路径表达式构造查询,该路径表达式描述了遍历图形的遍历。我们没有为有限范围的路径表达式提供最佳性能,而是提出了一种机制,该机制对任何复杂度的路径表达式(包括带有后代运算符(路径通配符)的路径表达式)都具有一致的高性能。我们进一步详细介绍了利用索引执行更复杂处理的机制,例如评估包含链接的路径表达式和包含基于路径的谓词的整个(子)查询。显示的性能与路径表达式中的术语数量无关,即使这些表达式包含通配符也是如此。实验表明,对于某些查询类型,我们的索引比常规方法快两个数量级,并且结构紧凑且可扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号