Journal: IEEE Transactions on Knowledge and Data Engineering

Indexing useful structural patterns for XML query processing


Abstract

Queries on semistructured data are hard to process because of the complex nature of the data, and they call for specialized techniques. Existing path-based indexes and query processing algorithms are inefficient when searching for complex structures beyond simple paths, even when the queries are highly selective. We introduce minimal infrequent structures (MIS): structures that 1) exist in the data, 2) are not frequent with respect to a support threshold, and 3) have only frequent proper substructures. By indexing the occurrences of MIS, we can efficiently locate the highly selective substructures of a query, which significantly improves search performance. We propose an efficient data mining algorithm that finds the minimal infrequent structures. Their occurrences in the XML data are then indexed by a lightweight data structure and used as a fast filter step in query evaluation. We validate the efficiency and applicability of our methods through experiments on both synthetic and real data.
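The three conditions that define an MIS, and the use of MIS occurrences as a filter, can be illustrated with a small sketch. This is not the algorithm from the paper, which the abstract does not describe. It is an Apriori-style level-wise search over a deliberately simplified model, and everything in it is an assumption made for illustration: a document is a set of parent-child label edges, a "structure" is any set of such edges (connectivity and tree shape are ignored), a structure is "frequent" when at least `min_support` documents contain it, and the names `support`, `mine_mis`, and `filter_candidates` are hypothetical.

```python
from itertools import combinations


def support(structure, docs):
    """Return the ids of the documents that contain every edge of the structure."""
    return {doc_id for doc_id, edges in docs.items() if structure <= edges}


def mine_mis(docs, min_support):
    """Level-wise search for minimal infrequent structures (toy model).

    Returns a dict mapping each MIS (a frozenset of edges) to the set of
    document ids in which it occurs. Assumption: "frequent" means
    support >= min_support.
    """
    mis_index = {}
    frequent = set()

    # Level 1: single edges. Every edge comes from the data, so it exists.
    for edge in set().union(*docs.values()):
        structure = frozenset([edge])
        occurrences = support(structure, docs)
        if len(occurrences) >= min_support:
            frequent.add(structure)
        else:
            mis_index[structure] = occurrences

    size = 1
    while frequent:
        # A candidate of size+1 is kept only if all of its size-sized
        # substructures are frequent (condition 3). Smaller substructures
        # are then frequent too, because support never grows with size.
        candidates = set()
        for a, b in combinations(frequent, 2):
            merged = a | b
            if len(merged) == size + 1 and all(
                frozenset(sub) in frequent for sub in combinations(merged, size)
            ):
                candidates.add(merged)

        next_frequent = set()
        for candidate in candidates:
            occurrences = support(candidate, docs)
            if len(occurrences) >= min_support:
                next_frequent.add(candidate)
            elif occurrences:
                # Exists in the data (condition 1) but is infrequent (condition 2).
                mis_index[candidate] = occurrences
        frequent = next_frequent
        size += 1

    return mis_index


def filter_candidates(query, mis_index, docs):
    """Filter step: keep only documents that contain every MIS found in the query."""
    remaining = set(docs)
    for structure, occurrences in mis_index.items():
        if structure <= query:
            remaining &= occurrences
    return remaining


# Hypothetical toy data: four documents described by parent-child label edges.
T, A = ("book", "title"), ("book", "author")
P, N = ("book", "price"), ("author", "name")
docs = {1: {T, A, N}, 2: {T, A, P}, 3: {T, P}, 4: {A, N}}

mis_index = mine_mis(docs, min_support=2)
candidates = filter_candidates(frozenset({T, A, P}), mis_index, docs)
```

In this toy data every single edge is frequent, and the two MIS are the pairs {T, N} and {A, P}. A query containing {A, P} is narrowed to document 2 before any full structural matching takes place, while a query that contains no MIS is not narrowed at all and still has to be evaluated by other means.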
