Journal: International Journal of Web Information Systems

Algorithms for structure-based grouping in XML-OLAP

Abstract

Purpose - This paper proposes efficient algorithms for TOPOLOGICAL ROLLUP (T-ROLLUP), a structure-based grouping operation over Extensible Markup Language (XML) data that computes aggregation functions over XML data with multiple hierarchical levels. Such algorithms play an important role in the online analytical processing of XML data (XML-OLAP), which enables complex analysis over XML to discover valuable information.

Design/methodology/approach - Several algorithm variants are proposed for efficient T-ROLLUP computation. First, two basic algorithms, the top-down algorithm (TDA) and the bottom-up algorithm (BUA), are presented; both use well-known structural-join algorithms. The paper then proposes more efficient algorithms, single-scan by preorder number and single-scan by postorder number (SSC-Pre/Post). These are also based on structural joins, but modify the basic algorithms so that multiple levels of grouping are computed with a single scan over the node lists. In addition, the paper attempts to adapt the algorithm for parallel execution in multi-core environments.

Findings - Experiments with XMark and synthetic XML data show the effectiveness of the proposed algorithms. The proposed algorithms perform much better than a naive implementation. In particular, SSC-Pre and SSC-Post perform better than TDA and BUA in all cases. The parallel single-scan algorithm also performs better than the ordinary basic algorithm.

Research limitations/implications - This paper focuses on the T-ROLLUP operation for XML data analysis. Other XML-OLAP operations, such as CUBE, WINDOWING, and RANKING, remain to be investigated.

Originality/value - The paper is an extended version of one of the award-winning papers at iiWAS2008.
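
The abstract does not spell out how the single-scan algorithms work, so the following is only a minimal illustrative sketch of the general idea it mentions: computing aggregates at several hierarchy levels in one pass over a node list ordered by preorder number. It is not the paper's SSC-Pre algorithm. The `Node` class, the `rollup_sum` function, the SUM aggregate, and the sample tree are all assumptions made for illustration; the ancestor test uses the common preorder/postorder numbering scheme.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Node:
    """Hypothetical node record: label plus preorder/postorder ranks."""
    name: str
    pre: int
    post: int
    value: Optional[float] = None  # measure on leaf nodes; None for grouping nodes


def is_ancestor(a: Node, d: Node) -> bool:
    # With pre/post numbering, a is an ancestor of d exactly when
    # a starts before d and finishes after d.
    return a.pre < d.pre and a.post > d.post


def rollup_sum(nodes: list[Node]) -> dict[tuple[str, int], float]:
    """Sum leaf values under every grouping node in one preorder scan.

    Assumes value-carrying nodes are leaves. A stack holds the grouping
    nodes that are still open; when one closes, its subtotal is passed
    up to its nearest open ancestor.
    """
    totals: dict[tuple[str, int], float] = {}
    stack: list[list] = []  # entries are [node, running_subtotal]

    def close_top() -> None:
        node, subtotal = stack.pop()
        totals[(node.name, node.pre)] = subtotal
        if stack:
            stack[-1][1] += subtotal

    for node in sorted(nodes, key=lambda n: n.pre):
        # Close every open group that does not contain the current node.
        while stack and not is_ancestor(stack[-1][0], node):
            close_top()
        if node.value is None:
            stack.append([node, 0.0])
        elif stack:
            stack[-1][1] += node.value

    while stack:
        close_top()
    return totals


# Hypothetical tree: site -> (region -> item, item), (region -> item)
nodes = [
    Node("site", 0, 5),
    Node("region", 1, 2),
    Node("item", 2, 0, 10.0),
    Node("item", 3, 1, 5.0),
    Node("region", 4, 4),
    Node("item", 5, 3, 7.0),
]
totals = rollup_sum(nodes)
```

In this sketch each node is visited once after sorting, and subtotals are pushed to the parent group only when a group closes, so every level of the hierarchy receives its aggregate without a separate join per level.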

Bibliographic details

  • Author affiliations

    Graduate School of Systems and Information Engineering, University of Tsukuba, Tsukuba, Japan;

    Graduate School of Systems and Information Engineering, University of Tsukuba, Tsukuba, Japan; Center for Computational Sciences, University of Tsukuba, Tsukuba, Japan;

    Graduate School of Systems and Information Engineering, University of Tsukuba, Tsukuba, Japan; Center for Computational Sciences, University of Tsukuba, Tsukuba, Japan;

  • Original format: PDF
  • Language: English
  • Keywords

    algorithmic languages; extensible markup language;
