首页> 外文期刊>Information Systems >Extracting a largest redundancy-free XML storage structure from an acyclic hypergraph in polynomial time
【24h】

Extracting a largest redundancy-free XML storage structure from an acyclic hypergraph in polynomial time

机译:在多项式时间内从非循环超图提取最大的无冗余XML存储结构

获取原文
获取原文并翻译 | 示例
       

摘要

Given a hypergraph and a set of embedded functional dependencies, we investigate the problem of determining the conditions under which we can efficiently generate redundancy-free XML storage structures with as few scheme trees as possible. Redundancy-free XML structures guarantee both economy in storage space and the absence of update anomalies, and having the least number of scheme trees requires the fewest number of joins to navigate among the data elements. We know that the general problem is intractable. The problem may still be intractable even when the hypergraph is acyclic and each hyperedge is in Boyce-Codd normal form (BCNF). As we show here, however, given an acyclic hypergraph with each hyperedge in BCNF, a polynomial-time algorithm exists that generates a largest possible redundancy-free XML storage structure. Successively generating largest possible scheme trees from among hyper-edges not already included in generated scheme trees constitutes a reasonable heuristic for finding the fewest possible scheme trees. For many practical cases, this heuristic finds the set of redundancy-free XML storage structures with the fewest number of scheme trees. In addition to a correctness proof and a complexity analysis showing that the algorithm is polynomial, we also give experimental results over randomly generated but appropriately constrained hypergraphs showing empirically that the algorithm is indeed polynomial.
机译:给定一个超图和一组嵌入式功能依赖关系,我们研究确定条件的问题,在此条件下,我们可以使用尽可能少的方案树来高效地生成无冗余的XML存储结构。无冗余的XML结构既保证了存储空间的经济性又没有更新异常,并且方案树的数量最少,需要在数据元素之间导航的联接数量最少。我们知道总的问题是棘手的。即使超图是非循环的并且每个超边采用Boyce-Codd范式(BCNF),该问题仍可能是棘手的。但是,正如我们在此处所示,给定一个具有BCNF中每个超边的非循环超图,存在一个多项式时间算法,该算法生成最大可能的无冗余XML存储结构。从尚未包括在生成的方案树中的超边缘中连续生成最大可能的方案树,构成了寻找最少的可能方案树的合理启发。对于许多实际情况,这种启发式方法可以找到方案树最少的无冗余XML存储结构集。除了显示该算法是多项式的正确性证明和复杂性分析之外,我们还针对随机生成但经过适当约束的超图给出了实验结果,凭经验表明该算法确实是多项式。

著录项

  • 来源
    《Information Systems》 |2010年第7期|P.804-824|共21页
  • 作者单位

    Department of Economics and information Systems, University of Alabama in Huntsville. Huntsville, AL 35899, United States;

    rnDepartment of Computer Science, City University of Hong Kong, Hong Kong, China;

    rnDepartment of Computer Science, Brigham Young University, Provo, UT 84602, United States;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    XML data redundancy; large XML storage structures; XML-schema generation; acyclic hypergraphs;

    机译:XML数据冗余;大型XML存储结构;XML模式生成;非循环超图;
  • 入库时间 2022-08-18 02:48:00

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号