首页> 外文会议>IEEE international conference on data engineering >A tool for Internet-scale cardinality estimation of XPath queries over distributed semistructured data
【24h】

A tool for Internet-scale cardinality estimation of XPath queries over distributed semistructured data

机译:用于在分布式半结构化数据上对XPath查询进行Internet规模基数估计的工具

获取原文

摘要

We present a novel tool called XGossip for Internet-scale cardinality estimation of XPath queries over distributed XML data. XGossip relies on the principle of gossip, is scalable, decentralized, and can cope with network churn and failures. It employs a novel divide-and-conquer strategy for load balancing and reducing the overall network bandwidth consumption. It has a strong theoretical underpinning and provides provable guarantees on the accuracy of cardinality estimates, the number of messages exchanged, and the total bandwidth usage. In this demonstration, users will experience three engaging scenarios: In the first scenario, they can set up, configure, and deploy XGossip on Amazon Elastic Compute Cloud (EC2). In the second scenario, they can execute XGossip, pose XPath queries, observe in real-time the convergence speed of XGossip, the accuracy of cardinality estimates, the bandwidth usage, and the number of messages exchanged. In the third scenario, they can introduce network churn and failures during the execution of XGossip and observe how these impact the behavior of XGossip.
机译:我们提出了一个名为XGossip的新颖工具,用于对分布式XML数据上的XPath查询进行Internet规模的基数估计。 XGossip依靠八卦的原理,具有可扩展性,分散性,并且可以应对网络混乱和故障。它采用新颖的分而治之策略来实现负载平衡并减少整体网络带宽消耗。它具有强大的理论基础,并为基数估计的准确性,交换的消息数以及总带宽使用率提供了可证明的保证。在此演示中,用户将体验三种引人入胜的场景:在第一种场景中,他们可以在Amazon Elastic Compute Cloud(EC2)上设置,配置和部署XGossip。在第二种情况下,他们可以执行XGossip,提出XPath查询,实时观察XGossip的收敛速度,基数估计的准确性,带宽使用情况以及交换的消息数。在第三种情况下,他们可以在执行XGossip的过程中引入网络混乱和故障,并观察它们如何影响XGossip的行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号