【24h】

Replicated Declustering for Arbitrary Queries

机译:任意查询的复制聚类

获取原文
获取原文并翻译 | 示例

摘要

Declustering have attracted a lot of interest over the couple of years. Recently, declustering using replication is proposed to reduce the additive overhead of declustering. Most of the work on declustering focuses on spatial range queries. However, in many scenarios including multi-user environments, query shapes can be arbitrary. In this paper, we explore replicated declustering for arbitrary queries. Replication reduces the cost of arbitrary queries to manageable levels. First, we investigate theoretically what is possible using replication for arbitrary queries. Then, we propose a 2-copy replication strategy that achieves the theoretical limit and therefore is the best possible scheme. Using proposed scheme, an arbitrary query containing b buckets requires disk accesses bounded by [b~(1/2)] This is a significant improvement especially for small queries because using a single copy b buckets require min(b,N) disk accesses in the worst case even for small queries. Proposed scheme works for nonuniform data as well as uniform data. Finally, we extend the proposed scheme to a partial replication scheme to achieve best performance using limited replication.
机译:在过去的几年中,簇绒引起了人们的极大兴趣。近来,提出了使用复制进行去簇以减少去簇的附加开销。整理数据的大部分工作都集中在空间范围查询上。但是,在包括多用户环境在内的许多方案中,查询形状可以是任意的。在本文中,我们探讨了针对任意查询的复制分簇。复制将任意查询的成本降低到可管理的水平。首先,我们从理论上研究对任意查询使用复制的可能性。然后,我们提出了一种2副本复制策略,该策略达到了理论极限,因此是最好的方案。使用建议的方案,包含b个存储桶的任意查询都需要以[b〜(1/2)]为界的磁盘访问。这是一个重大改进,尤其是对于小型查询,因为使用单个副本b存储桶需要在磁盘中进行min(b,N)个磁盘访问最坏的情况,即使是小的查询。提议的方案适用于非均匀数据以及统一数据。最后,我们将提议的方案扩展到部分复制方案,以使用有限的复制来实现最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号