Replicated Declustering for Arbitrary Queries

Abstract

Declustering has attracted a lot of interest over the last couple of years. Recently, declustering using replication has been proposed to reduce the additive overhead of declustering. Most work on declustering focuses on spatial range queries. However, in many scenarios, including multi-user environments, query shapes can be arbitrary. In this paper, we explore replicated declustering for arbitrary queries. Replication reduces the cost of arbitrary queries to manageable levels. First, we investigate theoretically what is possible using replication for arbitrary queries. Then, we propose a 2-copy replication strategy that achieves the theoretical limit and is therefore the best possible scheme. Using the proposed scheme, an arbitrary query containing b buckets requires a number of disk accesses bounded by ⌈√b⌉. This is a significant improvement, especially for small queries, because with a single copy, b buckets require min(b, N) disk accesses in the worst case. The proposed scheme works for nonuniform as well as uniform data. Finally, we extend the proposed scheme to a partial replication scheme to achieve the best performance using limited replication.
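
The two bounds quoted in the abstract can be compared numerically. The following is a minimal sketch, not the paper's algorithm: it only evaluates the formulas, assuming that N denotes the number of disks and that the garbled expression in the record reads as the ceiling of the square root of b. The function names are hypothetical and chosen for illustration.

```python
import math


def single_copy_worst_case(b: int, n_disks: int) -> int:
    """Worst-case disk accesses with one copy, as stated in the abstract: min(b, N)."""
    return min(b, n_disks)


def two_copy_bound(b: int) -> int:
    """Bound quoted for the 2-copy scheme, read here (an assumption) as ceil(sqrt(b))."""
    root = math.isqrt(b)  # integer floor of the square root, avoids float rounding
    return root if root * root == b else root + 1


def compare(bucket_counts, n_disks):
    """Return (b, single-copy worst case, 2-copy bound) for each query size b."""
    return [(b, single_copy_worst_case(b, n_disks), two_copy_bound(b))
            for b in bucket_counts]


if __name__ == "__main__":
    # N = 16 disks is an arbitrary illustrative value, not taken from the paper.
    for b, single, double in compare([4, 9, 10, 16], n_disks=16):
        print(f"b={b:2d}  single copy: {single:2d}  two copies: {double}")
```

For small queries the gap is visible immediately: under these assumptions, a 9-bucket query can cost 9 accesses with a single copy but at most 3 with two copies.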

Bibliographic details

Source
Association for Computing Machinery (ACM) Annual Symposium on Applied Computing (SAC 2004), vol. 1; March 14-17, 2004; Nicosia (CY) | 2004 | pp. 748-753 | 6 pages
Conference location: Nicosia (CY)
Author
Ali Saman Tosun;
Author affiliation

Department of Computer Science, University of Texas at San Antonio, San Antonio, TX 78249;

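Conference organizer
Original format: PDF
Language of text: English
Chinese Library Classification: computing technology, computer technology;
Keywords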
declustering; replication; arbitrary query;

Similar literature

1. Query-Log Aware Replicated Declustering [J]. Turk Ata, Yasin Oktay Kerim, Aykanat Cevdet. IEEE Transactions on Parallel and Distributed Systems. 2013, No. 5
2. Efficient parallel processing of range queries through replicated declustering [J]. Hakan Ferhatosmanoglu, Ali Saman Tosun, Guadalupe Canahuate. Distributed and Parallel Databases. 2006, No. 2
3. From Discrepancy to Declustering: Near-Optimal Multidimensional Declustering Strategies for Range Queries [J]. Chung-Min Chen, Christine T. Cheng. Journal of the Association for Computing Machinery. 2004, No. 1
4. Selective Replicated Declustering for Arbitrary Queries [C]. K. Yasin Oktay, Ata Turk, Cevdet Aykanat. Euro-Par 2009 Parallel Processing. 2009
5. Query processing in spatial database systems: Declustering and clustering techniques [D]. Ravada, Sivakumar. 1997
6. Geneshot: search engine for ranking genes from arbitrary text queries [O]. Alexander Lachmann, Brian M Schilder, Megan L Wojciechowicz, 2019
7. Selective Replicated Declustering for Arbitrary Queries [O]. K. Yasin Oktay, Ata Turk, Cevdet Aykanat. 2013
