首页> 外文会议>HomeNet.TV Plus >Declustering large multidimensional data sets for range queries over heterogeneous disks

【24h】

Declustering large multidimensional data sets for range queries over heterogeneous disks

机译：整理大型多维数据集以用于异构磁盘上的范围查询

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Declustering is a technique to distribute data sets over multiple disks so that future retrievals can be well balanced over the disks and be performed in parallel. Although clusters often have heterogeneous disks, most declustering work has focused only on homogeneous environments. In this work, we investigate the declustering problem for a heterogeneous disk environment using virtual servers, and propose approaches for deciding the number of virtual servers and the mapping between virtual servers and physical disks. Our experimental results show that by combining our algorithm for choosing the number of virtual servers with a greedy algorithm for mapping virtual servers to disks, users can expect range query retrieval performance within 4% of the optimum achievable in practice on average, in all configurations studied. Compared to an intuitively natural approach to the problem, this represents an improvement of 8-31% in average fetch ratio, as well a 26-38% reduction in the standard deviation of performance for small queries.

机译：群集是一种将数据集分布在多个磁盘上的技术，以便将来的检索可以在磁盘上得到很好的平衡，并可以并行执行。尽管群集通常具有异构磁盘，但是大多数群集工作仅集中在同类环境上。在这项工作中，我们调查使用虚拟服务器的异构磁盘环境的分簇问题，并提出确定虚拟服务器数量以及虚拟服务器和物理磁盘之间映射的方法。我们的实验结果表明，通过将我们选择虚拟服务器数量的算法与用于将虚拟服务器映射到磁盘的贪婪算法相结合，在所有研究的配置中，用户可以期望范围查询检索性能平均在实践中可实现的最佳平均值的4％之内。与直观地自然解决该问题的方法相比，这表示平均提取率提高了8-31％，对于小查询，性能的标准偏差降低了26-38％。

著录项

来源
《HomeNet.TV Plus》|1999年|p.212-221|共10页
会议地点
作者
Jonghyun Lee; Winslett M.; Xiaosong Ma; Shengke Yu;
展开▼
作者单位

Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. From Discrepancy to Declustering: Near-Optimal Multidimensional Declustering Strategies for Range Queries [J] . Chung-Min Chen, Christine T. Cheng Journal of the Association for Computing Machinery . 2004,第1期

机译：从差异到聚类：范围查询的近最佳多维聚类策略
2. Scalability analysis of declustering methods for multidimensional range queries [J] . Moon B., Saltz J.H. IEEE Transactions on Knowledge and Data Engineering . 1998,第2期

机译：多维范围查询的分簇方法的可伸缩性分析
3. Skiptree: A new scalable distributed data structure on multidimensional data supporting range-queries [J] . Saeed Alaei, Mohammad Ghodsi, Mohammad Toossi Computer Communications . 2010,第1期

机译：Skiptree：一种新的可扩展的分布式数据结构，用于支持范围查询的多维数据
4. Declustering large multidimensional data sets for range queries over heterogeneous disks [C] . Jonghyun Lee, Winslett, M., . 2003

机译：整理大型多维数据集以用于异构磁盘上的范围查询
5. Techniques for Accelerating Aggregated Range Queries on Large Multidimensional Datasets in Interactive Visual Exploration [D] . ?Wang, Zhe 2019

机译：在交互式视觉探索中加速大型多维数据集的聚合范围查询的技术
6. A rule driven bi-directional translation system for remapping queries and result sets between a mediated schema and heterogeneous data sources. [O] . R. Shaker, P. Mork, M. Barclay, 2002

机译：规则驱动的双向翻译系统用于在中介模式与异构数据源之间重新映射查询和结果集。
7. Scalability Analysis of Declustering Methods for Multidimensional Range Queries [O] . Bongki Moon, Joel H. Saltz 1998

机译：多维范围查询的分簇方法的可伸缩性分析
8. Declustering databases on heterogeneous disk systems [R] . Chen, L. T. , Rotem, D. , Seshadri, S. 1995

机译：异构磁盘系统上的数据库集群

Declustering large multidimensional data sets for range queries over heterogeneous disks

摘要

著录项

相似文献

相关主题

期刊订阅