首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Workload-Based Ordering of Multi-Dimensional Data
【24h】

Workload-Based Ordering of Multi-Dimensional Data

机译:基于工作负载的多维数据排序

获取原文
获取原文并翻译 | 示例

摘要

Transforming multi-dimensional data into a one-dimensional sequence using space-filling curves such as the Hilbert curve, the Gray curve, and the Z-curve has been studied extensively. These techniques are not sensitive to data or workload skewness, however, in practice, user-access patterns and data distributions are often very skewed in high dimensional space. It is desirable to produce a one-dimensional sequence which keeps the multi-dimensional grid cells that are queried together close to each other. This generates sequences with higher spatial locality. We propose a workload-based approach to produce one-dimensional ordering from multi-dimensional data in this paper. An extensive experimental evaluation suggests that our approach produces a high quality ordering sequence which outperforms the existing state-of-the-art Hilbert curve by a factor of 4.84, the Gray curve by a factor of 6.66, and the Z-curve by a factor of 7.26 for the number of subsequences used to answer a query; and for IO time, it outperforms the Hilbert curve by a factor of 2.20, the Gray curve by a factor of 2.25, and the Z-curve by 2.38.
机译:使用诸如希尔伯特曲线,格雷曲线和Z曲线之类的空间填充曲线将多维数据转换为一维序列已得到了广泛的研究。这些技术对数据或工作负载偏度不敏感,但是,实际上,用户访问模式和数据分布通常在高维空间中非常偏斜。期望产生一维序列,该一维序列将被一起查询的多维网格单元保持彼此靠近。这将生成具有较高空间局部性的序列。本文提出了一种基于工作负载的方法,可以从多维数据中生成一维排序。广泛的实验评估表明,我们的方法产生了高质量的排序序列,其性能比现有的最新Hilbert曲线高出4.84倍,Gray曲线高出6.66倍,Z曲线高出一倍7.26,用于回答查询的子序列数;在IO时间上,它的性能优于希尔伯特曲线的2.20倍,灰色曲线的2.25倍,Z曲线的2.38倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号