首页> 外文会议>IEEE International Conference on Advanced Information Networking and Applications >Query-Adaptive Online Partitioning of Associated Data for Efficient Retrieval
【24h】

Query-Adaptive Online Partitioning of Associated Data for Efficient Retrieval

机译:用于有效检索的关联数据的查询自适应在线分区

获取原文

摘要

Data partitioning is a crucial component of any distributed storage system that wants to scale. For retrieval efficiency, data frequently requested together in the same query should be placed on the same server as much as possible. Although intuitive, this is not easy to be implemented if constrained by load balancing, computationally, it is an NP hard problem. Existing research has offered approximate solutions optimized for a given workload of queries, in which the order as to when each query is received is not considered. This paper initiates a new study on online partitioning algorithms that are sequentially optimized for a query sequence. In the new problem, the queries arrive in a stream manner, unknown, and given the option to revise the partition after each query, the objective is to minimize the total query processing cost and data migration cost. We formulate this problem formally, investigate several online heuristics, and evaluate them using simulation.
机译:数据分区是任何想要扩展的分布式存储系统的关键组成部分。为了提高检索效率,在同一查询中经常一起请求的数据应尽可能放置在同一服务器上。尽管直观,但是如果受到负载平衡的约束,这不容易实现,但是在计算上,这是一个NP难题。现有研究已经提供了针对给定查询工作量优化的近似解决方案,其中不考虑有关接收每个查询的时间顺序。本文启动了对在线分区算法的新研究,该算法针对查询序列进行了顺序优化。在新问题中,查询以未知的流方式到达,并且在每个查询后都可以选择修改分区,目的是使总查询处理成本和数据迁移成本最小化。我们正式提出这个问题,研究几种在线启发式方法,并使用仿真对其进行评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号