首页> 外文会议>IEEE international conference on data engineering >Cache-oblivious scheduling of shared workloads
【24h】

Cache-oblivious scheduling of shared workloads

机译:缓存令人沮丧的共享工作负载调度

获取原文
获取外文期刊封面目录资料

摘要

Shared workload optimization is feasible if the set of tasks to be executed is known in advance, as is the case in updating a set of materialized views or executing an extract-transform-load workflow. In this paper, we consider data-intensive workloads with precedence constraints arising from data dependencies. While there has been previous work on identifying common subexpressions and task re-ordering to enable shared scans, in this paper we solve the problem of scheduling shared data-intensive workloads in a cache-oblivious way. Our solution relies on a novel formulation of precedence constrained scheduling with the additional constraint that once a data item is in the cache, all tasks that require this item should execute as soon as possible thereafter. We give an optimal algorithm using A* search over the space of possible orderings, and we propose efficient and effective heuristics that obtain nearly-optimal schedules in much less time. We present experimental results on real-life data warehouse workloads and the TCP-DS benchmark to validate our claims.
机译:共享工作负载优化如果要执行的一组任务是预先已知的,则根据更新一组物化视图或执行提取变换负载工作流程时,如果要执行的一组任务,则是可行的。在本文中,我们考虑具有从数据依赖性引起的优先约束的数据密集型工作负载。虽然有了以前的工作识别常见的子表单和任务重新订购以启用共享扫描,但在本文中,我们解决了缓存忽视方式调度共享数据密集型工作负载的问题。我们的解决方案依赖于新颖的预先制定的优先权调度,其中附加约束是数据项在缓存中,要求此项目的所有任务应尽快执行。我们使用A *搜索可能订购的空间来提供最佳算法,我们提出了高效且有效的启发式,以更少的时间获得了几乎最佳的时间表。我们在现实生活数据仓库工作负载和TCP-DS基准上呈现实验结果,以验证我们的索赔。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号