首页>
外国专利>
OPTIMIZING DATA PARTITIONING FOR DATA-PARALLEL COMPUTING
OPTIMIZING DATA PARTITIONING FOR DATA-PARALLEL COMPUTING
展开▼
机译:优化数据分区以进行数据并行计算
展开▼
页面导航
摘要
著录项
相似文献
摘要
A data partitioning plan is automatically generated that—given a data-parallel program and a large input dataset, and without having to first run the program on the input dataset—substantially optimizes performance of the distributed execution system that explicitly measures and infers various properties of both data and computation to perform cost estimation and optimization. Estimation may comprise inferring the cost of a candidate data partitioning plan, and optimization may comprise generating an optimal partitioning plan based on the estimated costs of computation and input/output.
展开▼