首页> 外国专利> SHUFFLE REDUCE TASKS TO REDUCE I/O OVERHEAD

SHUFFLE REDUCE TASKS TO REDUCE I/O OVERHEAD

机译:随机减少了减少I / O开销的任务

摘要

A Shuffle Reduce operation receives as input files that have been sorted and written by different map tasks, fetches batches of data from each input file, and merges and sorts the batches of data to form a large unified piece of data. A Shuffle Reduce operation is applied to the unified piece of data to produce output data. The Shuffle Reduce operation includes a commutative reduce operation that provides an amount of output data that is significantly less than an amount of input data. The output data is written to memory. The process is repeated for different batches of data until data from each input file is entirely consumed and the output data has been fully formed. The Shuffle Reduce operation greatly reduces the data size that needs to be read by the reduce tasks in a Shuffle operation, thereby significantly reducing the input/output overhead and total execution time.
机译:随着Shuffle减少操作作为由不同地图任务进行排序和写入的输入文件,从每个输入文件获取批次的数据,并合并并对批量数据进行排序以形成大统一数据。将Shuffle减少操作应用于统一数据以产生输出数据。 Shuffle减少操作包括换向性的减少操作,提供了大于输入数据量的输出数据量。输出数据写入存储器。对于不同批次的数据重复该过程,直到来自每个输入文件的数据完全消耗并且输出数据已完全形成。随机减少操作大大降低了减少任务中的数据大小,从而显着降低了输入/输出开销和总执行时间。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号