首页> 外文会议> >On improving the performance of data partitioning oriented parallel irregular reductions
【24h】

On improving the performance of data partitioning oriented parallel irregular reductions

机译:关于提高数据分区性能的并行不规则并行化

获取原文

摘要

Different parallelization techniques for reductions have been classified in this paper into two classes: LPO (loop partitioning-oriented techniques) and DPO (data partitioning-oriented techniques). We have analyzed both classes in terms of a set of performance properties: data locality, memory overhead, parallelism and workload balancing. We propose several techniques to increase the exploited parallelism and to introduce load balancing into a DPO method. Regarding parallelism, the solution is based on the partial expansion of the reduction array. For load balancing, the first technique is generic, as it can deal with any kind of load unbalance present in the problem domain. The second technique handles a special case of load unbalancing appearing when there are a large number of write operations on small regions of the reduction arrays. Efficient implementations of the proposed optimizing solutions for the DWA-LIP (data write affinity-loop index prefetching) DPO method are presented, experimentally tested on static and dynamic kernel codes, and compared with other parallel reduction methods.
机译:本文将不同的并行化减少技术分为两类:LPO(面向循环分区的技术)和DPO(面向数据分区的技术)。我们已经根据一组性能属性对这两个类进行了分析:数据局部性,内存开销,并行性和工作负载平衡。我们提出了几种技术来增加被利用的并行性,并将负载平衡引入DPO方法中。关于并行性,该解决方案基于约简阵列的部分扩展。对于负载平衡,第一种技术是通用的,因为它可以处理问题域中存在的任何类型的负载不平衡。第二种技术处理在缩减数组的较小区域上执行大量写操作时出现的负载不平衡的特殊情况。提出了针对DWA-LIP(数据写入亲和力循环索引预取)DPO方法的建议优化解决方案的有效实现,并在静态和动态内核代码上进行了实验测试,并与其他并行归约方法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号