首页> 美国政府科技报告 >Performance Impact of Data Reuse in Parallel Dense Cholesky Factorization
【24h】

Performance Impact of Data Reuse in Parallel Dense Cholesky Factorization

机译:并行密集Cholesky分解中数据重用的性能影响

获取原文

摘要

The paper explores performance issues for several prominent approaches toparallel dense Cholesky factorization. The primary focus is on issues that arise when blocking techniques are integrated into parallel factorization approaches to improve data reuse in the memory hierarchy. The authors first consider panel-oriented approaches, where sets of contiguous columns are manipulated as single units. These methods represent natural extensions of the column-oriented methods that have been widely used previously. On machines with memory hierarchies, panel-oriented methods significantly increase the achieved performance over column-oriented methods. However, the authors find that panel-oriented methods do not expose enough concurrency for problems that one might reasonably expect to solve on moderately machines, thus significantly limiting their performance. The authors then explore block-oriented approaches, where square submatrices are manipulated instead of sets of columns. These methods greatly increase the amount of available concurrency, thus alleviating the problems encountered with panel-oriented methods. However, a number of issues, including scheduling choices and block-placement issues, complicate their implementation. The authors discuss these issues and consider approaches that solve the resulting problems. The resulting block-oriented implementation yields high processor utilization levels over a wide range of problem sizes.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号