首页> 外文会议>2010 IEEE International Symposium on Parallel amp; Distributed Processing (IPDPS) >Analyzing the soft error resilience of linear solvers on multicore multiprocessors
【24h】

Analyzing the soft error resilience of linear solvers on multicore multiprocessors

机译:分析多核多处理器上线性求解器的软错误恢复能力

获取原文
获取原文并翻译 | 示例

摘要

As chip transistor densities continue to increase, soft errors (bit flips) are becoming a significant concern in networked multiprocessors with multicore nodes. Large cache structures in multicore processors are especially susceptible to soft errors as they occupy a significant portion of the chip area. In this paper, we consider the impacts of soft errors in caches on the resilience and energy efficiency of sparse linear solvers. In particular, we focus on two widely used sparse iterative solvers, namely Conjugate Gradient (CG) and Generalized Minimum Residuals (GMRES). We propose two adaptive schemes, (i) a Write Eviction Hybrid ECC (WEH-ECC) scheme for the L1 cache and (ii) a Prefetcher Based Adaptive ECC (PBA-ECC) scheme for the L2 cache, and evaluate the energy and reliability trade-offs they bring in the context of GMRES and CG solvers. Our evaluations indicate that WEH-ECC reduces the CG and GMRES soft error vulnerability by a factor of 18 to 220 in L1 cache, relative to an unprotected L1 cache, and energy consumption by 16%, relative to a cache with strong protection. The PBA-ECC scheme reduces the CG and GMRES soft error vulnerability by a factor of 9 × 103 to 8.6 × 109, relative to an unprotected L2 cache, and reduces the energy consumption by 8.5%, relative to a cache with strong ECC protection. Our energy overheads over unprotected L1 and L2 caches are 5% and 14% respectively.
机译:随着芯片晶体管密度的不断提高,在具有多核节点的网络化多处理器中,软错误(位翻转)正成为一个重要问题。多核处理器中的大型缓存结构特别容易遭受软错误,因为它们占据了芯片面积的很大一部分。在本文中,我们考虑了缓存中的软错误对稀疏线性求解器的弹性和能效的影响。特别是,我们专注于两个广泛使用的稀疏迭代求解器,即共轭梯度(CG)和广义最小残差(GMRES)。我们提出了两种自适应方案,(i)用于L1缓存的写逐出混合ECC(WEH-ECC)方案和(ii)用于L2缓存的基于预取器的自适应ECC(PBA-ECC)方案,并评估了能量和可靠性他们在GMRES和CG解决方案的背景下进行权衡。我们的评估表明,相对于不受保护的L1缓存,WEH-ECC在L1缓存中将CG和GMRES软错误漏洞减少了18到220倍,相对于具有强大保护的缓存,其能耗降低了16%。 PBA-ECC方案将CG和GMRES软错误漏洞降低了9 ƒ–10 3 到8.6 ƒ–10 9 相对于具有强大ECC保护的缓存,它不受保护的L2缓存并能将能耗降低8.5%。我们在不受保护的L1和L2缓存上的能源开销分别为5%和14%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号