Conference: The 24th IEEE International Symposium on Field-Programmable Custom Computing Machines

Communication Optimization for the 16-Core Epiphany Floating-Point Processor Array

Abstract

The management and optimization of communication in an NoC-based (network-on-chip) bespoke computing platform such as the Parallella (Zynq 7010 + Epiphany-III SoC) is critical for the performance and energy efficiency of floating-point bulk-synchronous parallel (BSP) workloads. In this paper, we explore the opportunities and capabilities of the Epiphany-III SoC for communication-intensive workloads. Using our communication support library for the Epiphany, we accelerate single-precision BSP workloads such as sparse matrix-vector multiplication (SpMV) on Matrix Market datasets by up to 6.5× and the PageRank algorithm on the BerkStan SNAP dataset by up to 8×, while lowering power usage by 2× compared with optimized ARM-based implementations. When compared to optimized OpenMP x86 mappings, we observe a ≈10× improvement in energy efficiency (GFLOP/s/W) with the Epiphany SoC.
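
To illustrate why SpMV is communication-intensive on a 16-core array, the following is a minimal Python sketch, not the authors' communication library and not Epiphany code. It assumes a square matrix in CSR form whose rows, and the matching entries of the input vector, are split into contiguous blocks per core. The function names `spmv_csr`, `partition_rows`, and `remote_reads` are hypothetical and chosen only for this example.

```python
from typing import List, Sequence, Tuple


def spmv_csr(row_ptr: Sequence[int], col_idx: Sequence[int],
             values: Sequence[float], x: Sequence[float]) -> List[float]:
    """Compute y = A @ x for a matrix A stored in CSR form."""
    y = [0.0] * (len(row_ptr) - 1)
    for row in range(len(y)):
        acc = 0.0
        # Nonzeros of this row live in values[row_ptr[row]:row_ptr[row + 1]].
        for k in range(row_ptr[row], row_ptr[row + 1]):
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


def partition_rows(n_rows: int, n_cores: int = 16) -> List[Tuple[int, int]]:
    """Split rows into contiguous [start, end) blocks, one per core (assumed layout)."""
    base, extra = divmod(n_rows, n_cores)
    bounds = []
    start = 0
    for core in range(n_cores):
        size = base + (1 if core < extra else 0)
        bounds.append((start, start + size))
        start += size
    return bounds


def remote_reads(row_ptr: Sequence[int], col_idx: Sequence[int],
                 bounds: Sequence[Tuple[int, int]]) -> List[int]:
    """Count, per core, the distinct x entries owned by some other core.

    Assumes x is partitioned with the same bounds as the rows, so column c
    is local to a core exactly when start <= c < end for that core's block.
    """
    counts = []
    for lo, hi in bounds:
        needed = set()
        for row in range(lo, hi):
            for k in range(row_ptr[row], row_ptr[row + 1]):
                col = col_idx[k]
                if not (lo <= col < hi):
                    needed.add(col)
        counts.append(len(needed))
    return counts


# Small 3x3 example: [[2, 0, 1], [0, 3, 0], [4, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [2.0, 1.0, 3.0, 4.0, 5.0]
x = [1.0, 2.0, 3.0]

y = spmv_csr(row_ptr, col_idx, values, x)
bounds = partition_rows(3, n_cores=2)
reads = remote_reads(row_ptr, col_idx, bounds)
```

In a BSP formulation, each core would compute its block of rows in one superstep and exchange the vector entries counted by `remote_reads` before the next; for irregular sparse matrices these exchanges can dominate, which is the cost a communication library for the NoC aims to reduce.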
