Communication Optimization for the 16-Core Epiphany Floating-Point Processor Array

Abstract

Managing and optimizing communication on a bespoke NoC-based (network-on-chip) computing platform such as the Parallella (Zynq 7010 + Epiphany-III SoC) is critical to the performance and energy efficiency of floating-point bulk-synchronous workloads. In this paper, we explore the opportunities and capabilities of the Epiphany-III SoC for communication-intensive workloads. Using our communication support library for the Epiphany, we accelerate single-precision bulk-synchronous parallel (BSP) workloads such as sparse matrix-vector multiplication (SpMV) on Matrix Market datasets by up to 6.5× and the PageRank algorithm on the BerkStan SNAP dataset by up to 8×, while lowering power usage by 2× relative to optimized ARM-based implementations. Compared to optimized OpenMP x86 mappings, we observe a ≈10× improvement in energy efficiency (GFLOP/s/W) with the Epiphany SoC.

Bibliographic details

Source: The 24th IEEE International Symposium on Field-Programmable Custom Computing Machines | 2015 | p. 26 | 1 page
Conference location: Washington, DC (US)
Authors: Nachiket Kapre; Siddhartha
Author affiliations: Nanyang Technol. Univ., Singapore, Singapore (listed for both authors)
Original format: PDF
Language: English
Keywords: Optimization; Sparse matrices; Libraries; Parallel processing; Acceleration; Benchmark testing; Network-on-chip

Similar literature

1. A 16-Core Processor With Shared-Memory and Message-Passing Communications [J]. IEEE Transactions on Circuits and Systems I: Regular Papers, 2014, no. 4.
2. Hardware-software optimizations of reconfigurable multi-core processors for floating-point computations of large sparse matrices [J]. Xiaofang Wang. Journal of Real-Time Image Processing, 2014, no. 1.
3. A Distributed Shared Memory Model and C++ Templated Meta-Programming Interface for the Epiphany RISC Array Processor [J]. David Richie, James Ross, Jamie Infantolino. Procedia Computer Science, 2017, no. 1.
4. Communication Optimization for the 16-Core Epiphany Floating-Point Processor Array [C]. Nachiket Kapre, Siddhartha. IEEE International Symposium on Field-Programmable Custom Computing Machines, 2016.
5. Generalized Quadratically Constrained Quadratic Programming and its Applications in Array Processing and Cooperative Communications [D]. Khabbazibasmenj, Arash. 2013.
6. Optimizing the technological and informational relationship of the health care process and of the communication between physician and patient – Factors that have an impact on the process of diagnosis from the physicians and the patients perspectives [O]. VL Purcarea, DG Petrescu, IR Gheorghe. 2011.
7. A Distributed Shared Memory Model and C++ Templated Meta-Programming Interface for the Epiphany RISC Array Processor [O]. Richie, David; Ross, James; Infantolino, Jamie. 2017.
