首页> 外文会议>Simulation Multi-Conference >A study of manycore shared Memory Architecture as a way to build SOC applications
【24h】

A study of manycore shared Memory Architecture as a way to build SOC applications

机译:多核共享内存架构的研究作为构建SoC应用的一种方式

获取原文

摘要

Manycore shared memory architectures hold a significant premise to speed up and simplify SOCs. Using many homogeneous small-cores will allow replacing the hardware accelerators of SOCs by parallel algorithms communicating through shared memory. Currently shared memory is realized by maintaining cache-consistency across the cores, caching all the connected cores to one main memory module. This approach, though used today, is not likely to be scalable enough to support the high number of cores needed for highly parallel SOCs. Therefore we consider a theoretical scheme for shared memory wherein: the shared address space is divided between a set of memory modules; and a communication network allows each core to access every such module in parallel. Load-balancing between the memory modules is obtained by rehashing the memory address-space. We consider practical aspects involved with a practical realization of this scheme, e.g., how will the wire complexity of the communication network affect the execution time. We have designed a simple generic shared memory architecture, synthesized it to 2,4,8,16,..., 1024 - cores for FPGA virtex-7 and evaluated it on several parallel programs. The synthesis results and the execution measurements show that, for the FPGA, all problematic aspects of this construction can be resolved. For example, unlike ASICs, the growing complexity of the communication network is absorbed by the FPGA's routing grid and by its routing mechanism. This makes this type of architectures particularly suitable for FPGAs. We used 32-bits modified PACOBLAZE cores and tested different parameters of this architecture verifying its ability to achieve high speedups. The results suggest that re-hashing is not essential and one hash-function suffice (compared to the family of universal hash functions that is needed by the theoretical construction).
机译:Manycore共享内存架构保持重大前提以加速和简化SOC。使用许多均匀的小核心将通过通过共享内存进行通信的并行算法来替换SOC的硬件加速器。目前共享内存通过维护核心的缓存一致性来实现,将所有连接的核心缓存到一个主内存模块。这种方法虽然今天使用,但不太可能足够可扩展,以支持高度平行的SOC所需的大量核心。因此,我们考虑共享存储器的理论方案,其中:共享地址空间在一组内存模块之间划分;通信网络允许每个核心并行访问每个这样的模块。通过重新加入存储器地址空间来获得存储器模块之间的负载平衡。我们考虑参与该方案的实际实现所涉及的实际方面,例如,通信网络的电线复杂性如何影响执行时间。我们设计了一个简单的通用共享内存架构,合成为2,4,8,16,...,1024 - 用于FPGA Virtex-7的核心,并在几个并行程序中进行评估。合成结果和执行测量表明,对于FPGA,可以解决这种结构的所有问题方面。例如,与ASIC不同,通信网络的日益复杂性由FPGA的路由网格和其路由机制吸收。这使得这种类型的架构特别适用于FPGA。我们使用了32位修改的PacoBlaze核心,并测试了这种架构的不同参数验证其实现高速度的能力。结果表明,重新散列不是必需的,一个哈希功能足够(与理论建设所需的通用哈希函数的家庭相比)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号