...
首页> 外文期刊>Computation >Evaluation of External Memory Access Performance on a High-End FPGA Hybrid Computer
【24h】

Evaluation of External Memory Access Performance on a High-End FPGA Hybrid Computer

机译:高端FPGA混合计算机上的外部存储器访问性能评估

获取原文

摘要

The motivation of this research was to evaluate the main memory performance of a hybrid super computer such as the Convey HC-x, and ascertain how the controller performs in several access scenarios, vis-à-vis hand-coded memory prefetches. Such memory patterns are very useful in stencil computations. The theoretical bandwidth of the memory of the Convey is compared with the results of our measurements. The accurate study of the memory subsystem is particularly useful for users when they are developing their application-specific personality. Experiments were performed to measure the bandwidth between the coprocessor and the memory subsystem. The experiments aimed mainly at measuring the reading access speed of the memory from Application Engines (FPGAs). Different ways of accessing data were used in order to find the most efficient way to access memory. This way was proposed for future work in the Convey HC-x. When performing a series of accesses to memory, non-uniform latencies occur. The Memory Controller of the Convey HC-x in the coprocessor attempts to cover this latency. We measure memory efficiency as a ratio of the number of memory accesses and the number of execution cycles. The result of this measurement converges to one in most cases. In addition, we performed experiments with hand-coded memory accesses. The analysis of the experimental results shows how the memory subsystem and Memory Controllers work. From this work we conclude that the memory controllers do an excellent job, largely because (transparently to the user) they seem to cache large amounts of data, and hence hand-coding is not needed in most situations.
机译:这项研究的目的是评估混合超级计算机(例如Convey HC-x)的主内存性能,并确定控制器在几种访问方案下(相对于手动编码的内存预取)的性能。这样的存储模式在模板计算中非常有用。 Convey存储器的理论带宽与我们的测量结果进行了比较。当用户发展其特定于应用程序的个性时,对内存子系统的准确研究特别有用。进行实验以测量协处理器和内存子系统之间的带宽。这些实验主要旨在测量来自应用引擎(FPGA)的内存的读取访问速度。为了找到最有效的访问内存的方式,使用了不同的访问数据的方式。建议在Convey HC-x的将来工作中使用这种方式。当执行对存储器的一系列访问时,会出现不均匀的延迟。协处理器中Convey HC-x的内存控制器试图弥补此延迟。我们将内存效率衡量为内存访问次数与执行周期数之比。在大多数情况下,此测量结果收敛到一个。此外,我们使用手工编码的内存访问进行了实验。对实验结果的分析显示了内存子系统和内存控制器如何工作。通过这项工作,我们得出结论,内存控制器的工作非常出色,主要是因为(对于用户而言,透明的)它们似乎缓存了大量数据,因此在大多数情况下不需要手动编码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号