首页> 外文期刊>Computers, IEEE Transactions on >Cache-Based Memory Copy Hardware Accelerator for Multicore Systems
【24h】

Cache-Based Memory Copy Hardware Accelerator for Multicore Systems

机译:用于多核系统的基于缓存的内存复制硬件加速器

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we present a new architecture of the cache-based memory copy hardware accelerator in a multicore system supporting message passing. The accelerator is able to accelerate memory data movements, in particular memory copies. We perform an analytical analysis based on open-queuing theory to study the utilization of our accelerator in a multicore system. In order to correctly model the system, we gather the necessary information by utilizing a full-system simulator. We present both the simulation results and the analytical analysis. We demonstrate the advantages of our solution based on a full-system simulator utilizing several applications: the STREAM benchmark and the receiver-side of the TCP/IP stack. Our accelerator provides speedups from 2.96 to 4.61 for the receiver-side of the TCP/IP stack, reduces the number of instructions from 26 percent to 44 percent and achieves a higher cache hit rate. Utilizing the analytical analysis, our accelerator reduces in the average number of cycles executed per instruction up to 50 percent for one of the CPUs in the multicore system.
机译:在本文中,我们提出了一种在支持消息传递的多核系统中基于缓存的内存复制硬件加速器的新体系结构。加速器能够加速内存数据移动,特别是内存副本。我们基于开放排队理论进行分析分析,以研究加速器在多核系统中的利用率。为了正确地对系统建模,我们通过使用完整的系统模拟器来收集必要的信息。我们同时给出了仿真结果和分析分析。我们展示了基于基于全系统仿真器的解决方案的优势,该仿真器利用了多种应用程序:STREAM基准测试和TCP / IP堆栈的接收器端。我们的加速器为TCP / IP堆栈的接收器端提供了从2.96到4.61的加速,将指令数量从26%减少到44%,并实现了更高的缓存命中率。利用分析分析,对于多核系统中的一个CPU,我们的加速器可将每条指令执行的平均周期数减少多达50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号