首页> 外文会议>IEEE Annual International Symposium on Field-Programmable Custom Computing Machines >CAMAS: Static and Dynamic Hybrid Cache Management for CPU-FPGA Platforms
【24h】

CAMAS: Static and Dynamic Hybrid Cache Management for CPU-FPGA Platforms

机译:CAMAS:用于CPU-FPGA平台的静态和动态混合缓存管理

获取原文

摘要

Heterogeneous computing brings the opportunity to catch up with the increasing demands of modern computing tasks. For this purpose, the CPU-FPGA platform is promising due to the high flexibility of FPGA, which enables customization for various computing tasks to boost performance and energy efficiency. Nowadays, shared coherent cache based CPU-FPGA systems (like Intel HARP and IBM POWER8 with CAPI) are proposed to enhance the communication efficiency between CPU and FPGA and simplify the programming model. In such systems, a coherent cache is attached to FPGA for the quick memory access from FPGA, and its behavior dominates the performance of the FPGA and the entire system. However, the FPGA execution tends to encounter severe cache misses on the FPGA cache, which degrades the FPGA acceleration benefits. To solve this problem, we propose CAMAS, a static and dynamic coordinated cache management approach to reduce the FPGA cache misses and enhance the AFU performance. In the static step, reuse distance analysis is applied to the memory access trace from FPGA to characterize the accessed cachelines into three types according to their locality level. Then a dynamic control with a learning mechanism performs bypassing or caching for the returned cachelines at the cache miss according to the corresponding type. Our approach combines compile-time analysis to determine the caching or bypassing preference with the run-time management equipped with a dynamic learning mechanism. Experiments on Polybench applications demonstrate an average performance improvement of 24.92% using CAMAS.
机译:异构计算带来了赶上现代计算任务不断增长的需求的机会。为此,由于FPGA的高度灵活性,CPU-FPGA平台是有前途的,它可以定制各种计算任务以提高性能和能效。如今,提出了基于共享一致缓存的CPU-FPGA系统(如Intel HARP和具有CAPI的IBM POWER8),以提高CPU和FPGA之间的通信效率并简化编程模型。在这样的系统中,一致性缓存连接到FPGA,以便从FPGA快速访问存储器,其行为支配了FPGA和整个系统的性能。但是,FPGA执行往往会在FPGA高速缓存上遇到严重的高速缓存未命中,从而降低了FPGA加速优势。为了解决这个问题,我们提出了CAMAS,一种静态和动态协调的缓存管理方法,以减少FPGA缓存丢失并提高AFU性能。在静态步骤中,将重用距离分析应用于来自FPGA的存储器访问跟踪,以根据访问的缓存行的位置级别将访问缓存行分为三种类型。然后,具有学习机制的动态控件会根据相应的类型在缓存未命中对返回的缓存行执行绕过或缓存。我们的方法将编译时分析与具有动态学习机制的运行时管理相结合,以确定缓存或旁路优先级。在Polybench应用程序上进行的实验表明,使用CAMAS可使平均性能提高24.92%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号