首页> 外文会议>International conference on supercomputing >Increasing Memory bandwidth with wide buses: Compiler; hardware and performance trade-offs
【24h】

Increasing Memory bandwidth with wide buses: Compiler; hardware and performance trade-offs

机译:通过宽总线增加内存带宽:编译器;硬件和性能之间的权衡

获取原文

摘要

Memory latency and lack of bandwidth are the main barriers to achieve high performace fro mcurrent and future processors. Specially in numeric applications. New organizations of the memory subsystem as well as hardware and software mechnaisms to effectitveyly exploit them are required. the apper presents a new cimpilation technique to pack several load/stores into a single wide load/store, so that the number of wide load/stores is maximized. It also evaluates the performance trade-offs of wide buses and the additional register pressure, showing that it is minimal nad has negligible effets. Finally, the paper proposes a hardware mechanism ot detect and group memory accesses into wide accesses at run time, aso that binary compatibility is peserved. the evaluations are performed using 1180 loops that represent about 78
机译:内存延迟和带宽不足是当前和未来处理器实现高性能的主要障碍。特别是在数字应用中。需要存储器子系统的新组织以及有效地利用它们的硬件和软件机制。装箱工提出了一种新的扣压技术,可将多个装货/仓库打包到一个单独的宽装货/仓库中,从而最大程度地增加了宽装货/仓库的数量。它还评估了宽总线的性能折衷和附加的寄存器压力,这表明它最小,效果可忽略不计。最后,本文提出了一种硬件机制,可以在运行时检测和将内存访问分组为宽访问,以确保二进制兼容性。使用代表约78个循环的1180个循环执行评估

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号