【24h】

DMDC

机译:DMDC

获取原文

摘要

One of the main challenges of modern processor design is the implementation of a scalable and efficient mechanism to detect memory access order violations as a result of out-of-order execution of memory instructions. Traditional CAM-based associative queues can be very slow and energy hungry. In this paper we introduce two new management schemes. The first one is a filtering scheme based on simple age-tracking. This scheme can easily avoid 95-98% of associative load queue (LQ) searches using only a few registers. This translates into significant power savings. More importantly, however, this filtering makes our second scheme, Delayed Memory Dependence Checking (DMDC), practical. With a small hash table, DMDC completely avoids the need for an associative LQ and relies on indexing-based checking at the commit phase and hence cuts the energy spent on LQ by an average of 95%. At an average of about 0.3%, the performance impact is negligible. When the energy cost of the increased execution time is factored in, the processor still makes net energy savings of about 3-8%, depending on the configuration and the applications.
机译:现代处理器设计的主要挑战之一是实现可伸缩且高效的机制,以检测由于内存指令的乱序执行而导致的内存访问顺序违规。传统的基于CAM的关联队列可能非常慢且耗能。在本文中,我们介绍了两种新的管理方案。第一个是基于简单年龄跟踪的过滤方案。此方案仅使用几个寄存器即可轻松避免进行95%到98%的关联负载队列(LQ)搜索。这转化为大量的功率节省。但是,更重要的是,这种过滤使我们的第二种方案“延迟内存相关性检查(DMDC)”变得切实可行。借助较小的哈希表,DMDC完全避免了对关联LQ的需求,并在提交阶段依靠基于索引的检查,因此将LQ上的能耗平均减少了95%。平均约0.3%的性能影响可忽略不计。考虑到增加的执行时间的能源成本后,处理器仍然可以节省约3-8%的净能源,具体取决于配置和应用。

相似文献

  • 外文文献
  • 中文文献
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号