首页> 外文期刊>IEEE Transactions on Computers >Multilevel optimization of pipelined caches
【24h】

Multilevel optimization of pipelined caches

机译:流水线式缓存的多级优化

获取原文
获取原文并翻译 | 示例

摘要

This paper formulates and shows how to solve the problem of selecting the cache size and depth of cache pipelining that maximizes the performance of a given instruction-set architecture. The solution combines trace-driven architectural simulations and the timing analysis of the physical implementation of the cache. Increasing cache size tends to improve performance but this improvement is limited because cache access time increases with its size. This trade-off results in an optimization problem we referred to as multilevel optimization, because it requires the simultaneous consideration of two levels of machine abstraction: the architectural level and the physical implementation level. The introduction of pipelining permits the use of larger caches without increasing their apparent access time, however, the bubbles caused by load and branch delays limit this technique. In this paper we also show how multilevel optimization can be applied to pipelined systems if software- and hardware-based strategies are considered for hiding the branch and load delays. The multilevel optimization technique is illustrated with the design of a pipelined cache for a high clock rate MIPS-based architecture. The results of this design exercise show that, because processors with pipelined caches can have shorter CPU cycle times and larger caches, a significant performance advantage is gained by using two or three pipeline stages to fetch data from the cache. Of course, the results are only optimal for the implementation technologies chosen for the design exercise; other choices could result in quite different optimal designs. The exercise is primarily to illustrate the steps in the design of pipelined caches using multilevel optimization; however, it does exemplify the importance of pipelined caches if high clock rate processors are to achieve high performance.
机译:本文阐述并展示了如何解决选择高速缓存大小和高速缓存流水线深度的问题,该问题可以最大化给定指令集体系结构的性能。该解决方案结合了跟踪驱动的架构仿真和缓存物理实施的时序分析。高速缓存大小的增加往往会提高性能,但是由于高速缓存访​​问时间随其大小而增加,因此这种改进受到限制。这种折衷会导致一个优化问题,我们称其为多级优化,因为它需要同时考虑机器抽象的两个级别:体系结构级别和物理实现级别。流水线的引入允许使用更大的缓存,而不会增加它们的视在访问时间,但是,由负载和分支延迟引起的气泡限制了该技术。在本文中,我们还展示了如果考虑基于软件和硬件的策略来隐藏分支和负载延迟,则如何将多级优化应用于流水线系统。通过针对基于MIPS的高时钟速率架构的流水线缓存设计,说明了多级优化技术。此设计工作的结果表明,由于具有流水线式缓存的处理器可以具有更短的CPU周期时间和更大的缓存,因此通过使用两个或三个流水线阶段从缓存中获取数据可以获得显着的性能优势。当然,结果仅对于为设计练习选择的实施技术是最佳的。其他选择可能导致完全不同的最佳设计。练习主要是说明使用多级优化设计流水线式缓存的步骤。但是,如果高时钟速率的处理器要获得高性能,它确实说明了流水线式缓存的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号