首页> 外文OA文献 >O efeito da largura de Fetch no desempenho das arquiteturas super escalar, trace cache e DTSVLIW
【2h】

O efeito da largura de Fetch no desempenho das arquiteturas super escalar, trace cache e DTSVLIW

机译:访存宽度对超标量,跟踪缓存和DTSVLIW架构的性能的影响

摘要

Superscalar machines fetch multiple scalar instructions per cycle from the instruction cache. However, machines that fetch no more than one instruction per cycle fromthe instruction cache, such as Dynamically Trace Scheduled VLIW (DTSVLIW) machines, have shown performance comparable to that of Superscalars. In this paper we present experiments which show that fetching a single instruction from the instruction cache per cycle allows the same performance achieved fetching multiple instructions per cycle thanks to the execution locality present in programs. We alsopresent the first direct comparison between the Superscalars, Trace Cache and DTSVLIW architectures. Our results show that a DTSVLIW machine capable of executing up to 16 instructions per cycle can perform 21.9% better than aSuperscalar and 6.6% better than a Trace Cache with equivalent hardware. In the comparison between a DTSVLIW machine and an Alpha 21264 machine, we have shown that the DTSVLIW can perform 24,17% better than Alpha using integer programs, and 60,36% better than Alpha using floating point programs.
机译:超标量机器每个周期从指令高速缓存中获取多个标量指令。但是,每个周期从指令缓存中获取不超过一条指令的机器(例如动态跟踪调度的VLIW(DTSVLIW)机器)已显示出与超级标量相当的性能。在本文中,我们提供了一些实验,这些实验表明,由于程序中存在执行局部性,因此每个周期从指令缓存中提取一条指令可以实现相同的性能,从而每个周期获取多个指令。我们还展示了超标量,跟踪缓存和DTSVLIW体系结构之间的首次直接比较。我们的结果表明,具有相同硬件的DTSVLIW机器每个周期最多可以执行16条指令,其性能比超级标量高21.9%,比跟踪高速缓存好6.6%。在DTSVLIW机器和Alpha 21264机器之间的比较中,我们显示了使用整数程序,DTSVLIW的性能比Alpha好24.17%,而使用浮点程序的性能要比Alpha好60.36%。

著录项

  • 作者

    Freitas Christian Daros de;

  • 作者单位
  • 年度 2003
  • 总页数
  • 原文格式 PDF
  • 正文语种 por
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号