首页> 外文学位 >VLIW processors: Efficiently exploiting instruction level parallelism.
【24h】

VLIW processors: Efficiently exploiting instruction level parallelism.

机译:VLIW处理器:有效利用指令级并行性。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation explores high-performance complexity-efficient processors focusing on VLIW processors. Complexity efficiency is a qualitative characteristic that describes a system where performance has not reached the point of diminishing returns. Using the techniques described in this dissertation, simple statically-scheduled very-long-instruction-word (VLIW) processors can be efficient architectures for exploiting instruction-level parallelism and can effectively address the needs of general purpose computing.;We studied the ability of dynamic execution to exploit instruction-level parallelism in dynamic VLIW processors. Unlike previous studies, this study explores the benefits of dynamic execution on an instruction stream with explicit instruction-level parallelism. Dynamic execution is thus applied to problems that compilers have difficulty solving rather than to those problems that compilers readily solve reducing the need for complex and costly hardware. In addition to presenting performance results, we also describe a general processor model and execution definition that improves upon the precise execution model used in traditional processors; we also describe the simulator that implements this new execution model. In our simulations we varied a number of parameters allowing extraction of the individual effects of each parameter on performance. These simulation results show that although a small amount of reordering is adequate to eliminate almost all penalties associated with scheduling errors and latency variations, even a significant amount of reordering is inadequate to eliminate the penalty associated with branch mispredictions, and long memory latencies.;As an alternative to dynamic VLIW processors, we developed Replay Buffers to extend static VLIW processors to support efficient multi-threading. Replay Buffers provide zero switch-cycle thread switches as well as overhead-free exception handling (beyond the cost of the exception handler) and reasonable latency tolerance for delays. Replay Buffers allow VLIW processors to meet the needs of general-purpose applications without the complexity of dynamic VLIW. In addition to improving the capabilities and performance of VLIW processors, this technique has applications beyond VLIW processors and can also benefit all processors and systems using pipelines, particularly those using wave pipelining.
机译:本文探讨了以VLIW处理器为核心的高性能,高效率的处理器。复杂性效率是一种定性特征,它描述了一个性能尚未达到收益递减点的系统。使用本文描述的技术,简单的静态调度超长指令字(VLIW)处理器可以成为利用指令级并行性的有效架构,并可以有效满足通用计算的需求。动态执行以利用动态VLIW处理器中的指令级并行性。与以前的研究不同,本研究探索了在具有显式指令级并行性的指令流上动态执行的好处。因此,动态执行应用于编译器难以解决的问题,而不是应用于编译器容易解决的问题,从而减少了对复杂且昂贵的硬件的需求。除了提供性能结果外,我们还描述了通用处理器模型和执行定义,该模型改进了传统处理器中使用的精确执行模型。我们还将描述实现此新执行模型的模拟器。在我们的仿真中,我们改变了许多参数,从而可以提取每个参数对性能的影响。这些仿真结果表明,尽管少量的重新排序足以消除几乎所有与调度错误和等待时间变化相关的惩罚,但即使是大量的重新排序也不足以消除与分支错误预测和长存储延迟相关的惩罚。作为动态VLIW处理器的替代产品,我们开发了重播缓冲区以扩展静态VLIW处理器以支持高效的多线程。重播缓冲区提供了零切换周期线程切换以及无开销的异常处理(超出了异常处理程序的成本)以及合理的延迟延迟。重放缓冲区使VLIW处理器可以满足通用应用程序的需求,而无需复杂的动态VLIW。除了提高VLIW处理器的功能和性能之外,该技术还具有VLIW处理器以外的应用程序,并且还可使所有使用管线的处理器和系统受益,特别是那些使用波流水线技术的处理器和系统。

著录项

  • 作者

    Rudd, Kevin William.;

  • 作者单位

    Stanford University.;

  • 授予单位 Stanford University.;
  • 学科 Electrical engineering.;Computer science.
  • 学位 Ph.D.
  • 年度 2000
  • 页码 162 p.
  • 总页数 162
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号