VLIW processors: Efficiently exploiting instruction level parallelism.

机译：VLIW处理器：有效利用指令级并行性。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This dissertation explores high-performance complexity-efficient processors focusing on VLIW processors. Complexity efficiency is a qualitative characteristic that describes a system where performance has not reached the point of diminishing returns. Using the techniques described in this dissertation, simple statically-scheduled very-long-instruction-word (VLIW) processors can be efficient architectures for exploiting instruction-level parallelism and can effectively address the needs of general purpose computing.;We studied the ability of dynamic execution to exploit instruction-level parallelism in dynamic VLIW processors. Unlike previous studies, this study explores the benefits of dynamic execution on an instruction stream with explicit instruction-level parallelism. Dynamic execution is thus applied to problems that compilers have difficulty solving rather than to those problems that compilers readily solve reducing the need for complex and costly hardware. In addition to presenting performance results, we also describe a general processor model and execution definition that improves upon the precise execution model used in traditional processors; we also describe the simulator that implements this new execution model. In our simulations we varied a number of parameters allowing extraction of the individual effects of each parameter on performance. These simulation results show that although a small amount of reordering is adequate to eliminate almost all penalties associated with scheduling errors and latency variations, even a significant amount of reordering is inadequate to eliminate the penalty associated with branch mispredictions, and long memory latencies.;As an alternative to dynamic VLIW processors, we developed Replay Buffers to extend static VLIW processors to support efficient multi-threading. Replay Buffers provide zero switch-cycle thread switches as well as overhead-free exception handling (beyond the cost of the exception handler) and reasonable latency tolerance for delays. Replay Buffers allow VLIW processors to meet the needs of general-purpose applications without the complexity of dynamic VLIW. In addition to improving the capabilities and performance of VLIW processors, this technique has applications beyond VLIW processors and can also benefit all processors and systems using pipelines, particularly those using wave pipelining.

机译：本文探讨了以VLIW处理器为核心的高性能，高效率的处理器。复杂性效率是一种定性特征，它描述了一个性能尚未达到收益递减点的系统。使用本文描述的技术，简单的静态调度超长指令字（VLIW）处理器可以成为利用指令级并行性的有效架构，并可以有效满足通用计算的需求。动态执行以利用动态VLIW处理器中的指令级并行性。与以前的研究不同，本研究探索了在具有显式指令级并行性的指令流上动态执行的好处。因此，动态执行应用于编译器难以解决的问题，而不是应用于编译器容易解决的问题，从而减少了对复杂且昂贵的硬件的需求。除了提供性能结果外，我们还描述了通用处理器模型和执行定义，该模型改进了传统处理器中使用的精确执行模型。我们还将描述实现此新执行模型的模拟器。在我们的仿真中，我们改变了许多参数，从而可以提取每个参数对性能的影响。这些仿真结果表明，尽管少量的重新排序足以消除几乎所有与调度错误和等待时间变化相关的惩罚，但即使是大量的重新排序也不足以消除与分支错误预测和长存储延迟相关的惩罚。作为动态VLIW处理器的替代产品，我们开发了重播缓冲区以扩展静态VLIW处理器以支持高效的多线程。重播缓冲区提供了零切换周期线程切换以及无开销的异常处理（超出了异常处理程序的成本）以及合理的延迟延迟。重放缓冲区使VLIW处理器可以满足通用应用程序的需求，而无需复杂的动态VLIW。除了提高VLIW处理器的功能和性能之外，该技术还具有VLIW处理器以外的应用程序，并且还可使所有使用管线的处理器和系统受益，特别是那些使用波流水线技术的处理器和系统。

著录项

作者
Rudd, Kevin William.;
展开▼
作者单位

Stanford University.;

展开▼
授予单位 Stanford University.;
学科 Electrical engineering.;Computer science.
学位 Ph.D.
年度 2000
页码 162 p.
总页数 162
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. A VLIW processor using both thread-level and instruction-level parallelism [J] . Hiroyuki Shimajiri, Takeo Yoshida 電子情報通信学会技術研究報告. コンピュ-タシステム. Computer Systems . 2000,第86期

机译：使用线程级和指令级并行性的VLIW处理器
2. Construction of a multi-threaded VLIW processor using both instruction-level and thread-level parallelism [J] . Hiroyuki Shimajiri, Takeo Yoshida 電子情報通信学会技術研究報告. コンピュ-タシステム. Computer Systems . 2000,第248期

机译：使用指令级和线程平行度构建多线程VLIW处理器
3. Construction of a multi-threaded VLIW processor using both instruction-level and thread-level parallelism [J] . Hiroyuki Shimajiri, Takeo Yoshida 電子情報通信学会技術研究報告. コンピュ-タシステム. Computer Systems . 2000,第248期

机译：使用指令级和线程平行度构建多线程VLIW处理器
4. Exploiting conditional instructions in code generation for embedded VLIW processors [C] . Rainer Leupers, PRainer Leupers Proceedings of the Third Workshop on Virtual Machines and Intermediate Languages . 2009

机译：在嵌入式VLIW处理器的代码生成中利用条件指令
5. Branch optimizations and instruction-level parallelism exploitation for dynamic superscalar and VLIW processors. [D] . Mantripragada, Srinivas. 2000

机译：动态超标量和VLIW处理器的分支优化和指令级并行性开发。
6. Exploiting Thread-Level and Instruction-Level Parallelism to Cluster Mass Spectrometry Data using Multicore Architectures [O] . Fahad Saeed, Jason D. Hoffert, Trairak Pisitkun, -1

机译：利用多核体系结构利用线程级和指令级并行性对质谱数据进行聚类
7. Exploiting Conditional Instructions in Code Generation for Embedded VLIW Processors [O] . 2008

机译：在嵌入式VLIW处理器的代码生成中利用条件指令

VLIW processors: Efficiently exploiting instruction level parallelism.

摘要

著录项

相似文献

相关主题

期刊订阅