首页> 外文会议>International Conference on Parallel and Distributed Processing Techniques and Applications(PDPTA'04) v.1; 20040621-20040624; Las Vegas,NV; US >The Microarchitecture of the CUE-v2 Processor: Enabling the Simultaneous Processing of Dataflow and Control-Flow Threads
【24h】

The Microarchitecture of the CUE-v2 Processor: Enabling the Simultaneous Processing of Dataflow and Control-Flow Threads

机译:CUE-v2处理器的微体系结构:启用数据流和控制流线程的同时处理

获取原文
获取原文并翻译 | 示例

摘要

The authors have been developing the CUE-v2 microprocessor performing both as dataflow and as superscalar using common pipeline resource. The CUE-v2 can simultaneously process two orthogonal types of threads, i.e., dataflow and control-flow threads, at instruction level. Its instruction fetch unit basically gives priority to dataflow thread(s), while it guarantees the minimal instruction issue opppotu-nity of a control-flow thread in order to avoid blocking. That is, it basically allocates empty slots, caused by the execution of dataflow threads, to a control-flow thread. By this means, the CUE-v2 retains the advantages of dataflow architectures, e.g., fair multiprocessing with no context switching overheads, and achieves the speedup of sequential processing by combining a superscalar execution mode. The basic design of the CUE-v2 seems to have many similar components to that of superscalar machines. However, an instruction fetch unit and the scheme of dynamic out-of-order scheduling differ considerably due to the simultaneous instruction fetch/execution of two kinds of threads. This paper first describes the architecrural design, and then quantifies its hardware amount and its design complexity using a standard cell library for a TSMC 0.18 μm process. Consequently, we found that the CUE-v2 can perform the simultaneous processing of dataflow and control-flow threads with less than 10% increase of additional hardware compared to a superscalar processor.
机译:作者一直在开发CUE-v2微处理器,它使用通用管道资源既可作为数据流又可作为超标量。 CUE-v2可以在指令级别同时处理两种正交类型的线程,即数据流和控制流线程。它的指令获取单元基本上将优先级给予数据流线程,同时它保证控制​​流线程的最小指令发布对等性以避免阻塞。也就是说,它基本上将由于执行数据流线程而导致的空插槽分配给控制流线程。通过这种方式,CUE-v2保留了数据流架构的优势,例如没有上下文切换开销的公平多处理,并通过结合超标量执行模式来实现顺序处理的加速。 CUE-v2的基本设计似乎具有许多与超标量机器相似的组件。然而,由于两种线程的同时指令获取/执行,指令获取单元和动态无序调度方案存在很大差异。本文首先描述了架构设计,然后使用针对台积电0.18μm工艺的标准单元库来量化其硬件数量和设计复杂性。因此,我们发现与超标量处理器相比,CUE-v2可以同时处理数据流和控制流线程,而额外硬件的增加不到10%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号