...
首页> 外文期刊>Journal of supercomputing >Design space exploration of hardware task superscalar architecture
【24h】

Design space exploration of hardware task superscalar architecture

机译:硬件任务超标量架构的设计空间探索

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

For current high performance computing systems, exploiting concurrency is a serious and important challenge. Recently, several dynamic software task management mechanisms have been proposed. In particular, task-based dataflow programming models which benefit from dataflow principles to improve task-level parallelism and overcome the limitations of static task management systems. However, these programming models rely on software-based dependency analysis, which are performed inherently slowly; and this limits their scalability specially when there is fine-grained task granularity and a large amount of tasks. Moreover, task scheduling in software introduces overheads, and so becomes increasingly inefficient with the number of cores. In contrast, a hardware scheduling solution, like Task SuperScalar (TSS), can achieve greater values of speed-up because a hardware task scheduler requires fewer cycles than the software version to dispatch a task. TSS combines the effectiveness of Out-of-Order processors together with the task abstraction. It has been implemented in software with limited parallelism and high memory consumption due to the nature of the software implementation. Hardware Task Superscalar (HTSS) is proposed to solve these drawbacks. HTSS is designed to be integrated in a future high performance computer with the ability to exploit fine-grained task parallelism. In this article, a deep latency and design space exploration of HTSS is described. For design space exploration, we have designed a full cycle-accurate simulator of HTSS, called SimTSS. The simulator has been tuned based on latency exploration of HTSS components resulted from VHDL description of each component. As the result of this exploration, we have found the number of components and memory capacity of HTSS for HPC systems.
机译:对于当前的高性能计算系统,利用并发是一个严重而重要的挑战。最近,已经提出了几种动态软件任务管理机制。特别是,基于任务的数据流编程模型可从数据流原理中受益,以改善任务级并行性并克服静态任务管理系统的局限性。但是,这些编程模型依赖于基于软件的依赖性分析,这种分析固有地执行缓慢。并且这特别限制了它们的可伸缩性,尤其是当任务的粒度很细且任务很多时。此外,软件中的任务调度会带来开销,因此随着内核数量的增加,效率变得越来越低。相反,诸如Task SuperScalar(TSS)之类的硬件调度解决方案可以实现更大的加速值,因为与软件版本相比,硬件任务调度程序需要更少的周期来调度任务。 TSS结合了无序处理器的有效性和任务抽象。由于软件实现的性质,它已在并行性有限且内存消耗较高的软件中实现。硬件任务超标量(HTSS)被提出来解决这些缺点。 HTSS被设计为集成在未来的高性能计算机中,能够利用细粒度的任务并行性。本文介绍了HTSS的深时延和设计空间探索。为了进行设计空间探索,我们设计了一个称为HTSS的全周期精确模拟器,称为SimTSS。该模拟器已根据对每个组件的VHDL描述导致的HTSS组件的延迟探索进行了调整。作为探索的结果,我们发现了用于HPC系统的HTSS的组件数量和存储容量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号