首页> 外文会议>High-Performance Computing, 1997. Proceedings. Fourth International Conference on >Simultaneous multithreaded vector architecture: merging ILP and DLPfor high performance
【24h】

Simultaneous multithreaded vector architecture: merging ILP and DLPfor high performance

机译:同时多线程矢量体系结构:合并ILP和DLP高性能

获取原文

摘要

Shows that instruction-level parallelism (ILP) and data-levelparallelism (DLP) can be merged in a single simultaneous vectormultithreaded architecture to execute regular vectorizable code at aperformance level that cannot be achieved using either paradigm on itsown. We show that the combination of the two techniques yields very highperformance at a low cost and a low complexity. We show that thisarchitecture achieves a sustained performance on numerical regular codesthat is 20 times the performance that can be achieved with today'ssuperscalar microprocessors. Moreover, we show that the architecture cantolerate very large memory latencies, of up to a 100 cycles, with arelatively small performance degradation. This high performance isindependent of working set size or of locality considerations, since theDLP paradigm allows very efficient exploitation of a high-performanceflat memory bandwidth
机译:显示指令级并行性(ILP)和数据级 并行性(DLP)可以合并在单个同时矢量中 多线程体系结构可在一个位置执行常规的可矢量化代码 在其范本上使用任何一种范式都无法达到的性能水平 自己的。我们证明了两种技术的结合产生了很高的收益 低成本和低复杂度的性能。我们证明了这一点 架构在数字规则代码上实现了持续的性能 这是今天的性能所能达到的20倍 超标量微处理器。此外,我们证明了该架构可以 可以承受高达100个周期的超大内存延迟, 相对较小的性能下降。这种高性能是 不受工作集大小或位置考虑因素的影响,因为 DLP范式允许非常高效地利用高性能 固定内存带宽

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号