首页> 外文会议>2011 Sixth IEEE International Conference on Networking, Architecture and Storage >A Novel Highly Scalable Architecture with Partially Distributed Pipeline and Hardware/Software Instruction Encoding
【24h】

A Novel Highly Scalable Architecture with Partially Distributed Pipeline and Hardware/Software Instruction Encoding

机译:一种具有部分分布式管道和硬件/软件指令编码的新型高度可扩展架构

获取原文

摘要

The partitioning resources like pipelines and register files among clusters is proved to be an effective way to improve performance and scalability. However, improvement in scalability is limited by traditional instruction encoding schemes that quickly run out of bits in fixed-length instruction words to encode multiple register operands. Meanwhile, clustered processors may come at a cost of performance degradation, the major cause of which is the limited data locality arising from the lack of available registers and functional units. This paper introduces a highly scalable clustered architecture (HiSCA) to improve the scalability and performance of clustered processors. The pipeline of HiSCA provides high performance through in-order issuing, out-of-order execution and parallel but in-order commitment, while releasing instruction issuing from the heavy burden of dynamic scheduling. The hardware/software instruction encoding scheme of HiSCA splits instruction stream into chains of instructions (packs), and provides common information of instructions in the same packs in dedicated instruction words, thus reducing the total amount of information encoded in the instructions within the packs. HiSCA scales efficiently to 32 clusters with 1024 general purpose registers. Experiment results show that, for a 4-cluster and 8-issue configuration, HiSCA can achieve a 4.6% improvement in frequency with minimal hardware overhead, and an average of 13.3% performance speedup at the cost of 1.9% overhead to code size, compared with a traditional clustered processor with nearly the same hardware complexity.
机译:事实证明,在集群之间对诸如管道和寄存器文件之类的资源进行分区是提高性能和可伸缩性的有效方法。但是,可伸缩性的提高受到传统指令编码方案的限制,该传统指令编码方案会很快用完固定长度指令字中的位以对多个寄存器操作数进行编码。同时,群集处理器可能会以性能下降为代价,其主要原因是由于缺少可用的寄存器和功能单元而导致的数据局部性有限。本文介绍了一种高度可扩展的集群体系结构(HiSCA),以改善集群处理器的可扩展性和性能。 HiSCA的流水线通过有序发布,无序执行和并行但有序的承诺提供了高性能,同时释放了由动态调度带来的沉重负担的指令发布。 HiSCA的硬件/软件指令编码方案将指令流分成指令链(数据包),并以专用指令字的形式在同一数据包中提供指令的公共信息,从而减少了在数据包内的指令中编码的信息总量。 HiSCA可有效扩展到具有1024个通用寄存器的32个群集。实验结果表明,与4个集群和8个问题的配置相比,HiSCA可以在不增加硬件开销的情况下将频率提高4.6%,并以1.9%的代码大小开销为代价,平均提高13.3%的性能。传统的集群处理器具有几乎相同的硬件复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号