首页> 外文会议>2004 computing frontier conference >A First Glance at Kilo-instruction Based Multiprocessors
【24h】

A First Glance at Kilo-instruction Based Multiprocessors

机译:基于Kilo指令的多处理器的第一印象

获取原文
获取原文并翻译 | 示例

摘要

The ever increasing gap between processor and memory speed,sometimes referred to as the Memory Wall prob- lem[42],has a very negative impact on performance.This mismatch will be more severe in future processor’s gener- ation.Modern cache organizations and prefetching tech- niques will not be able to solve this problem.A very novel and promising technique to deal with the Memory Wall con- sists on designing processors able to maintain thousands of in-flight instructions.An example of this kind of processors has been denoted as Kilo-instruction processors[8].When running numerical applications,Kilo-instruction processors have demonstrated its ability to effectively maintain high values of IPC while increasing memory latencies. rnIn this paper,we will study for the first time,the influence of Kilo-instruction processors on the performance of small-scale CC-NUMA multiprocessors.Our first results, using an ideal network,show the enormous potential of the Kilo-instruction processors,when using them as comput- ing nodes,not only for hiding local DRAM latencies but also for the remote ones.A deeper analysis,using real- istic networks,reveals the existence of heavy demands on packet throughput required by each node,since larger re- order butters translate on higher density of remote accesses. Next,we show that current interconnection networks can- not cope with this high traffic levels,so newer and faster networks have to be designed.In short,our results show dramatic performance gains over multiprocessors based on current microprocessors and dictate a possible way to build future shared-memory multiprocessor systems.
机译:处理器和内存速度之间不断扩大的差距(有时称为“内存墙问题” [42])对性能产生非常负面的影响。这种不匹配在将来的处理器一代中将更加严重。现代缓存组织和预取这项技术将无法解决这个问题。一种非常新颖且很有前途的技术来处理内存墙,包括设计能够维护数千条飞行指令的处理器。作为Kilo指令处理器[8]。在运行数字应用程序时,Kilo指令处理器已证明其能够有效维持IPC的高值,同时增加内存延迟的能力。在本文中,我们将首次研究Kilo指令处理器对小型CC-NUMA多处理器性能的影响。我们的第一个结果是,使用理想网络展示了Kilo指令处理器的巨大潜力,当将它们用作计算节点时,不仅用于隐藏本地DRAM延迟,而且还用于远程节点。更深入的分析(使用实际网络)揭示了每个节点对分组吞吐量的强烈要求,因为它更大重新订购黄油可以实现更高密度的远程访问。接下来,我们表明当前的互连网络无法应付如此高的流量水平,因此必须设计更新更快的网络。总之,我们的结果表明,基于当前微处理器的多处理器性能显着提高,并指出了一种可能的构建方法未来的共享内存多处理器系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号