首页> 外文学位 >Chip multiprocessors with on-chip aggregate function network .
【24h】

Chip multiprocessors with on-chip aggregate function network .

机译:具有片上聚合功能网络的片式多处理器。

获取原文
获取原文并翻译 | 示例

摘要

State-of-the-art on-chip networks and block-based cache coherence protocols used in cache-coherent shared-memory Chip MultiProcessors (CMPs) are inefficient for collective operations across cores. Performance of CMPs can be seriously degraded by the multitude of memory requests and coherence messages required to implement each collective operation. This thesis presents a CMP-AFN architecture and Instruction Set Architecture (ISA) extensions that augment a conventional shared-memory CMP with a tightly-integrated Aggregate Function Network (AFN) that implements low-latency collective operations without using or interfering with the memory hierarchy. For a modest increase in circuit complexity, traffic within a CMP's internal network is dramatically reduced, improving the performance of caches and reducing power consumption. Full system simulations of 16-core CMPs show a CMP-AFN outperforms the reference design significantly, eliminating up to 52% of memory accesses and up to 73% of private L1 data cache misses in both the EPCC OpenMP microbenchmarks and SPEC OMP benchmarks.
机译:高速缓存一致性共享内存芯片多处理器(CMP)中使用的最新的片上网络和基于块的高速缓存一致性协议对于跨内核的集体操作效率低下。实施每个集合操作所需的大量内存请求和一致性消息可能会严重降低CMP的性能。本文提出了一种CMP-AFN体系结构和指令集体系结构(ISA)扩展,它们通过紧密集成的聚合功能网络(AFN)增强了传统的共享内存CMP,该功能网络实现了低延迟的集体操作,而无需使用或干扰内存层次结构。为了适度增加电路复杂度,可以大大减少CMP内部网络中的通信量,从而提高了缓存的性能并降低了功耗。对16核CMP的完整系统仿真表明,CMP-AFN明显优于参考设计,在EPCC OpenMP微基准和SPEC OMP基准测试中,消除了多达52%的内存访问和多达73%的私有L1数据高速缓存未命中。

著录项

  • 作者

    Kim, Soohong Peter.;

  • 作者单位

    Purdue University.;

  • 授予单位 Purdue University.;
  • 学科 Engineering Electronics and Electrical.Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 124 p.
  • 总页数 124
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号