首页> 外文期刊>Concurrency and Computation >The performance of parallel matrix algorithms on a broadcast-based architecture
【24h】

The performance of parallel matrix algorithms on a broadcast-based architecture

机译:基于广播的体系结构上并行矩阵算法的性能

获取原文
获取原文并翻译 | 示例

摘要

Due to advances in fiber-optics and very large scale integration (VLSI) technology, interconnection networks which allow multiple simultaneous broadcasts are becoming feasible. This paper summarizes one such multiprocessor architecture called the Simultaneous Optical Multiprocessor Exchange Bus (SOME-Bus). It also presents enhancements to the network interface and the cache and directory controllers which support cache block combining, capture and prefetch, and allow complete overlap of processing time with the communication time due to compulsory misses. The paper uses two fundamental matrix algorithms to characterize the impact of each enhancement on performance. Cache miss analysis and results from the execution of these programs on a SOME-Bus simulator show that block capture and prefetch combined with an effective block replacement policy succeed in significantly reducing the miss rate due to compulsory misses as the cache size increases, while a similar increase of cache size in traditional architectures leaves the miss rate due to compulsory misses unaffected.
机译:由于光纤和超大规模集成(VLSI)技术的进步,允许多个同时广播的互连网络变得可行。本文总结了一种这样的多处理器体系结构,称为同时光学多处理器交换总线(SOME-Bus)。它还提供了对网络接口以及高速缓存和目录控制器的增强功能,这些功能支持高速缓存块合并,捕获和预取,并且由于强制丢失而允许处理时间与通信时间完全重叠。本文使用两种基本矩阵算法来表征每种增强功能对性能的影响。高速缓存未命中分析和在SOME-Bus模拟器上执行这些程序的结果表明,随着高速缓存大小的增加,块捕获和预取与有效的块替换策略相结合,成功地成功降低了由于强制性未命中而导致的未命中率。传统体系结构中高速缓存大小的增加使强制丢失的丢失率不受影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号