The performance advantages of integrating block data transfer in cache-coherent multiprocessors

机译：在缓存一致性多处理器中集成块数据传输的性能优势

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Integrating support for block data transfer has become an important emphasis in recent cache-coherent shared address space multiprocessors. This paper examines the potential performance benefits of adding this support. A set of ambitious hardware mechanisms is used to study performance gains in five important scientific computations that appear to be good candidates for using block transfer. Our conclusion is that the benefits of block transfer are not substantial for hardware cache-coherent multiprocessors. The main reasons for this are (i) the relatively modest fraction of time applications spend in communication amenable to block transfer, (ii) the difficulty of finding enough independent computation to overlap with the communication latency that remains after block transfer, and (iii) long cache lines often capture many of the benefits of block transfer in efficient cache-coherent machines. In the cases where block transfer improves performance, prefetching can often provide comparable, ifnot superior, performance benefits. We also examine the impact of varying important communication parameters and processor speed on the effectiveness of block transfer, and comment on useful features that a block transfer facility should support for real applications.

机译：在最近的缓存一致的共享地址空间多处理器中，对块数据传输的集成支持已成为重要的重点。本文研究了添加此支持的潜在性能优势。一组雄心勃勃的硬件机制用于研究五项重要的科学计算中的性能提升，这些计算似乎是使用块传输的理想选择。我们的结论是，对于硬件高速缓存一致的多处理器而言，块传输的好处并不重要。造成这种情况的主要原因是：（i）应用程序在通信中花费的时间相对较少，适合进行块传输;（ii）难以找到足够的独立计算来与块传输后剩余的通信等待时间重叠;以及（iii）高速缓存行通常会在高效的高速缓存一致性计算机中捕获块传输的许多好处。在块传输提高性能的情况下，预取通常可以提供相当的性能优势，即使不是更好。我们还研究了重要的通信参数和处理器速度的变化对块传输有效性的影响，并评论了块传输工具应支持实际应用的有用功能。

著录项

来源
《International conference on Architectural support for programming languages and operating systems》|1994年|P.219-229|共11页
会议地点
作者
Steven Cameron Woo; Jaswinder Pal Singh; John L. Hennessy; PJaswinder Pal Singh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序语言、算法语言;
关键词

相似文献

外文文献
中文文献
专利

1. HIGH PERFORMANCE FFT ALGORITHMS FOR CACHE-COHERENT MULTIPROCESSORS [J] . Kevin R. Wadleigh International Journal of High Performance Computing Applications . 1999,第2期

机译：高速缓存相干多处理器的高性能FFT算法
2. LIGERO: A Light but Efficient Router Conceived for Cache-Coherent Chip Multiprocessors [J] . PABLO ABAD, VALENTIN PUENTE, JOSE-ANGEL GREGORIO ACM Transactions on Architecture and Code Optimization . 2012,第4期

机译：LIGERO：一种用于高速缓存一致性芯片多处理器的轻便高效路由器
3. CCNoC： Cache-Coherent Network on Chip for Chip Multiprocessors [J] . 王惊雷, 薛一波, Member CCF IEEE, 计算机科学技术学报：英文版 . 2010,第002期

机译：CCNoC：用于芯片多处理器的高速缓存一致性片上网络
4. Integrating Non-blocking Synchronisation in Parallel Applications: Performance Advantages and Methodologies [C] . Philippas Tsigas, Yi Zhang Third International Workshop on Software and Performance (WOSP2002), Jul 24-26, 2002, Rome, Italy . 2002

机译：在并行应用程序中集成无阻塞同步：性能优势和方法
5. Memory latency evaluation in cluster-based cache-coherent multiprocessor systems with different interconnection topologies. [D] . Asaduzzaman, Abu Sadath Mohammad. 1997

机译：具有不同互连拓扑的基于群集的缓存一致性多处理器系统中的内存延迟评估。
6. Advantages of IoT-Based Geotechnical Monitoring Systems Integrating Automatic Procedures for Data Acquisition and Elaboration [O] . Andrea Carri, Alessandro Valletta, Edoardo Cavalca, 2021

机译：基于IOT的岩土监测系统的优点集成了数据采集和阐述的自动程序
7. The Performance Advantages of Integrating Block Data Transfer in Cache-Coherent Multiprocessors [O] . Steven Cameron Woo, Albert Macovski 1996

机译：在缓存一致性多处理器中集成块数据传输的性能优势

The performance advantages of integrating block data transfer in cache-coherent multiprocessors

摘要

著录项

相似文献

相关主题

期刊订阅