Reducing false sharing and improving spatial locality in a unified compilation framework

Kandemir M.; Choudhary A.; Ramanujam J.; Banerjee P.

首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Reducing false sharing and improving spatial locality in a unified compilation framework

【24h】

Reducing false sharing and improving spatial locality in a unified compilation framework

机译：在统一的编译框架中减少错误共享并改善空间局部性

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The performance of applications on large shared-memory multiprocessors with coherent caches depends on the interaction between the granularity of data sharing, the size of the coherence unit, and the spatial locality exhibited by the applications, in addition to the amount of parallelism in the applications. Large coherence units are helpful in exploiting spatial locality, but worsen the effects of false sharing. A mathematical framework that allows a clean description of the relationship between spatial locality and false sharing is derived in this paper. First, a technique to identify a severe form of multiple-writer false sharing is presented. The importance of the interaction between optimization techniques aimed at enhancing locality and the techniques oriented toward reducing false sharing is then demonstrated. Given the conflicting requirements, a compiler-based approach to this problem holds promise. This paper investigates the use of data transformations in addressing spatial locality and false sharing, and derives an approach that balances the impact of the two. Experimental results demonstrate that such a balanced approach outperforms those approaches that consider only one of these two issues. On an eight-processor SGI/Cray Origin 2000 multiprocessor, our approach brings an additional 9 percent improvement over a powerful locality optimization technique that uses both loop and data transformations. The presented approach also obtains an additional 19 percent improvement over an optimization technique that is oriented specifically toward reducing false sharing. This study also reveals that, in addition to reducing synchronization costs and improving the memory subsystem performance, obtaining large granularity parallelism is helpful in balancing the effects of enhancing locality and reducing false sharing, rendering them compatible.

机译：具有相干缓存的大型共享内存多处理器上的应用程序性能取决于数据共享的粒度，相干单元的大小以及应用程序显示的空间局部性之间的相互作用，以及应用程序中的并行度。。大型相干单元有助于开发空间局部性，但会加剧错误共享的影响。本文推导了一个数学框架，该框架允许对空间局部性和虚假共享之间的关系进行清晰的描述。首先，提出了一种识别严重形式的多作者错误共享的技术。然后说明了旨在提高局部性的优化技术与旨在减少虚假共享的技术之间进行交互的重要性。考虑到相互矛盾的需求，基于编译器的方法可以解决这个问题。本文研究了数据转换在解决空间局部性和虚假共享方面的用途，并得出了一种平衡两者影响的方法。实验结果表明，这种平衡的方法优于仅考虑这两个问题之一的方法。在八处理器SGI / Cray Origin 2000多处理器上，我们的方法比使用循环和数据转换的强大的局部优化技术提高了9％。与专门针对减少错误共享的优化技术相比，本文提出的方法还获得了19％的额外改进。这项研究还表明，除了降低同步成本和提高内存子系统性能外，获得大粒度并行度还有助于平衡增强局部性和减少虚假共享（使它们兼容）的效果。

著录项

来源
《IEEE Transactions on Parallel and Distributed Systems》 |2003年第4期|p.337-354|共18页
作者
Kandemir M.; Choudhary A.; Ramanujam J.; Banerjee P.;
展开▼
作者单位

Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
shared memory systems; synchronisation; program compilers; cache storage; program control structures; parallel programming; large shared-memory multiprocessors; coherent caches; data sharing granularity; coherence unit size; spatial locality; paralle;

机译：共享内存系统;同步;程序编译器;缓存存储;程序控制结构;并行编程大型共享内存多处理器;相干缓存数据共享粒度;相干单位大小;空间局部性并列;

相似文献

外文文献
中文文献
专利

1. False sharing and spatial locality in multiprocessor caches [J] . Torrellas J., Lam H.S. IEEE Transactions on Computers . 1994,第6期

机译：多处理器缓存中的虚假共享和空间局部性
2. The following statement is a compilation of thoughts from all authors, prompted by a request from DS and GQ for a unifying framework (if any) for all three viewpoints [J] . Delgado Mauricio R., Beer Jennifer S., Fellows Lesley K., Nature neuroscience . 2016,第12期

机译：以下陈述是DS和GQ提出的针对所有三个观点的统一框架（如果有）的要求，是所有作者思想的汇编
3. Shared Nearest Neighbor Clustering in a Locality Sensitive Hashing Framework [J] . Sawsan Kanj, Thomas Brüls, Stéphane Gazut Journal of computational biology . 2018,第2期

机译：局部敏感哈希框架中的共享最近邻居聚类
4. On reducing false sharing while improving locality on shared memory multiprocessors [C] . Kandemir, M., Choudhary, . 1999

机译：在减少错误共享的同时提高共享内存多处理器的局部性
5. A Unified Semiotics Framework for Spatial and Non-Spatial Brain Network Data Visualizations [D] . Zhang, Guohao. 2017

机译：用于空间和非空间脑网络数据可视化的统一符号学框架
6. Unified Framework for Robust Estimation of Brain Networks From fMRI Using Temporal and Spatial Correlation Analyses [O] . Yongmei Michelle Wang, Jing Xia -1

机译：使用时空相关分析从fMRI可靠估计脑网络的统一框架
7. Reducing False Sharing and Improving Spatial Locality in a Unified Compilation Framework [O] . Mahmut Kandemir, Alok Choudhary, J. Ramanujam, 2003

机译：减少虚假共享并改善统一编译框架中的空间位置

Reducing false sharing and improving spatial locality in a unified compilation framework

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅