首页> 外文会议>2012 IEEE/ACM International Conference on Computer-Aided Design : Digest of Technical Papers. >Improving last level cache locality by integrating loop and data transformations

【24h】

Improving last level cache locality by integrating loop and data transformations

机译：通过集成循环和数据转换来改善上一级缓存的局部性

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Motivated by the observation that most existing data locality optimizations do not specifically target shared last-level caches of emerging multicores and that even multicore-specific locality-oriented techniques employ either loop or data layout optimizations but not both, in this paper we present an integrated loop and data layout optimization strategy, with the goal of improving the last-level cache performance of multicores that execute multithreaded applications. We present a detailed mathematical formulation of our locality optimization strategy and present experimental data from our current implementation. Our results, collected using 14 application programs, clearly show that the proposed integrated approach is very successful in practice, and outperforms both pure loop optimization and pure data layout optimization based alternatives. Our results also indicate that the savings achieved increase with increased core count and larger data set sizes.

机译：出于以下观察的动机：大多数现有数据局部性优化并不专门针对新兴多核的共享最后一级缓存，甚至针对特定于多核的局部性技术都采用了循环或数据布局优化，但没有同时采用这两种方法，在本文中，我们提出了一种集成循环和数据布局优化策略，旨在提高执行多线程应用程序的多核的最后一级缓存性能。我们提供了我们的地理位置优化策略的详细数学公式，并提供了来自当前实施情况的实验数据。我们使用14个应用程序收集的结果清楚地表明，所提出的集成方法在实践中非常成功，并且优于基于纯循环优化和基于纯数据布局优化的替代方案。我们的结果还表明，随着内核数量的增加和数据集大小的增加，节省的费用也增加了。

著录项

来源
《2012 IEEE/ACM International Conference on Computer-Aided Design : Digest of Technical Papers. 》|2012年|p.65- 72|共8页
会议地点 San Jose CA(US);San Jose CA(US)
作者
Ding Wei; Kandemir Mahmut;
展开▼
作者单位

The Pennsylvania State University, University, Park;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类 TP391.72;TP391.72;
关键词

相似文献

外文文献
中文文献
专利

1. Improving cache locality by a combination of loop and data transformations [J] . Kandemir M., Ramanujam J. IEEE Transactions on Computers . 1999 ,第2期

机译：通过循环和数据转换的组合来改善缓存的局部性
2. Obtaining Affine Transformations to Improve Locality of Loop Nests [J] . N. A. Likhoded, S. V. Bakhanovich, A. V. Zherelo Programming and Computer Software . 2005 ,第5期

机译：获取仿射变换以改善循环嵌套的局部性
3. Locality-aware data replication in the last-level cache for large scale multicores [J] . Hijaz Farrukh, Shi Qingchuan, Kurian George, Journal of supercomputing . 2016 ,第2期

机译：大型多核的最后一级缓存中的本地感知数据复制
4. Improving last level cache locality by integrating loop and data transformations [C] . Ding Wei, Kandemir Mahmut IEEE/ACM International Conference on Computer-Aided Design . 2012

机译：通过集成循环和数据转换来提高最后一级缓存局部
5. Improving cache locality for thread-level speculation systems. [D] . Fung, Stanley Lap Chiu. 2005

机译：改善线程级推测系统的缓存局部性。
6. Improving the prediction of organism-level toxicity through integration of chemical protein target and cytotoxicity qHTS data [O] . Chad H. G. Allen, Alexios Koutsoukas, Isidro Cortés-Ciriano, 2016

机译：通过整合化学蛋白质靶标和细胞毒性qHTS数据改善对生物体毒性的预测
7. Improving Cache Locality by a Combination of Loop and Data Transformations [O] . Mahmut K, Student Member, J. Ramanujam, 2014

机译：通过循环和数据转换的组合改善缓存局部性

Improving last level cache locality by integrating loop and data transformations

摘要

著录项

相似文献

相关主题

期刊订阅