首页> 外文会议> >The performance of runtime data cache prefetching in a dynamic optimization system

【24h】

The performance of runtime data cache prefetching in a dynamic optimization system

机译：动态优化系统中运行时数据高速缓存预取的性能

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Traditional software controlled data cache prefetching is often ineffective due to the lack of runtime cache miss and miss address information. To overcome this limitation, we implement runtime data cache prefetching in the dynamic optimization system ADORE (ADaptive Object code Reoptimization). Its performance has been compared with static software prefetching on the SPEC2000 benchmark suite. Runtime cache prefetching shows better performance. On an Itanium 2 based Linux workstation, it can increase performance by more than 20% over static prefetching on some benchmarks. For benchmarks that do not benefit from prefetching, the runtime optimization system adds only 1%-2% overhead. We have also collected cache miss profiles to guide static data cache prefetching in the ORC compiler. With that information the compiler can effectively avoid generating prefetches for loops that hit well in the data cache.

机译：由于缺少运行时高速缓存未命中和未命中地址信息，传统的软件控制的数据高速缓存预取通常无效。为了克服此限制，我们在动态优化系统ADORE（自适应目标代码重新优化）中实现了运行时数据缓存预取。它的性能已与SPEC2000基准测试套件上的静态软件预取进行了比较。运行时缓存预取显示了更好的性能。在基于Itanium 2的Linux工作站上，与某些基准测试上的静态预取相比，它可以将性能提高20％以上。对于无法从预取中受益的基准，运行时优化系统仅增加1％-2％的开销。我们还收集了高速缓存未命中配置文件，以指导在ORC编译器中进行静态数据高速缓存预取。利用这些信息，编译器可以有效地避免为在数据高速缓存中命中的循环生成预取。

著录项

来源
《》|2003年|p.180-190|共11页
会议地点
作者
Jiwei Lu; Chen; H.; Rao Fu; Wei-Chung Hsu; Othmer; B.; Pen-Chung Yew; Dong-Yuan Chen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词
storage management; program control structures; cache storage; optimising compilers; runtime data cache prefetching; dynamic optimization system; software controlled data cache prefetching; runtime cache miss; miss address information; ADORE; adaptive ob;

机译：存储管理;程序控制结构;高速缓存存储;优化编译器;运行时数据高速缓存预取;动态优化系统;软件控制的数据高速缓存预取;运行时高速缓存未命中;缺少地址信息; ADORE;自适应对象;

相似文献

外文文献
中文文献
专利

1. Prefetching J+-Tree: A Cache-Optimized Main Memory Database Index Structure [J] . Hua Luan, Xiao-Yong Du, Sha Wang 计算机科学技术学报（英文版） . 2009,第004期

机译：预取J +-树：缓存优化的主内存数据库索引结构
2. Prefetch-aware fingerprint cache management for data deduplication systems [J] . Li Mei, Zhang Hongjun, Wu Yanjun, Frontiers of computer science in China . 2019,第3期

机译：重复数据删除系统的预取感知指纹缓存管理
3. Prefetch-aware fingerprint cache management for data deduplication systems [J] . Li Mei, Zhang Hongjun, Wu Yanjun, Frontiers of computer science . 2019,第3期

机译：用于数据重复数据删除系统的预取感人指纹缓存管理
4. The Performance of Runtime Data Cache Prefetching in a Dynamic Optimization System [C] . Jiwei Lu, Howard Chen, Rao Fu, Annual IEEE/ACM International Symposium on Microarchitecture . 2003

机译：动态优化系统中运行时数据缓存预取的性能
5. Dynamic Data Prefetching and Layout Optimizations for High Performance Heterogeneous Data Access. [D] . Tang, Houjun. 2016

机译：用于高性能异构数据访问的动态数据预取和布局优化。
6. Strategies of data layout and cache writing for input-output optimization in high performance scientific computing: Applications to the forward electrocardiographic problem [O] . Louie Cardone-Noott, Blanca Rodriguez, Alfonso Bueno-Orovio 2012

机译：高性能科学计算中输入输出优化的数据布局和高速缓存写入策略：应用于正向心电图问题
7. The Performance of Runtime Data Cache Prefetching in a Dynamic Optimization System [O] . Jiwei Lu, Howard Chen, Rao Fu, 2003

机译：动态优化系统中运行时数据缓存预取的性能

The performance of runtime data cache prefetching in a dynamic optimization system

摘要

著录项

相似文献

相关主题

期刊订阅