首页> 外文会议>International conference on computational science >Efficient Characterization of Hidden Processor Memory Hierarchies
【24h】

Efficient Characterization of Hidden Processor Memory Hierarchies

机译:隐藏处理器内存层次结构的有效表征

获取原文
获取外文期刊封面目录资料

摘要

A processor's memory hierarchy has a major impact on the performance of running code. However, computing platforms, where the actual hardware characteristics are hidden from both the end user and the tools that mediate execution, such as a compiler, a JIT and a runtime system, are used more and more, for example, performing large scale computation in cloud and cluster. Even worse, in such environments, a single computation may use a collection of processors with dissimilar characteristics. Ignorance of the performance-critical parameters of the underlying system makes it difficult to improve performance by optimizing the code or adjusting runtime-system behaviors; it also makes application performance harder to understand. To address this problem, we have developed a suite of portable tools that can efficiently derive many of the parameters of processor memory hierarchies, such as levels, effective capacity and latency of caches and TLBs, in a matter of seconds. The tools use a series of carefully considered experiments to produce and analyze cache response curves automatically. The tools are inexpensive enough to be used in a variety of contexts that may include install time, compile time or runtime adaption, or performance understanding tools.
机译:处理器的内存层次结构对运行代码的性能产生了重大影响。然而,从最终用户隐藏实际硬件特性的计算平台和中调解执行的工具,例如编译器,JIT和运行时系统,例如,执行大规模计算云和群集。更糟糕的是,在这种环境中,单个计算可以使用具有不同特征的处理器集合。无知底层系统的性能关键参数使得难以优化代码或调整运行时系统行为来提高性能;它还使应用程序表现难以理解。为了解决这个问题,我们开发了一套便携式工具,可以有效地推导出处理器内存层次结构的许多参数,例如秒的级别,高速缓存和TLB的级别,有效容量和延迟。该工具使用一系列仔细考虑的实验来自动生产和分析缓存响应曲线。该工具足够便宜,可用于各种上下文,可以包括安装时间,编译时间或运行时自适应或性能理解工具。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号