首页> 外国专利> Interrupt-based hardware support for profiling memory system performance

Interrupt-based hardware support for profiling memory system performance

机译:基于中断的硬件支持,可用于分析内存系统性能

摘要

Fueled by higher clock rates and superscalar technologies, growth in processor speed continues to outpace improvement in memory system performance. Reflecting this trend, architects are developing increasingly complex memory hierarchies to mask the speed gap, compiler writers are adding locality enhancing transformations to better utilize complex memory hierarchies, and applications programmers are re-coding their algorithms to exploit memory systems. All of these groups need empirical data on memory behavior to guide their optimizations. This paper describes how to combine simple hardware support and sampling techniques to obtain such data without appreciably perturbing system performance. By augmenting a cache miss counter with a compare register and interrupt line such that the processor is interrupted when the counter matches the compare value, we can sample system state and develop cache miss profiles that associate cache misses with specific processes, procedures, call stacks, addresses, or user defined aspects of system state. This idea is implemented in the Mprof prototype that profiles data stall cycles, first level cache misses, and second level misses on the sun Sparc 10/41. Simple case studies are provided to illustrate Mprof's features.
机译:在更高的时钟频率和超标量技术的推动下,处理器速度的增长继续超过内存系统性能的提高。反映这一趋势的是,架构师正在开发越来越复杂的内存层次结构以掩盖速度差距,编译器编写者正在添加局部性增强转换以更好地利用复杂的内存层次结构,并且应用程序程序员正在重新编码其算法以利用内存系统。所有这些组都需要有关内存行为的经验数据来指导其优化。本文介绍了如何结合简单的硬件支持和采样技术来获取此类数据,而又不会明显影响系统性能。通过使用比较寄存器和中断线增加缓存未命中计数器,以便当计数器与比较值匹配时中断处理器,我们可以对系统状态进行采样,并开发将缓存未命中与特定进程,过程,调用堆栈相关联的缓存未命中配置文件,地址或系统状态的用户定义方面。这个想法在Mprof原型中实现,该原型在sun Sparc 10/41上分析了数据停顿周期,一级缓存未命中和二级错误。提供了简单的案例研究来说明Mprof的功能。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号