【24h】

ProfileMe

机译:ProfileMe

获取原文

摘要

Profile data is valuable for identifying performance bottlenecks and guiding optimizations. Periodic sampling of a processor's performance monitoring hardware is an effective, unobtrusive way to obtain detailed profiles. Unfortunately, existing hardware simply counts events, such as cache misses and branch mispredictions, and cannot accurately attribute these events to instructions, especially on out-of-order machines. We propose an alternative approach, called ProfileMe, that samples instructions. As a sampled instruction moves through the processor pipeline, a detailed record of all interesting events and pipeline stage latencies is collected. ProfileMe also support paired sampling, which captures information about the interactions between concurrent instructions, revealing information about useful concurrency and the utilization of various pipeline stages while an instruction is in flight. We describe an inexpensive hardware implementation of ProfileMe, outline a variety of software techniquesto extract useful profile information from the hardware, and explain several ways in which this information can provide valuable feedback for programmers and optimizers.
机译:概要文件数据对于识别性能瓶颈和指导优化很有价值。定期对处理器的性能监视硬件进行采样是一种获取详细配置文件的有效,简便的方法。不幸的是,现有的硬件仅对诸如高速缓存未命中和分支错误预测之类的事件进行计数,而不能将这些事件准确地归因于指令,尤其是在乱序机器上。我们提出了一种称为ProfileMe的替代方法,该方法对指令进行采样。当采样的指令在处理器管线中移动时,将收集所有有趣事件和管线阶段等待时间的详细记录。 ProfileMe还支持配对采样,该采样可捕获有关并发指令之间交互的信息,显示有关有用的并发信息以及在指令执行过程中各个流水线阶段的利用率的信息。我们描述了ProfileMe的廉价硬件实现,概述了从硬件中提取有用的配置文件信息的各种软件技术,并说明了该信息可以为程序员和优化人员提供有价值的反馈的几种方法。

著录项

相似文献

  • 外文文献
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号