首页> 外文会议>International Symposium on High-Performance Computer Architecture >Predicting inter-thread cache contention on a chip multi-processor architecture
【24h】

Predicting inter-thread cache contention on a chip multi-processor architecture

机译:预测芯片多处理器架构的线程间缓存争用

获取原文
获取外文期刊封面目录资料

摘要

This paper studies the impact of L2 cache sharing on threads that simultaneously share the cache, on a chip multi-processor (CMP) architecture. Cache sharing impacts threads nonuniformly, where some threads may be slowed down significantly, while others are not. This may cause severe performance problems such as sub-optimal throughput, cache thrashing, and thread starvation for threads that fail to occupy sufficient cache space to make good progress. Unfortunately, there is no existing model that allows extensive investigation of the impact of cache sharing. To allow such a study, we propose three performance models that predict the impact of cache sharing on co-scheduled threads. The input to our models is the isolated L2 cache stack distance or circular sequence profile of each thread, which can be easily obtained on-line or off-line. The output of the models is the number of extra L2 cache misses for each thread due to cache sharing. The models differ by their complexity and prediction accuracy. We validate the models against a cycle-accurate simulation that implements a dual-core CMP architecture, on fourteen pairs of mostly SPEC benchmarks. The most accurate model, the inductive probability model, achieves an average error of only 3.9%. Finally, to demonstrate the usefulness and practicality of the model, a case study that details the relationship between an application's temporal reuse behavior and its cache sharing impact is presented.
机译:本文研究了L2缓存共享对同时共享缓存的线程的影响,在芯片多处理器(CMP)架构上。缓存共享影响线程不均匀,其中某些线程可能会显着减慢,而另一些线​​程则不是。这可能会导致严重的性能问题,例如子最优吞吐量,缓存抖动和线程删除的线程,用于线程无法占用足够的缓存空间以取得良好的进展。不幸的是,没有现有的模型,可以广泛调查缓存共享的影响。为了允许这样的研究,我们提出了三种性能模型,其预测高速缓存共享对共同计划的线程的影响。我们模型的输入是隔离的L2高速缓存堆栈距离或每个螺纹的圆形序列轮廓,可以在线或离线容易地获得。模型的输出是由于缓存共享导致每个线程的额外L2缓存未命中的数量。模型因其复杂性和预测精度而异。我们验证模型,以防止实现双核CMP架构的循环准确仿真,大多数规范基准测试。最准确的模型,电感概率模型,实现平均误差仅为3.9%。最后,为了展示模型的有用性和实用性,呈现了详细说明应用程序的时间重用行为与其高速缓存共享影响之间的关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号