首页> 外文学位 >Improving cache performance by smart page mapping in application programs.
【24h】

Improving cache performance by smart page mapping in application programs.

机译:通过在应用程序中进行智能页面映射来提高缓存性能。

获取原文
获取原文并翻译 | 示例

摘要

This thesis studies the use of software methods to improve memory performance in a heterogeneous cache system. In particular, it studies a cache system that consists of two data caches, the main cache and the mini cache, at the same level of the cache hierarchy. Both caches may differ in size, associativity, and replacement policy. Furthermore, this cache system allows application programs to specify the caching policy for each virtual page among three choices, i.e. whether to map the page to the main cache, the mini-cache, or neither. In the latter case, the memory accesses to this page bypass both caches. This process of specifying the mapping between the caching policies and the virtual pages is called cache mapping.; This thesis investigates the problem of optimal cache mapping, assuming that we can predict the trace of the memory reference in advance. The practical effectiveness of such cache mapping will depend on the degree of accuracy of such prediction. However, even assuming perfect prediction, cache mapping faces complex and important issues, which are the focus of this thesis. On the theoretical side, we prove that the problem of finding the optimal cache mapping for an arbitrary memory trace is NP-hard. On the experimental side, we propose two techniques to perform the cache mapping. The first technique is a heuristic algorithm which is applied to the entire memory trace. The cache mapping problem is reduced to an approximated problem and then transformed to a network flow problem which can be solved in polynomial time. The second technique is based on trace sampling. Under the proposed sampling framework, a number of important loops are automatically selected for sampling. The optimal cache mapping for the simplified memory trace is obtained by solving an Integer Linear Programming (ILP) problem. Both techniques are implemented for two real world PDAs, namely the Compaq iPAQ 3650 which has an Intel StrongARM SA-1110 processor core, and the Compaq iPAQ 3950 which has an Intel XScale processor core. We report the experimental results including performance improvements, memory access statistics, and energy consumption savings.
机译:本文研究了使用软件方法来提高异构缓存系统中的存储器性能。特别是,它研究了由两个数据高速缓存组成的高速缓存系统,主高速缓存和小型高速缓存位于高速缓存层次结构的同一级别。两个缓存的大小,关联性和替换策略可能不同。此外,该高速缓存系统允许应用程序在三种选择中为每个虚拟页面指定高速缓存策略,即是否将页面映射到主高速缓存,迷你高速缓存或两者都不映射。在后一种情况下,对该页面的内存访问将绕过两个缓存。指定缓存策略和虚拟页面之间的映射的过程称为缓存映射。本文假设可以提前预测内存引用的轨迹,因此研究了最佳缓存映射的问题。这种高速缓存映射的实际有效性将取决于这种预测的准确度。然而,即使假设完美的预测,缓存映射也面临着复杂而重要的问题,这是本文的重点。从理论上讲,我们证明为任意内存跟踪找到最佳缓存映射的问题是NP-hard。在实验方面,我们提出了两种技术来执行缓存映射。第一种技术是一种启发式算法,该算法应用于整个内存跟踪。将缓存映射问题简化为一个近似问题,然后转换为可以在多项式时间内解决的网络流问题。第二种技术基于跟踪采样。在建议的采样框架下,将自动选择一些重要的循环进行采样。通过解决整数线性规划(ILP)问题,可以获得用于简化内存跟踪的最佳高速缓存映射。两种技术都针对两个实际的PDA实现,分别是具有Intel StrongARM SA-1110处理器内核的Compaq iPAQ 3650和具有Intel XScale处理器内核的Compaq iPAQ 3950。我们报告了实验结果,包括性能改进,内存访问统计信息和能源消耗节省。

著录项

  • 作者

    Xu, Rong.;

  • 作者单位

    Purdue University.;

  • 授予单位 Purdue University.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2005
  • 页码 111 p.
  • 总页数 111
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号