首页> 外文OA文献 >Temperature, energy and performance: addressing embedded system challenges through fast cache simulation
【2h】

Temperature, energy and performance: addressing embedded system challenges through fast cache simulation

机译:温度,能量和性能:通过快速缓存仿真解决嵌入式系统挑战

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Temperature, energy and performance are essential design considerations during the conception of modern digital systems. The work presented in this thesis focusses on three aspects that can be used to overcome these limitations. First an evaluation of the suitability of the dynamic application adaptation method is researched with the aim of using it to control the temperature of a Field Programmable Gate Array (FPGA) device. Despite the use of an extremely adaptive custom JPEG encoder, it was determined that application adaptation alone is ineffective in an FPGA for thermal management.Next, a study is performed which aims to assess which components are principally responsible for the rise in temperatures in FPGAs. It was found that the external memory interface is a significant heat-source in FPGA-based embedded systems, and that device temperature correlates with CPU cache miss rate. The third and main aspect covered in this dissertation is the speeding up of CPU cache simulation. Single pass cache simulation is a tool that can be employed at design time to select a cache yielding acceptable temperature, system performance and energy consumption. Three Multiple cAche Simulators in Hardware (MASH) or in Software (MASS) are proposed for three cache replacement policies: MASH{lru} for the Least Recently Used (LRU) cache algorithm, MASH{fifo} for First In First Out (FIFO) and MASS{plrut} for Pseudo Least Recently Used tree (PLRUt). The former two are novel in that they are implemented in hardware and are respectively 53x and 11.10x faster than software counterparts. The PLRUt simulator presents for the first time an optimised hash table-based algorithm yielding a speedup of 1.93x over an unoptimised solution. All cache simulators employ cache properties specific to their replacement policies to improve simulator characteristics.Additionally, it is shown that the hardware (or MASH) simulators can be implemented in-system alongside an embedded system, allowing for the direct trace extraction and cache simulation from within an FPGA. Using in-system simulation, large speedups can be achieved as trace generation and multiple cache simulation happen at the same time at high frequencies.
机译:在现代数字系统的概念中,温度,能量和性能是必不可少的设计考虑因素。本文提出的工作集中在可以用来克服这些局限性的三个方面。首先,研究动态应用自适应方法的适用性评估,目的是将其用于控制​​现场可编程门阵列(FPGA)设备的温度。尽管使用了高度自适应的定制JPEG编码器,但已确定仅应用自适应在FPGA的热量管理中是无效的。接下来,进行了一项研究,旨在评估哪些元件是造成FPGA温度升高的主要原因。结果发现,外部存储器接口是基于FPGA的嵌入式系统中的重要热源,并且器件温度与CPU缓存未命中率相关。本文涉及的第三个也是主要方面是加速CPU缓存仿真。单遍缓存仿真是一种工具,可以在设计时使用它来选择产生可接受的温度,系统性能和能耗的缓存。针对三种高速缓存替换策略,提出了三种在硬件(MASH)或软件(MASS)中的多重缓存模拟器:用于最近最少使用(LRU)缓存算法的MASH {lru},用于先进先出(FIFO)的MASH {fifo}和MASS {plrut}用于最近最少使用的伪树(PLRUt)。前两个是新颖的,因为它们是在硬件中实现的,比软件对应的速度分别快53倍和11.10倍。 PLRUt模拟器首次提出了一种基于哈希表的优化算法,与未优化的解决方案相比,可提高1.93倍的速度。所有高速缓存模拟器都采用特定于其替换策略的高速缓存属性来改善模拟器的特性。此外,还证明了硬件(或MASH)模拟器可以与嵌入式系统一起在系统内实现,从而可以直接从中提取跟踪并进行高速缓存模拟。在FPGA中。使用系统内仿真,可以在高频下同时发生跟踪生成和多个高速缓存仿真,从而实现大幅度的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号