首页> 外文学位 >Achieving high performance and energy efficiency in superpipelined processors.

【24h】

Achieving high performance and energy efficiency in superpipelined processors.

机译：在超流水线处理器中实现高性能和高能效。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

One approach to exploring instruction-level parallelism is superpipelining which uses deep pipelines to achieve high clock rates. Pipeline hazards, memory latency, and power consumption are three vital factors that limit the benefits of superpipelining. This dissertation presents several novel approaches to achieve high-performance and energy-efficient superpipelined microprocessors. These approaches focus mainly on reducing pipeline stalls, memory latency, and energy consumption of unnecesary bit switches.;To reduce the number of pipeline stalls, an optimizing instruction scheduler, named Super-reorderer, was built in which in-block scheduling and cross-block scheduling are applied to minimize the number of data and structural hazards. A novel branch scheme is proposed, called branch with masked squashing, to minimize the number of control hazards. The basic idea of branch with masked squashing is to fill delay slots with safe instructions which may come before or after the branch. For the remaining unfilled delay slots, instructions from the predicted target path are used to fill the delay slots. In the case of misprediction, only unsafe instructions are annulled. The safe instructions in branch delay slots are always executed.;To reduce memory latency, unconventional cache mapping functions, hardware-controlled instruction prefetching, and software-controlled data prefetching techniques are investigated. Two novel unconventional cache mapping functions: bit-flipping and segmented bit-selection are proposed and evaluated. A direct-mapped cache with these unconventional cache mapping functions can achieve high hit rates, while maintaining a hit time as fast as a direct-mapped cache with traditional mapping. A novel technique for software-controlled data prefetching is proposed in which the starting data in a data region of a working set is prefetched by software and the subsequence data in the data region is prefetched by hardware. One of the limitations of the software-controlled data prefetching techniques is the execution overhead caused by prefetch instructions. A novel instruction set is proposed in which non-memory-access operations are combined with an optional prefetch operation to effectively eliminate the execution overhead caused by a prefetch instruction. A novel hardware-controlled instruction prefetching technique, called branch correlation-based cache prefetching (BCCP), is proposed. The BCCP, which takes advantage of high branch prediction accuracies of correlation-based cache prediction and aggressive cache line look ahead prefetching, is able to effectively hide long instruction cache latency.;To reduce energy consumption in a modern instruction set processor, several novel hardware and software techniques are investigated. A software technique, called Cold Scheduling, is proposed to reduce energy consumption in the control path. The basic idea is to apply compilation techniques to reorder instruction sequences such that the amount of bit switching on the control path is minimal during program execution. Dynamic power management, which automatically shuts down power consumption in unused functional units during program execution, is investigated to reduce energy consumption in the data path. Two novel cache design techniques are proposed, namely Gray code addressing and cache partitioning, to reduce energy consumption in the caches. The idea of the Gray code addressing is to minimize the bit switches on address buses and I/O pads which usually consume a significant amount of energy in the caches. The idea of cache partitioning is to minimize average energy consumption in each cache access by vertically or horizontally partitioning cache memory cell arrays. (Abstract shortened by UMI.)

机译：探索指令级并行性的一种方法是超级流水线，它使用深流水线来实现高时钟速率。流水线危害，内存延迟和功耗是限制超级流水线优势的三个重要因素。本文提出了几种新颖的方法来实现高性能和高能效的超流水线微处理器。这些方法主要集中在减少流水线停顿，存储器等待时间和不必要的位开关的能量消耗上。为了减少流水线停顿的数量，构建了一个优化的指令调度程序，称为超级重排序程序，在其中进行了块内调度和交叉调度。应用块调度可最大程度地减少数据和结构危害的数量。提出了一种新的分支方案，称为带屏蔽挤压的分支，以最大程度地减少控制危害的数量。分支屏蔽掩蔽的基本思想是用安全指令填充延迟时隙，这些指令可能出现在分支之前或之后。对于剩余的未填充延迟时隙，使用来自预测目标路径的指令来填充延迟时隙。在错误预测的情况下，仅取消不安全的指令。为了减少存储器等待时间，研究了非常规的缓存映射功能，硬件控制的指令预取和软件控制的数据预取技术。提出并评估了两种新颖的非常规缓存映射功能：位翻转和分段位选择。具有这些非常规缓存映射功能的直接映射缓存可以实现较高的命中率，同时保持与传统映射的直接映射缓存一样快的命中时间。提出了一种用于软件控制的数据预取的新技术，其中，通过软件预取工作集的数据区域中的起始数据，并且通过硬件预取数据区域中的子序列数据。软件控制的数据预取技术的局限性之一是由预取指令引起的执行开销。提出了一种新颖的指令集，其中非存储器访问操作与可选的预取操作相结合以有效消除由预取指令引起的执行开销。提出了一种新的硬件控制指令预取技术，称为基于分支相关的缓存预取（BCCP）。 BCCP利用基于相关的高速缓存预测的高分支预测准确性和积极的高速缓存行提前预取功能，能够有效地隐藏较长的指令高速缓存等待时间。为了减少现代指令集处理器的能耗，一些新颖的硬件和软件技术进行了研究。提出了一种称为冷调度的软件技术，以减少控制路径中的能耗。基本思想是将编译技术应用于指令序列的重新排序，以使程序执行期间控制路径上的位切换量最小。研究了动态电源管理，该功能可在程序执行期间自动关闭未使用功能单元中的功耗，以减少数据路径中的能耗。提出了两种新颖的缓存设计技术，即格雷码寻址和缓存分区，以减少缓存中的能耗。格雷码寻址的想法是最大程度地减少地址总线和I / O焊盘上的位切换，这些位总线通常会在高速缓存中消耗大量能量。高速缓存分区的思想是通过垂直或水平分区高速缓存存储单元阵列来最大程度地减少每次高速缓存访问中的平均能耗。（摘要由UMI缩短。）

著录项

作者
Su, Ching-Long Jim.;
展开▼
作者单位

University of Southern California.;

展开▼
授予单位 University of Southern California.;
学科 Engineering Electronics and Electrical.;Computer Science.
学位 Ph.D.
年度 1995
页码 340 p.
总页数 340
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Enerize E3 Factory Energy Management System - For Visualizing the Energy Key Performance Indicator and Achieving Optimal Energy Efficiency [J] . Katsutomo Tanaka, Hiroshi Watanabe, Akira Endou Yokogawa Technical Report . 2010,第1期

机译：增强E3工厂能源管理系统-用于可视化能源关键绩效指标并实现最佳能源效率
2. Improving Energy Efficiency of Social Housing Areas: A Case Study of a Retrofit Achieving an "A" Energy Performance Rating in the UK [J] . MINNA SUNIKKA-BLANK, JUN CHEN, JUDITH BRITNELL, European Planning Studies . 2012,第1期

机译：改善社会住房区域的能源效率：以英国实现“ A”级能源绩效评级的改造为例
3. Improving Energy Efficiency of Social Housing Areas: A Case Study of a Retrofit Achieving an âAâ Energy Performance Rating in the UK [J] . Minna Sunikka-Blanka* Jun Chena Judith Britnellb Dimitra Dantsioub European Planning Studies . 2012,第1期

机译：改善社会住房区域的能源效率：以英国实现“ A”级能源绩效评级的改造为例
4. Managing Energy Efficiency in Buildings: How Standardization Will Help Architects and Design Concepters to Achieve Energy Performance of Buildings [C] . B. Ziegler, E. Khalil International Energy Conversion Engineering Conference . 2005

机译：管理建筑物的能源效率：标准化将如何帮助建筑师和设计概念设计师实现建筑物的能源绩效
5. Resource management techniques for performance and energy efficiency in multithreaded processors. [D] . Sharkey, Joseph James. 2006

机译：用于多线程处理器中性能和能源效率的资源管理技术。
6. Daily energy balance in growth hormone receptor/binding protein (GHR−/−) gene-disrupted mice is achieved through an increase in dark-phase energy efficiency [O] . Kenneth A. Longo, Darlene E. Berryman, Bruce Kelder, -1

机译：在生长激素受体每日能量平衡/结合蛋白（GHR - / - ）基因被破坏的小鼠中通过增加暗相位的能源效率来实现
7. An Analytical Model for IEEE 802.15.4 / ZigBee Wireless Sensor Networks with Duty Cycle Mechanism for Performance Prediction and Configuration of MAC Parameters to Achieve QoS and Energy Efficiency [O] . Dushyanta Dutta, Arindam Karmakar, Dilip Kr. Saikia 2015

机译：具有占空比机制的IEEE 802.15.4 / ZigBee无线传感器网络的分析模型，用于性能预测和maC参数配置，以实现Qos和能效

Achieving high performance and energy efficiency in superpipelined processors.

摘要

著录项

相似文献

相关主题

期刊订阅