Out-of-order vector architectures

机译：无序向量架构

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Register renaming and out-of-order instruction issue are now commonly used in superscalar processors. These techniques can also be used to significant advantage in vector processors, as this paper shows. Performance is improved and available memory bandwidth is used more effectively. Using a trace driven simulation we compare a conventional vector implementation, based on the Convex C3400, with an out-of-order, register renaming, vector implementation. When the number of physical registers is above 12, out-of-order execution coupled with register renaming provides a speedup of 1.24--1.72 for realistic memory latencies. Out-of-order techniques also tolerate main memory latencies of 100 cycles with a performance degradation less than 6%. The mechanisms used for register renaming and out-of-order issue can be used to support precise interrupts -- generally a difficult problem in vector machines. When precise interrupts are implemented, there is typically less than a 10% degradation in performance. Anew technique based on register renaming is targeted at dynamically eliminating spill code; this technique is shown to provide an extra speedup ranging between 1.10 and 1.20 while reducing total memory traffic by an average of 15--20%.

机译：寄存器重命名和乱序指令问题现在通常在超标量处理器中使用。如本文所示，这些技术也可以在矢量处理器中发挥显着优势。性能得到改善，可用内存带宽得到更有效的利用。使用跟踪驱动的仿真，我们将基于Convex C3400的常规矢量实现与乱序的寄存器重命名矢量实现进行了比较。当物理寄存器的数量大于12时，乱序执行与寄存器重命名一起提供了1.24--1.72的加速，以实现实际的内存延迟。乱序技术还可以承受100个周期的主内存延迟，而性能降级不到6％。用于寄存器重命名和乱序问题的机制可用于支持精确的中断，这通常是向量机中的难题。当实施精确的中断时，性能通常会下降不到10％。一种基于寄存器重命名的新技术旨在动态消除溢出代码。该技术显示出可提供1.10到1.20的额外加速，同时将总内存流量平均减少15--20％。

著录项

来源
《Annual ACM/IEEE international symposium on Microarchitecture;ACM/IEEE international symposium on Microarchitecture》|1997年|P.160-170|共11页
会议地点
作者
Roger Espasa; Mateo Valero; James E. Smith;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类总体结构、系统结构;
关键词
vector architecture, out-of-order execution, microarchitecture, memory traffic elimination, register renaming, precise interrupts, memory latency;

机译：向量架构，乱序执行，微架构，内存流量消除，寄存器重命名，精确中断，内存延迟;

相似文献

外文文献
中文文献
专利

1. O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference [J] . Geng Tong, Li Ang, Wang Tianqi, IEEE Transactions on Parallel and Distributed Systems . 2021,第1期

机译：O3BNN-R：用于高性能和正则化BNN推理的订单超出架构
2. Architecture Support for Task Out-of-Order Execution in MPSoCs [J] . Wang Chao, Li Xi, Zhang Junneng, Computers, IEEE Transactions on . 2015,第5期

机译：MPSoC中任务无序执行的体系结构支持
3. A Sequentially Consistent Multiprocessor Architecture for Out-of-Order Retirement of Instructions [J] . Ubal Rafael Parallel and Distributed Systems, IEEE Transactions on . 2012,第8期

机译：顺序一致的指令无序淘汰的多处理器体系结构
4. A Performance Study of Out-of-order Vector Architectures and Short Registers [C] . Luis Villa, Roger Espasa, Mateo Valero 1998 international conference on supercomputing . 1998

机译：无序向量架构和短寄存器的性能研究
5. Exploring Relationships between Vector-Borne Diseases and Landscape Architecture: Aedes aegypti, Aedes albopictus and Landscape Architecture [D] . Alarcon, Jorge Antonio. 2016

机译：探索载体传播疾病与景观建筑之间的关系：AEDES AEGYPTI，AEDES ALPOPICTUS和景观建筑
6. 42’:6’4- and 32’:6’3-Terpyridines: The Conflict between Well-Defined Vectorial Properties and Serendipity in the Assembly of 1D- 2D- and 3D-Architectures [O] . Y. Maximilian Klein, Alessandro Prescimone, Edwin C. Constable, 2017

机译：42’：6’4-和32’：6’3-三联吡啶：定义好的向量性质与1D2D和3D建筑装配中的偶然性之间的冲突
7. Out-of-Order Vector Architectures [O] . Roger Espasa, Mateo Valero, U. Polit`ecnica De Catalunya-barcelona, 1997

机译：无序向量架构

Out-of-order vector architectures

摘要

著录项

相似文献

相关主题

期刊订阅