Initial results on the performance and cost of vector microprocessors

机译：向量微处理器的性能和成本的初步结果

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Increasingly wider superscalar processors are experiencing diminishing performance returns while requiring larger portions of die area dedicated to control rather than datapath. As an alternative to using these processors to exploit parallelism effectively, we are investigating the viability of using single-chip vector microprocessors. This paper presents some initial results of our investigation where we compare the performance and cost of vector microprocessors to that of aggressive, out-of-order super- scalar microprocessors.On the performance side, we show that vector processors are able to execute a highly parallel, integer-based application 1.5- 7.3 times faster than superscalar processors can by exploiting parallelism more effectively. This ability stems from the use of vector instructions. Vector instructions exploit parallelism across loop iterations by implicitly re-scheduling operations and temporally localizing the parallelism. Vector instructions also reduce instruction bandwidth by more than an order of magnitude because they express an abundance of parallelism in a compact encoding.On the cost side we show that, to achieve these performance gains, highly parallel, integer-based vector microprocessors are no more costly to implement than existing in-order and out-of- order superscalar microprocessors. One reason for this is that the organization of a vector register file provides tremendous bandwidth without incurring a large area penalty. A second reason is that the control logic for issuing vector instructions is relatively simple.Both the performance gains and cost savings are possible because vector processors rely on a vectorizing compiler, rather than hardware, to detect parallelism and to express it in a compact form to the hardware. These initial results suggest that transferring this functionality to the compiler offers a tremendous performance/cost benefit.

机译：越来越宽的超标量处理器正经历着越来越低的性能回报，同时需要更多的裸片区域专用于控制而不是数据路径。作为使用这些处理器有效利用并行性的替代方法，我们正在研究使用单芯片矢量微处理器的可行性。本文介绍了我们研究的一些初步结果，我们将矢量微处理器的性能和成本与具有攻击性的无序超标量微处理器进行了比较。在性能方面，我们证明了矢量处理器能够执行高度高效的处理。通过更有效地利用并行性，基于整数的并行应用程序比超标量处理器快1.5- 7.3倍。这种能力源于矢量指令的使用。向量指令通过隐式地重新调度操作并在时间上局部化并行性，从而在循环迭代中利用并行性。矢量指令还将指令带宽减少了一个数量级，因为它们在紧凑的编码中表示出大量的并行性。在成本方面，我们表明，要获得这些性能提升，高度并行的基于整数的矢量微处理器将不再存在。与现有的有序和无序超标量微处理器相比，实施起来成本高昂。这样做的一个原因是向量寄存器文件的组织提供了巨大的带宽而又不会产生大的面积损失。第二个原因是发出矢量指令的控制逻辑相对简单，因为矢量处理器依靠矢量化编译器而不是硬件来检测并行度并以紧凑的形式表示并行度，从而可以提高性能并节省成本。硬件。这些初步结果表明，将此功能转移到编译器可提供巨大的性能/成本优势。

著录项

来源
《Annual ACM/IEEE international symposium on Microarchitecture;ACM/IEEE international symposium on Microarchitecture》|1997年|P.171-182|共12页
会议地点
作者
Corinna G. Lee; Derek J. DeVries;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类总体结构、系统结构;
关键词

相似文献

外文文献
中文文献
专利

1. Performance and energy impact of parallelization and vectorization techniques in modern microprocessors [J] . Juan M. Cebrian, Lasse Natvig, Jan Christian Meyer Computing . 2014,第12期

机译：现代微处理器中并行化和矢量化技术的性能和能量影响
2. Efficient Utilization of Vector Registers to Improve FFT Performance on SIMD Microprocessors [J] . Feng YU, Ruifeng GE, Zeke WANG IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2013,第7期

机译：向量寄存器的有效利用以提高SIMD微处理器上的FFT性能
3. Low Cost Concurrent Error Detection Strategy for the Control Logic of High Performance Microprocessors and Its Application to the Instruction Decoder [J] . D. Rossi, M. Omana, G. Garrammone, Journal of Electronic Testing: Theory and Applications: Theory and Applications . 2013,第3期

机译：高性能微处理器控制逻辑的低成本并发错误检测策略及其在指令解码器中的应用
4. Initial results on the performance and cost of vector microprocessors [C] . Lee, C.G., DeVries, . 1997

机译：向量微处理器的性能和成本的初步结果
5. Cost-performance optimizations of microprocessors. [D] . Fu, Steve Te-Hsiang. 2001

机译：微处理器的性价比优化。
6. Initial experience with a microprocessor controlled current based defibrillator. [O] . G W Dalzell, S R Cunningham, J Anderson, 1989

机译：使用微处理器控制的基于电流的除颤器的初步经验。
7. Initial Results on the Performance and Cost of Vector Microprocessors [O] . Corinna G. Lee, Derek J. DeVries 1997

机译：矢量微处理器性能和成本的初步结果
8. Initial Flight Test Evaluation of the F-15 ACTIVE Axisymmetric Vectoring Nozzle Performance [R] . Orme, John S., Hathaway, Ross, Ferguson, Michael D. 1998

机译：F-15主动轴对称矢量喷管性能的初始飞行试验评估

Initial results on the performance and cost of vector microprocessors

摘要

著录项

相似文献

相关主题

期刊订阅