Intra-Vector SIMD Instructions for Core Specialization

机译：核心专业化的载体内部SIMD指令

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Current research is mainly focussing on exploiting TLP to increase performance. Another avenue, however, for achieving performance scalability is specialization. In this paper we propose application specific intra-vector instructions for two dimensional signal processing kernels. In such kernels usually significant data rearrangement overhead is required in order to use the SIMD capabilities. When using the intra-vector instructions the overhead can be avoided. We have implemented intra-vector instructions in the Cell SPU core and measured speedups of up to 2.06, with an average of 1.45.

机译：目前的研究主要集中在利用TLP增加性能。但是，为了实现性能可扩展性，另一个大道是专业化的。在本文中，我们提出了应用于二维信号处理核的特定载体指令。在这种内核中，通常需要显着的数据重新排列开销，以便使用SIMD功能。当使用媒介内指令时，可以避免开销。我们已经在单元格核心中实施了载体的内部指令，并测量了高达2.06的加速，平均为1.45。

著录项

来源
《IEEE International Confernece on Computer Design》|2009年||共6页
会议地点
作者
Cor Meenderinck; Ben Juurlink;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词

相似文献

外文文献
中文文献
专利

1. A Hardware/Software Partitioning Algorithm for Processor Cores with Packed SIMD-Type Instructions [J] . Nozomu TOGAWA, Koichi TACHIKAKE, Yuichiro MIYAOKA, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2003,第12期

机译：带压缩SIMD类型指令的处理器内核的硬件/软件分区算法
2. A Retargetable Simulator Generator for DSP Processor Cores with Packed SIMD-type Instructions [J] . Nozomu TOGAWA, Kyosuke KASAHARA, Yuichiro MIYAOKA, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences . 2003,第12期

机译：具有打包SIMD类型指令的DSP处理器内核的可重定位模拟器生成器
3. A parallelizing compile algorithm in hardware/software cosynthesis system for processor cores with packed SIMD type instruction sets [J] . Nobuharu Suzuki, Nozomu Togawa, Masao Yanagisawa, 電子情報通信学会技術研究報告. 信号処理. Signal Processing . 2002,第168期

机译：带有压缩SIMD类型指令集的处理器内核的硬件/软件协同系统中的并行化编译算法
4. Intra-vector SIMD instructions for core specialization [C] . Meenderinck C., Juurlink B. Computer Design, 2009. ICCD 2009 . 2009

机译：用于核心专业化的矢量内SIMD指令
5. ILP-SIMD: An instruction parallel SIMD architecture with short -wire interconnects. [D] . Chung, Kee Shik. 2000

机译：ILP-SIMD：具有短线互连的指令并行SIMD体系结构。
6. CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions [O] . Yongchao Liu, Adrianto Wirawan, Bertil Schmidt 2013

机译：CUDASW ++ 3.0：通过耦合CPU和GPU SIMD指令来加速Smith-Waterman蛋白质数据库搜索
7. Intra-Vector SIMD Instructions for Core Specialization [O] . Cor Meenderinck, Ben Juurlink 2013

机译：核心专业化的矢量内sImD指令

Intra-Vector SIMD Instructions for Core Specialization

摘要

著录项

相似文献

相关主题

期刊订阅