Vectorized transforms in scalar processors

Trelewicz J.Q.; Mitchell J.L.; Brady M.T.

首页> 外文期刊>IEEE Signal Processing Magazine >Vectorized transforms in scalar processors

【24h】

Vectorized transforms in scalar processors

机译：标量处理器中的向量化转换

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We disclose a generalized approach to creating efficientnimplementations of linear, orthogonal transforms, with specific examplesndiscussed for the 8 x 8 DCT used in image compression. We connect thisnwith a method for performing signed, parallel processing in scalar,noff-the-shelf processors for integer transforms. Uniform data precisionnmay be used, but is not required for the method. The coefficientsnresulting from the new algorithm converge more quickly than thenapproximation made to the coefficients. Furthermore, the new algorithmnallows more control of the specific representation chosen for thencoefficients, as is detailed below. The methods described were designednfor addressing this need with two's-complement arithmetic. Data that cannbe processed in parallel, because of the algorithm structure, are packednin a "vector" format, described, into registers. Many signed arithmeticnoperations can be performed on these vectors, including addition,nsubtraction, multiplication by scalars, shifting, and others. When thenparallel processing is completed, the vectors can be unpacked intonscalar values for storage or subsequent processing. The importance ofnthese methods lies in their handling of carries and borrows in thenpacked vector format. The generalized method is described. Notation isngiven at the beginning to establish consistency through the article. Wendiscuss a generalized approach to integer transforms, using the DCT as anspecific example. Then we detail the vector format, which allows vectorncomputation in scalar processors of parallelizable algorithms. The IDCTnis used as a numerical example in the discussion of the vector format.nThe results were developed for high-end printers (e.g., more than 100npages per minute), where image compression and decompression must benperformed in real time, either in FPGAs, or in embedded processors;nhowever, the methods are applicable to a broad range of signalnprocessing systems

机译：我们公开了一种通用的方法来创建线性正交变换的有效实现，并针对图像压缩中使用的8 x 8 DCT讨论了具体示例。我们将此与用于在整数转换的标量，现成处理器中执行带符号并行处理的方法联系在一起。可以使用统一的数据精度，但该方法不是必需的。新算法产生的系数收敛速度比对系数的逼近速度快。此外，新算法允许对随后为系数选择的特定表示进行更多控制，如下所述。设计了所描述的方法，以通过二进制补码算法解决这一需求。由于算法结构的原因，无法并行处理的数据以描述的“矢量”格式打包到寄存器中。可以对这些向量执行许多有符号算术运算，包括加法，n减法，标量乘法，移位等。当并行处理完成时，可以解压缩矢量的intonscalar值以进行存储或后续处理。这些方法的重要性在于它们以打包的矢量格式处理进位和借位。描述了通用方法。一开始没有给出符号来建立本文的一致性。 Wendiscus以DCT为例，讨论了一种通用的整数转换方法。然后，我们详细介绍了矢量格式，该格式允许在可并行算法的标量处理器中进行矢量计算。 IDCTnis在矢量格式的讨论中用作数值示例。n结果是针对高端打印机（例如，每分钟超过100npages）开发的，在高端打印机中，必须在FPGA中实时执行图像压缩和解压缩。在嵌入式处理器中；但是，这些方法适用于广泛的信号处理系统

著录项

来源
《IEEE Signal Processing Magazine》 |2002年第4期|p.22-31|共10页
作者
Trelewicz J.Q.; Mitchell J.L.; Brady M.T.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类通信理论;
关键词
data compression; digital arithmetic; discrete cosine transforms; embedded systems; field programmable gate arrays; image coding; inverse problems; parallel algorithms; parallel architectures; transform coding; DCT; FPGA; addition; algorithm structure; coefficients co;

机译：数据压缩;数字算术;离散余弦变换;嵌入式系统;现场可编程门阵列;图像编码;反问题;并行算法;并行体系结构;变换编码;DCT;FPGA;加法;算法结构;系数co;

相似文献

外文文献
中文文献
专利

1. The least square inversion method for the exterior ray transforms of 3D scalar and vector fields [J] . Balandin A. L. Mathematical Methods in the Applied Sciences . 2017,第18期

机译：3D标量和矢量字段外光变换的最小二乘反演方法
2. THE LOCALIZED BASIS FUNCTIONS FOR SCALAR AND VECTOR 3D TOMOGRAPHY AND THEIR RAY TRANSFORMS [J] . Balandin Alexander Inverse problems and imaging . 2016,第4期

机译：标量和矢量3D断层扫描的局部基础函数及其射线变换
3. Information Rates of Densely Sampled Data: Distributed Vector Quantization and Scalar Quantization With Transforms for Gaussian Sources [J] . Neuhoff D.L., Pradhan S.S. IEEE Transactions on Information Theory . 2013,第9期

机译：密集采样数据的信息速率：高斯源变换的分布式矢量量化和标量量化
4. VP2000 series dual scalar and quadruple scalar models supercomputer systems-a new concept in vector processing [C] . Miura, K., Nagakura, . 1991

机译：VP2000系列双标量和四标量模型超级计算机系统-矢量处理的新概念
5. Analysis, detection and classification of signals using scalar and vector sparse matrix transforms. [D] . Bachega, Leonardo R. 2013

机译：使用标量和矢量稀疏矩阵变换对信号进行分析，检测和分类。
6. Isotropic scalar image visualization of vector differential image data using the inverse Riesz transform [O] . Kieran G. Larkin, Peter A. Fletcher 2014

机译：使用逆Riesz变换对矢量差分图像数据进行各向同性标量图像可视化
7. Vector Processing in Scalar Processors for Signal Processing Algorithms [O] . Michael T. Brady, J. Q. Trelewicz, Joan L. Mitchell 2001

机译：标量处理器中的矢量处理，用于信号处理算法
8. Performance evaluation of the IBM RISC System/6000: Comparison of an optimized scalar processor with two vector processors. [R] . Simmons, M. L., Wasserman, H. J. 1990

机译：IBm RIsC system / 6000的性能评估：优化的标量处理器与两个矢量处理器的比较。

Vectorized transforms in scalar processors

摘要

著录项

相似文献

相关主题

期刊订阅