Throughput-Distortion Computation of Generic Matrix Multiplication: Toward a Computation Channel for Digital Signal Processing Systems

Anastasia D.; Andreopoulos Y.

Abstract

The generic matrix multiply (GEMM) function is the core element of high-performance linear algebra libraries used in many computationally demanding digital signal processing (DSP) systems. We propose an acceleration technique for GEMM based on dynamically adjusting the imprecision (distortion) of computation. Our technique employs adaptive scalar companding and rounding to input matrix blocks followed by two forms of packing in floating-point that allow for concurrent calculation of multiple results. Since the adaptive companding process controls the increase of concurrency (via packing), the increase in processing throughput (and the corresponding increase in distortion) depends on the input data statistics. To demonstrate this, we derive the optimal throughput-distortion control framework for GEMM for the broad class of zero-mean, independent identically distributed, input sources. Our approach converts matrix multiplication in programmable processors into a computation channel: when increasing the processing throughput, the output noise (error) increases due to: (i) coarser quantization; and (ii) computational errors caused by exceeding the machine-precision limitations. We show that, under certain distortion in the GEMM computation, the proposed framework can significantly surpass 100% of the peak performance of a given processor. The practical benefits of our proposal are shown in a face recognition system and a multilayer perceptron system trained for metadata learning from a large music feature database.
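
The packing idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' scheme: it assumes a fixed quantization scale in place of adaptive companding, a single power-of-two packing factor, and hypothetical helper names (`quantize`, `packed_two_products`, `shift`). It only shows how two integer-valued matrices can share one floating-point multiplication, and why the result stays exact only while values fit within the double-precision mantissa.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain triple-loop matrix product in double precision."""
    inner, cols = len(b), len(b[0])
    return [[sum(row[p] * b[p][j] for p in range(inner)) for j in range(cols)]
            for row in a]


def quantize(x: Matrix, scale: float) -> Matrix:
    """Scale and round to integer values (assumed stand-in for companding).

    A smaller scale gives coarser quantization, hence more distortion.
    """
    return [[float(round(v * scale)) for v in row] for row in x]


def packed_two_products(a: Matrix, b1: Matrix, b2: Matrix,
                        shift: float) -> Tuple[Matrix, Matrix]:
    """Return (a @ b1, a @ b2) using a single multiplication.

    Assumptions: all entries are integer-valued, every entry of a @ b1 has
    magnitude below shift / 2, shift is a power of two, and every
    intermediate value stays below 2**53 so float64 arithmetic is exact.
    """
    # Pack both right-hand matrices into one: b1 in the low part, b2 in the high part.
    packed = [[u + shift * v for u, v in zip(r1, r2)] for r1, r2 in zip(b1, b2)]
    c = matmul(a, packed)  # equals a @ b1 + shift * (a @ b2)
    # Unpack: the high part is recovered by rounding, the low part by subtraction.
    c2 = [[float(round(v / shift)) for v in row] for row in c]
    c1 = [[v - shift * w for v, w in zip(rv, rw)] for rv, rw in zip(c, c2)]
    return c1, c2


if __name__ == "__main__":
    scale = 8.0  # hypothetical quantization scale
    qa = quantize([[0.52, -1.31], [0.87, 0.25]], scale)
    qb1 = quantize([[1.1, -0.4], [0.3, 0.9]], scale)
    qb2 = quantize([[-0.7, 0.2], [0.5, 1.6]], scale)
    c1, c2 = packed_two_products(qa, qb1, qb2, shift=2.0 ** 20)
    # Dividing by scale**2 maps the integer products back to approximate real values.
    print([[v / scale ** 2 for v in row] for row in c1])
    print([[v / scale ** 2 for v in row] for row in c2])
```

In this sketch, lowering `scale` shrinks the integer range, which would leave room to pack more operands per multiplication at the cost of coarser quantization. If the magnitude bound on `a @ b1` is violated, the unpacking step returns wrong values, which corresponds to the second error source named in the abstract (exceeding machine-precision limits).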

Bibliographic details

Source: Signal Processing, IEEE Transactions on, 2012, no. 4, pp. 2024-2037 (14 pages)
Authors: Anastasia D.; Andreopoulos Y.
Affiliation: Dept. of Electronic & Electrical Engineering, University College London, London, UK
Original format: PDF
Text language: English
Keywords: BLAS level-3; high performance computing; matrix-based DSP; stochastic error estimation; throughput-distortion tradeoffs in DSP
