首页> 外文会议>IEEE International Workshop on Signal Processing Systems >Fast Quantized Arithmetic on x86: Trading Compute for Data Movement
【24h】

Fast Quantized Arithmetic on x86: Trading Compute for Data Movement

机译:x86上的快速量化算法:数据移动的交易计算

获取原文
获取外文期刊封面目录资料

摘要

We introduce Clover, a new library for efficient computation using low-precision data, providing mathematical routines required by fundamental methods in optimization and sparse recovery. Our library faithfully implements variants of stochastic quantization that guarantee convergence at low precision, and supports data formats from 4-bit quantized to 32-bit IEEE-754 on current Intel processors. In particular, we show that 4-bit can be implemented efficiently using Intel AVX despite the lack of native support for this data format. Experimental results with dot product, matrix-vector multiplication (MVM), gradient descent (GD), and iterative hard thresholding (IHT) demonstrate that the attainable speedups are in many cases close to linear with respect to the reduction of precision due to reduced data movement. Finally, for GD and IHT, we show examples of absolute speedup achieved by 4-bit versus 32-bit, by iterating until a given target error is achieved.
机译:我们引入Clover,这是一个使用低精度数据进行有效计算的新库,它提供了优化和稀疏恢复中基本方法所需的数学例程。我们的库忠实地实现了随机量化的变体,以确保低精度的收敛,并支持当前Intel处理器上从4位量化到32位IEEE-754的数据格式。特别是,我们展示了尽管缺乏对此数据格式的本机支持,但可以使用Intel AVX有效地实现4位。点积,矩阵向量乘法(MVM),梯度下降(GD)和迭代硬阈值(IHT)的实验结果表明,由于数据减少而导致的精度降低,在许多情况下可达到的加速比接近线性移动。最后,对于GD和IHT,我们展示了通过迭代直到达到给定的目标误差而通过4位与32位实现的绝对加速的示例。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号