Exploration of block-wise dynamic sparseness

Hadifar Amir; Deleu Johannes; Develder Chris; Demeester Thomas

Abstract

Neural networks have achieved state-of-the-art performance across a wide variety of machine learning tasks, often with large and computation-heavy models. Inducing sparseness as a way to reduce the memory and computation footprint of these models has seen significant research attention in recent years. In this paper, we present a new method for dynamic sparseness, whereby part of the computation is omitted dynamically, based on the input. For efficiency, we combine the idea of dynamic sparseness with block-wise matrix-vector multiplications. In contrast to static sparseness, which permanently zeroes out selected positions in weight matrices, our method preserves the full network capabilities by potentially accessing any trained weights. Yet, matrix-vector multiplications are accelerated by omitting a pre-defined fraction of weight blocks from the matrix, based on the input. Experimental results on the task of language modeling, using recurrent and quasi-recurrent models, show that the proposed method can outperform static sparseness baselines. In addition, our method can reach language modeling perplexities similar to those of the dense baseline, at half the computational cost at inference time. (c) 2021 Published by Elsevier B.V.
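As a rough illustration of the idea in the abstract, the sketch below performs a matrix-vector product in which only a fixed fraction of weight blocks is used, and the choice of blocks depends on the input. The abstract does not say how blocks are selected, so the linear gate `G`, the top-k selection rule, and all function and parameter names here are assumptions made for illustration, not the authors' method.

```python
def matvec(M, v):
    """Dense matrix-vector product on plain Python lists."""
    return [sum(m * a for m, a in zip(row, v)) for row in M]


def select_blocks(scores, keep_fraction):
    """Return the indices of the highest-scoring blocks (at least one)."""
    k = max(1, int(keep_fraction * len(scores)))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return set(ranked[:k])


def block_sparse_matvec(W, x, G, block, keep_fraction):
    """Approximate W @ x using only the blocks picked for this input.

    W is n_out x n_in, split into square blocks of side `block`.
    G is a hypothetical gate with one row per block (assumption):
    its scores G @ x decide which blocks are kept for this x.
    """
    n_out, n_in = len(W), len(x)
    cols = n_in // block                      # number of block columns
    kept = select_blocks(matvec(G, x), keep_fraction)
    y = [0.0] * n_out
    for b in kept:                            # skipped blocks cost nothing
        r0 = (b // cols) * block              # first row of block b
        c0 = (b % cols) * block               # first column of block b
        for i in range(r0, r0 + block):
            y[i] += sum(W[i][j] * x[j] for j in range(c0, c0 + block))
    return y, kept


# Toy usage: a 4x4 matrix in 2x2 blocks, keeping half of the four blocks.
W = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
G = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]  # toy gate
x = [1, 0, -1, 2]
y, kept = block_sparse_matvec(W, x, G, block=2, keep_fraction=0.5)
```

Unlike a static mask, no weight is removed permanently: a different input can select a different set of blocks, so every trained weight stays reachable. In a real model the gate would itself have to be cheap relative to the saved multiplications.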

Bibliographic record

Source
Pattern recognition letters | 2021, issue 11 | pp. 187-192 | 6 pages
Authors
Hadifar Amir; Deleu Johannes; Develder Chris; Demeester Thomas
Author affiliation

Univ Ghent, IMEC, Dept Informat Technol, Technol Pk 126, B-9052 Zwijnaarde, Belgium;

Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Chinese Library Classification
Keywords
Neural network; Dynamic sparseness; Block-wise matrix multiplication
