首页> 外国专利> SYSTEMS AND METHODS FOR NEURAL NETWORK CONVOLUTIONAL LAYER MATRIX MULTIPLICATION USING CACHE MEMORY

SYSTEMS AND METHODS FOR NEURAL NETWORK CONVOLUTIONAL LAYER MATRIX MULTIPLICATION USING CACHE MEMORY

机译：使用缓存存储器的神经网络卷积层矩阵乘法的系统和方法

页面导航

摘要
著录项
相似文献

摘要

A computer processor may include a number of cores, a shared cache shared among the cores, and a local cache associated with each core and used by that core only. Input data for a neural network (NN) layer may be partitioned into a set of tiles of size T×T, and the tile set may be partitioned into blocks of R tiles. For each block, a core may perform a transform operation on the tiles to produce transformed data matrices fitting in a local cache, and a set of multiply operations, each multiply operation using a transformed data matrix and a transformed kernel matrix from a set of transformed kernel matrices. The set of transformed kernel matrices may fit in the shared cache. The result of at least one of the multiply operations may be stored in a location used to store a transformed data matrix.

机译：计算机处理器可以包括许多核心，在核中共享的共享共享高速缓存，以及与每个核心关联的本地高速缓存，并且仅由该核使用。神经网络（NN）层的输入数据可以被划分为大小T×T的图块，并且可以将瓦片集分成R瓦片的块。对于每个块，核心可以对图块上的变换操作执行变换操作以在本地高速缓存中产生拟合的变换数据矩阵，以及使用变换的数据矩阵和来自一组变换的变换的内核矩阵的每个乘法操作的变换的数据矩阵。内核矩阵。该组变换的内核矩阵可以适合共享缓存。至少一个乘法操作的结果可以存储在用于存储变换数据矩阵的位置。

著录项

公开/公告号US2021201124A1

专利类型
公开/公告日2021-07-01

原文格式PDF
申请/专利权人 NEURALMAGIC INC.;
展开▼

申请/专利号US201917271326
发明设计人 RATI GELASHVILI;
展开▼

申请日2019-08-27
分类号G06N3/063;G06F17/16;G06N3/08;
国家 US
入库时间 2022-08-24 19:42:14

相似文献

专利
外文文献
中文文献