SYSTEMS AND METHODS FOR NEURAL NETWORK CONVOLUTIONAL LAYER MATRIX MULTIPLICATION USING CACHE MEMORY
Abstract
A computer processor may include a number of cores, a shared cache shared among the cores, and a local cache associated with each core and used by that core only. Input data for a neural network (NN) layer may be partitioned into a set of tiles of size T×T, and the tile set may be partitioned into blocks of R tiles. For each block, a core may perform a transform operation on the tiles to produce transformed data matrices fitting in a local cache, and a set of multiply operations, each multiply operation using a transformed data matrix and a transformed kernel matrix from a set of transformed kernel matrices. The set of transformed kernel matrices may fit in the shared cache. The result of at least one of the multiply operations may be stored in a location used to store a transformed data matrix.
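The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not specify the transform, whether tiles overlap, or how data matrices are paired with kernel matrices, so the transpose used as a transform, the non-overlapping tiles, and the modulo pairing are all assumptions made for illustration. Every function name here is hypothetical, and caches are not modeled: the `buffers` list only stands in for storage that would be sized to fit a core's local cache.

```python
def partition_into_tiles(data, T):
    """Split an H x W grid (H, W multiples of T) into non-overlapping T x T tiles."""
    H, W = len(data), len(data[0])
    return [
        [row[c:c + T] for row in data[r:r + T]]
        for r in range(0, H, T)
        for c in range(0, W, T)
    ]


def group_into_blocks(tiles, R):
    """Group the tile set into blocks of R tiles (the last block may be shorter)."""
    return [tiles[i:i + R] for i in range(0, len(tiles), R)]


def transform_tile(tile):
    """Placeholder transform (a transpose); the abstract does not specify one."""
    return [list(col) for col in zip(*tile)]


def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def process_block(block, transformed_kernels):
    """Work that one core would perform for one block of tiles."""
    # Transformed data matrices; in the described system these fit in a local cache.
    buffers = [transform_tile(tile) for tile in block]
    for i in range(len(buffers)):
        # Assumed pairing: data matrix i uses kernel (i modulo kernel count).
        kernel = transformed_kernels[i % len(transformed_kernels)]
        # The product overwrites the slot that held the transformed data matrix.
        # (Rebinding the list slot stands in for reusing the memory location.)
        buffers[i] = matmul(buffers[i], kernel)
    return buffers


data = [[r * 4 + c for c in range(4)] for r in range(4)]  # 4 x 4 toy input
identity = [[1, 0], [0, 1]]  # toy "transformed kernel matrices"
doubler = [[2, 0], [0, 2]]
tiles = partition_into_tiles(data, T=2)
blocks = group_into_blocks(tiles, R=2)
results = [process_block(block, [identity, doubler]) for block in blocks]
```

With T = 2 and R = 2, the 4×4 input yields four tiles grouped into two blocks. In a multi-core setting each block could be handed to a different core, with the kernel list playing the role of the set held in the shared cache.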