首页> 外国专利> Neural network processing with the neural network model pinned to on-chip memories of hardware nodes

Neural network processing with the neural network model pinned to on-chip memories of hardware nodes

机译：用神经网络模型固定到硬件节点的片上存储器的神经网络处理

页面导航

摘要
著录项
相似文献

摘要

Systems and methods for neural network processing are provided. A method in a system comprising a plurality of nodes interconnected via a network, where each node includes a plurality of on-chip memory blocks and a plurality of compute units, is provided. The method includes upon service activation receiving an N by M matrix of coefficients corresponding to the neural network model. The method includes loading the coefficients corresponding to the neural network model into the plurality of the on-chip memory blocks for processing by the plurality of compute units. The method includes regardless of a utilization of the plurality of the on-chip memory blocks as part of an evaluation of the neural network model, maintaining the coefficients corresponding to the neural network model in the plurality of the on-chip memory blocks until the service is interrupted or the neural network model is modified or replaced.

机译：提供了用于神经网络处理的系统和方法。系统中的一种方法，包括经由网络互连的多个节点，其中每个节点包括多个片上存储块和多个计算单元。该方法包括通过与神经网络模型对应的系数的M矩阵接收N的服务激活。该方法包括将对应于神经网络模型的系数加载到多个片上存储器块中，用于由多个计算单元进行处理。该方法包括无论多个片上存储块的利用如何，作为神经网络模型的评估的一部分，维护与多个片上存储块中的神经网络模型相对应的系数直到服务被中断或修改或更换神经网络模型。

著录项

公开/公告号US11157801B2

专利类型
公开/公告日2021-10-26

原文格式PDF
申请/专利权人 MICROSOFT TECHNOLOGY LICENSING LLC;
展开▼

申请/专利号US201715637664
发明设计人 ERIC S. CHUNG;DOUGLAS C. BURGER;JEREMY FOWERS;KALIN OVTCHAROV;
展开▼

申请日2017-06-29
分类号G06N3/063;G06F9/38;G06N3/04;G06F17/16;
国家 US
入库时间 2022-08-24 21:53:06

相似文献

专利
外文文献
中文文献