首页> 外国专利> EFFICIENT PROCESSING OF NEURAL NETWORK MODELS

EFFICIENT PROCESSING OF NEURAL NETWORK MODELS

机译：高效处理神经网络模型

页面导航

摘要
著录项
相似文献

摘要

A compiler of a computing device is described that identifies a sequence of neural network models frequently invoked by an application of the computing device, compiles the models in that sequence, and loads a static random access memory (SRAM) of a hardware accelerator with the compiled models only when the same compiled models-from another, but same, sequence that was previously invoked-are not already present in the SRAM. This prevents unnecessary reloading of compiled models into the SRAM, thereby increasing runtime speed and conserving computational energy.

机译：描述了计算设备的编译器，其识别由计算设备的应用程序经常调用的经常调用的神经网络模型序列，在该序列中编译模型，并将硬件加速器的静态随机存取存储器（SRAM）与编译加载仅当相同的编译模型 - 来自另一个型号而且相同，先前调用的序列 - 尚未存在于SRAM中。这防止了不必要的编译模型在SRAM中重新加载到SRAM中，从而增加了运行时速度和节省计算能量。

著录项

公开/公告号WO2021183105A1

专利类型
公开/公告日2021-09-16

原文格式PDF
申请/专利权人 GOOGLE LLC;
展开▼

申请/专利号WO2020US21714
发明设计人 CHAUHAN ARUN;ASHOK RAKSIT;WOO DONG HYUK;
展开▼

申请日2020-03-09
分类号G06F8/41;G06F9/445;G06N3/063;G06F8/71;
国家 US
入库时间 2022-08-24 21:07:41

相似文献

专利
外文文献
中文文献