首页> 外国专利> RUNTIME OPTIMIZATION OF COMPUTATIONS OF AN ARTIFICIAL NEURAL NETWORK COMPILED FOR EXECUTION ON A DEEP LEARNING ACCELERATOR

RUNTIME OPTIMIZATION OF COMPUTATIONS OF AN ARTIFICIAL NEURAL NETWORK COMPILED FOR EXECUTION ON A DEEP LEARNING ACCELERATOR

机译：为在深度学习加速器上执行而编译的人工神经网络计算的运行时优化

页面导航

摘要
著录项
相似文献

摘要

Systems, devices, and methods related to a Deep Learning Accelerator and memory are described. For example, an integrated circuit device may be configured to execute instructions with matrix operands and configured with random access memory (RAM). A compiler is configured to generate instructions executable by the Deep Learning Accelerator from a description of a target artificial neural network. The instructions may call routines in a runtime library that has an embedded artificial neural network configured to predict optimized execution options available to implement the routines. The prediction is based at least in part on a pattern of data being processed in the target artificial neural network and/or a pattern of usages of the routines by the instructions.

机译：描述了与深度学习加速器和存储器相关的系统、设备和方法。例如，集成电路设备可以配置为使用矩阵操作数执行指令，并配置为使用随机存取存储器（RAM）。编译器配置为从目标人工神经网络的描述生成深度学习加速器可执行的指令。这些指令可以调用运行库中的例程，运行库中有一个嵌入式人工神经网络，用于预测可用于实现例程的优化执行选项。预测至少部分基于目标人工神经网络中正在处理的数据模式和/或指令对例程的使用模式。

著录项

公开/公告号WO2022098504A1

专利类型
公开/公告日2022-05-12

原文格式PDF
申请/专利权人 MICRON TECHNOLOGY INC.;
展开▼

申请/专利号WO2021US55610
发明设计人 MING CHANG ANDRE XIAN;ZAIDY ALIASGER TAYEB;VITEZ MARKO;CULURCIELLO EUGENIO;
展开▼

申请日2021-10-19
分类号G06N3/063;G06N3/04;G06N3/08;
国家 US
入库时间 2024-06-14 23:05:15

相似文献

专利
外文文献
中文文献