首页> 外国专利> RUNTIME OPTIMIZATION OF COMPUTATIONS OF AN ARTIFICIAL NEURAL NETWORK COMPILED FOR EXECUTION ON A DEEP LEARNING ACCELERATOR

RUNTIME OPTIMIZATION OF COMPUTATIONS OF AN ARTIFICIAL NEURAL NETWORK COMPILED FOR EXECUTION ON A DEEP LEARNING ACCELERATOR

机译:为在深度学习加速器上执行而编译的人工神经网络计算的运行时优化

摘要

Systems, devices, and methods related to a Deep Learning Accelerator and memory are described. For example, an integrated circuit device may be configured to execute instructions with matrix operands and configured with random access memory (RAM). A compiler is configured to generate instructions executable by the Deep Learning Accelerator from a description of a target artificial neural network. The instructions may call routines in a runtime library that has an embedded artificial neural network configured to predict optimized execution options available to implement the routines. The prediction is based at least in part on a pattern of data being processed in the target artificial neural network and/or a pattern of usages of the routines by the instructions.
机译:描述了与深度学习加速器和存储器相关的系统、设备和方法。例如,集成电路设备可以配置为使用矩阵操作数执行指令,并配置为使用随机存取存储器(RAM)。编译器配置为从目标人工神经网络的描述生成深度学习加速器可执行的指令。这些指令可以调用运行库中的例程,运行库中有一个嵌入式人工神经网络,用于预测可用于实现例程的优化执行选项。预测至少部分基于目标人工神经网络中正在处理的数据模式和/或指令对例程的使用模式。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号