NEURAL NETWORK INFERENCE STRUCTURE OPTIMIZATION METHOD AND DEVICE
Abstract
A neural network inference structure optimization method, comprising: when the Mth and (M+2)th network layers of a neural network inference structure are both normalization layers, the (M+1)th network layer is a convolutional layer or a fully connected layer, and the output of the (M+1)th network layer is connected only to the (M+2)th network layer (501), invoking a first preset algorithm to process the (M+1)th network layer so as to merge the (M+2)th network layer into it, obtaining a first optimized network layer (502); and invoking a second preset algorithm to process the first optimized network layer so as to merge the Mth network layer into it as well (503). The method can substantially reduce the amount of computation and the processing delay of neural network inference, thereby improving the inference efficiency of a neural network model.
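The abstract does not disclose the "preset algorithms" themselves, but the described merges are consistent with standard normalization folding: a normalization layer applies an affine transform per channel, so it can be absorbed into the weights and bias of an adjacent linear layer. The sketch below is an illustrative reconstruction for a fully connected layer, not the patented implementation; the function names, the use of batch-norm-style parameters (gamma, beta, running mean, running variance), and the layer shapes are all assumptions.

```python
import numpy as np

def fold_following_norm(W, b, gamma, beta, mu, var, eps=1e-5):
    """Merge a normalization layer that FOLLOWS a fully connected layer
    (the (M+2)th layer in the abstract) into that layer's parameters.
    y = gamma * (W x + b - mu) / sqrt(var + eps) + beta
      = (s * W) x + (s * (b - mu) + beta),  with s = gamma / sqrt(var + eps)."""
    s = gamma / np.sqrt(var + eps)          # per-output-channel scale
    return s[:, None] * W, s * (b - mu) + beta

def fold_preceding_norm(W, b, gamma, beta, mu, var, eps=1e-5):
    """Merge a normalization layer that PRECEDES the layer (the Mth layer)
    into the already-optimized parameters.
    W (s * (x - mu) + beta) + b = (W diag(s)) x + (W @ (beta - s * mu) + b)."""
    s = gamma / np.sqrt(var + eps)          # per-input-channel scale
    return W * s[None, :], W @ (beta - s * mu) + b

# Reference path: norm (M) -> fully connected (M+1) -> norm (M+2).
rng = np.random.default_rng(0)
n_in, n_out, eps = 4, 3, 1e-5
W, b = rng.standard_normal((n_out, n_in)), rng.standard_normal(n_out)
g1, be1 = rng.standard_normal(n_in), rng.standard_normal(n_in)
mu1, v1 = rng.standard_normal(n_in), rng.random(n_in) + 0.1
g2, be2 = rng.standard_normal(n_out), rng.standard_normal(n_out)
mu2, v2 = rng.standard_normal(n_out), rng.random(n_out) + 0.1
x = rng.standard_normal(n_in)

x_n = g1 * (x - mu1) / np.sqrt(v1 + eps) + be1        # Mth layer
y = W @ x_n + b                                        # (M+1)th layer
reference = g2 * (y - mu2) / np.sqrt(v2 + eps) + be2   # (M+2)th layer

# Step 502: merge the (M+2)th layer into the (M+1)th layer.
W1, b1 = fold_following_norm(W, b, g2, be2, mu2, v2)
# Step 503: merge the Mth layer into the first optimized layer.
W2, b2 = fold_preceding_norm(W1, b1, g1, be1, mu1, v1)
fused = W2 @ x + b2    # one matrix-vector product replaces all three layers
```

At inference time the fused layer computes the same output as the original three-layer sequence, which is how merging removes the normalization layers' computation and delay without changing the model's result.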