首页> 外国专利> EFFICIENT AND ACCURATE WEIGHT QUANTIZATION FOR NEURAL NETWORKS

EFFICIENT AND ACCURATE WEIGHT QUANTIZATION FOR NEURAL NETWORKS

机译：神经网络的高效、准确的权重量化

页面导航

摘要
著录项
相似文献

摘要

Various embodiments relate to a method for producing a plurality of weights for a neural network, wherein the neural network includes a plurality of layers, including: receiving a definition of the neural network including the number of layers and the size of the layers; and training the neural network using a training data set including: segmenting N weights of the plurality of weights into I weight sub-vectors {right arrow over (w)}(i) of dimension K=N/I; applying constraints that force sub-vectors {right arrow over (w)}(i) to concentrate near a (K−1)-dimensional single-valued hypersurface surrounding the origin; and quantizing sub-vectors {right arrow over (w)}(i) to a set of discrete K-dimensional quantization vectors {right arrow over (q)}(i) distributed in a regular pattern near the hypersurface, wherein each sub-vector {right arrow over (w)}(i) is mapped to its nearest quantization vector {right arrow over (q)}(i).

机译：各种实施例涉及一种用于为神经网络产生多个权重的方法，其中，神经网络包括多个层，包括：接收神经网络的定义，包括层数和层的大小;使用训练数据集训练神经网络，包括：将多个权重的 N 个权重分割成 I 个权重子向量 {右箭头（w）}（i）维数 K=N/I;应用约束，迫使子向量 {（w）上方的右箭头}（i）集中在原点周围的（K−1）维单值超曲面附近;并将子向量 {右箭头在（w）}（i）上量化为一组离散的 K 维量化向量 {右箭头上的（q）}（i）以规则模式分布在超表面附近，其中每个子向量 {右箭头上的（w）}（i）映射到其最近的量化向量 {（q）}（i）上的右箭头。

著录项

公开/公告号US20230075609A1;US2023000075609A1;US2023075609A1;US2023075609

专利类型
公开/公告日2023-03-09

原文格式PDF
申请/专利权人 NXP B.V.;
展开▼

申请/专利号US17464824;US202100017464824;US202117464824A;US202117464824
发明设计人
展开▼

申请日2021-09-02
分类号G06N3/08;G06F7/544;G06F7/50;G06F7/523;
国家
入库时间 2024-06-14 23:52:38

相似文献

专利
外文文献
中文文献