首页> 外文会议>European conference on computer vision >One Weight Bitwidth to Rule Them All
【24h】

One Weight Bitwidth to Rule Them All

机译:一个重量束缚统治它们

获取原文

摘要

Weight quantization for deep ConvNets has shown promising results for applications such as image classification and semantic segmentation and is especially important for applications where memory storage is limited. However, when aiming for quantization without accuracy degradation, different tasks may end up with different bitwidths. This creates complexity for software and hardware support and the complexity accumulates when one considers mixed-precision quantization, in which case each layer's weights use a different bitwidth. Our key insight is that optimizing for the least bitwidth subject to no accuracy degradation is not necessarily an optimal strategy. This is because one cannot decide optimality between two bitwidths if one has smaller model size while the other has better accuracy. In this work, we take the first step to understand if some weight bitwidth is better than others by aligning all to the same model size using a width-multiplier. Under this setting, somewhat surprisingly, we show that using a single bitwidth for the whole network can achieve better accuracy compared to mixed-precision quantization targeting zero accuracy degradation when both have the same model size. In particular, our results suggest that when the number of channels becomes a target hyperparameter, a single weight bitwidth throughout the network shows superior results for model compression.
机译:深度扫描的重量量化为图像分类和语义分割等应用已经显示了有希望的结果,对于存储器存储有限的应用尤为重要。然而,当瞄准量化而没有准确性降级时,不同的任务可能最终有不同的比特宽度。这为软件和硬件支持创造了复杂性,并且当一个人考虑混合精度量化时,复杂性累积,在这种情况下,每个层的权重都使用不同的比特宽度。我们的主要洞察力是,优化对没有准确性降级的最低面积不一定是最佳策略。这是因为如果一个人具有较小的模型大小而另一个具有更好的准确性,则无法在两个比特宽度之间决定最佳状态。在这项工作中,我们通过使用宽度乘法器对齐所有到相同的型号大小来迈出一些重量位宽度的第一步是超出他人。在此设置下,有些令人惊讶的是,我们表明,与整个网络的单个位线相比,与混合精度量化相比,可以实现更好的准确性,当两者都具有相同的型号尺寸时归零精度劣化。特别是,我们的结果表明,当通道的数量成为目标封路计时,整个网络中的单重比特宽度显示了模型压缩的卓越结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号