首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Rethinking Differentiable Search for Mixed-Precision Neural Networks
【24h】

Rethinking Differentiable Search for Mixed-Precision Neural Networks

机译:对混合精度神经网络的可微搜索的重新思考

获取原文

摘要

Low-precision networks, with weights and activations quantized to low bit-width, are widely used to accelerate inference on edge devices. However, current solutions are uniform, using identical bit-width for all filters. This fails to account for the different sensitivities of different filters and is suboptimal. Mixed-precision networks address this problem, by tuning the bit-width to individual filter requirements. In this work, the problem of optimal mixed-precision network search (MPS) is considered. To circumvent its difficulties of discrete search space and combinatorial optimization, a new differentiable search architecture is proposed, with several novel contributions to advance the efficiency by leveraging the unique properties of the MPS problem. The resulting Efficient differentiable MIxed-Precision network Search (EdMIPS) method is effective at finding the optimal bit allocation for multiple popular networks, and can search a large model, e.g. Inception-V3, directly on ImageNet without proxy task in a reasonable amount of time. The learned mixed-precision networks significantly outperform their uniform counterparts.
机译:权重和激活量化为低位宽度的低精度网络被广泛用于加速边缘设备的推理。但是,当前的解决方案是统一的,所有滤波器都使用相同的位宽。这不能解释不同过滤器的不同灵敏度,并且不是最佳的。混合精度网络通过将位宽调整为单独的滤波器要求来解决此问题。在这项工作中,考虑了最佳混合精度网络搜索(MPS)问题。为了解决其离散搜索空间和组合优化的难题,提出了一种新的可微分搜索体系结构,它通过利用MPS问题的独特性质,为提高效率提供了许多新颖的贡献。所产生的有效的微分混合精度网络搜索(EdMIPS)方法可有效地找到多个流行网络的最佳比特分配,并且可以搜索大型模型,例如在合理的时间内直接在ImageNet上直接运行Inception-V3,而无需执行代理任务。博学的混合精度网络明显优于统一的网络。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号