IEEE International Conference on Information and Automation

Distilling deep neural networks with reinforcement learning

Abstract

Deep architectures improve the performance of neural networks but increase their computational complexity, and compressing networks is the key to addressing this problem. The Knowledge Distilling (KD) framework compresses cumbersome networks well: it improves on mimic learning, allowing knowledge to be transferred from a cumbersome network to a compressed network without constraints on their architectures. Inspired by AlphaGo Zero, this paper proposes an algorithm that combines KD with reinforcement learning to compress networks on changing datasets. In this algorithm, the compressed network interacts with an environment built from KD to produce datasets that are appropriate for the model. The Monte Carlo Tree Search (MCTS) of AlphaGo Zero is used to produce these datasets by trading off between the predictions of the compressed network and the knowledge. In experiments training ResNet on CIFAR datasets, with mean squared error as the objective function, the algorithm proved effective at compressing networks.
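Below is a minimal sketch, in PyTorch (the abstract does not name a framework), of the plain distillation step the abstract builds on: a compressed student network is trained to mimic a cumbersome teacher's outputs on CIFAR-10 with mean squared error as the objective. The teacher/student pair (resnet34/resnet18), the optimizer, and all hyperparameters are illustrative assumptions; the paper's MCTS-driven generation of training data is not reproduced here.

```python
# A minimal sketch (not the authors' code) of the knowledge-distilling step the
# abstract describes: a compressed "student" is trained to match the logits of a
# cumbersome "teacher" under a mean-squared-error objective on CIFAR-10.
import torch
import torch.nn as nn
import torchvision
import torchvision.transforms as T

device = "cuda" if torch.cuda.is_available() else "cpu"

# Placeholder models: any pair with matching output dimension works, e.g. a
# deeper ResNet teacher and a shallower compressed student.
teacher = torchvision.models.resnet34(num_classes=10).to(device).eval()
student = torchvision.models.resnet18(num_classes=10).to(device)

loader = torch.utils.data.DataLoader(
    torchvision.datasets.CIFAR10(root="./data", train=True, download=True,
                                 transform=T.ToTensor()),
    batch_size=128, shuffle=True)

mse = nn.MSELoss()                      # objective reported in the experiments
optimizer = torch.optim.SGD(student.parameters(), lr=0.01, momentum=0.9)

for images, _ in loader:                # labels unused: knowledge comes from the teacher
    images = images.to(device)
    with torch.no_grad():
        knowledge = teacher(images)     # teacher logits = "knowledge" to transfer
    loss = mse(student(images), knowledge)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

In the paper's algorithm, the fixed CIFAR loader above is replaced by data produced through the compressed network's interaction with the KD environment, with MCTS balancing the student's own predictions against the teacher's knowledge.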
