首页> 外文会议>European Conference on Artificial Intelligence >Online Adaptation of Deep Architectures with Reinforcement Learning
【24h】

Online Adaptation of Deep Architectures with Reinforcement Learning

机译:在线适应深层建筑与强化学习

获取原文

摘要

Online learning has become crucial to many problems in machine learning. As more data is collected sequentially, quickly adapting to changes in the data distribution can offer several competitive advantages such as avoiding loss of prior knowledge and more efficient learning. However, adaptation to changes in the data distribution (also known as covariate shift) needs to be performed without compromising past knowledge already built in into the model to cope with voluminous and dynamic data. In this paper, we propose an online stacked Denoising Autoencoder whose structure is adapted through reinforcement learning. Our algorithm forces the network to exploit and explore favourable architectures employing an estimated utility function that maximises the accuracy of an unseen validation sequence. Different actions, such as Pool, Increment and Merge are available to modify the structure of the network. As we observe through a series of experiments, our approach is more responsive, robust, and principled than its counterparts for non-stationary as well as stationary data distributions. Experimental results indicate that our algorithm performs better at preserving gained prior knowledge and responding to changes in the data distribution.
机译:在线学习对机器学习中的许多问题都对此至关重要。随着更多数据被顺序收集,快速适应数据分布的变化可以提供多种竞争优势,例如避免丧失知识和更有效的学习。然而,需要对数据分布(也称为Covariate Shift)的改变进行适应,而不会损害已经内置于模型中的过去知识以应对大量和动态数据。在本文中,我们提出了一个在线堆叠的去噪自动化器,其结构通过加强学习来调整。我们的算法迫使网络利用并探索采用估计实用程序功能的良好体系结构,以最大化未经验证序列的准确性。不同的操作,例如池,递增和合并,可用于修改网络的结构。当我们观察到一系列实验时,我们的方法比其非静止和静止数据分布的对应物更敏感,强劲和原则。实验结果表明,我们的算法在保留获得的先前知识并响应数据分布的变化时表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号