Online Adaptation of Deep Architectures with Reinforcement Learning

机译：在线适应深层建筑与强化学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Online learning has become crucial to many problems in machine learning. As more data is collected sequentially, quickly adapting to changes in the data distribution can offer several competitive advantages such as avoiding loss of prior knowledge and more efficient learning. However, adaptation to changes in the data distribution (also known as covariate shift) needs to be performed without compromising past knowledge already built in into the model to cope with voluminous and dynamic data. In this paper, we propose an online stacked Denoising Autoencoder whose structure is adapted through reinforcement learning. Our algorithm forces the network to exploit and explore favourable architectures employing an estimated utility function that maximises the accuracy of an unseen validation sequence. Different actions, such as Pool, Increment and Merge are available to modify the structure of the network. As we observe through a series of experiments, our approach is more responsive, robust, and principled than its counterparts for non-stationary as well as stationary data distributions. Experimental results indicate that our algorithm performs better at preserving gained prior knowledge and responding to changes in the data distribution.

机译：在线学习对机器学习中的许多问题都对此至关重要。随着更多数据被顺序收集，快速适应数据分布的变化可以提供多种竞争优势，例如避免丧失知识和更有效的学习。然而，需要对数据分布（也称为Covariate Shift）的改变进行适应，而不会损害已经内置于模型中的过去知识以应对大量和动态数据。在本文中，我们提出了一个在线堆叠的去噪自动化器，其结构通过加强学习来调整。我们的算法迫使网络利用并探索采用估计实用程序功能的良好体系结构，以最大化未经验证序列的准确性。不同的操作，例如池，递增和合并，可用于修改网络的结构。当我们观察到一系列实验时，我们的方法比其非静止和静止数据分布的对应物更敏感，强劲和原则。实验结果表明，我们的算法在保留获得的先前知识并响应数据分布的变化时表现更好。

著录项

来源
《European Conference on Artificial Intelligence》|2016年|912p|共9页
会议地点
作者
Thushan Ganegedara; Lionel Ott; Fabio Ramos;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. Intelligent laser welding through representation, prediction, and control learning: An architecture with deep neural networks and reinforcement learning [J] . Guenther Johannes, Pilarski Patrick M., Helfrich Gerhard, Mechatronics: The Science of Intelligent Machines . 2016,第Null期

机译：通过表示，预测和控制学习进行智能激光焊接：具有深度神经网络和强化学习的架构
2. Blind Hexapod Locomotion in Complex Terrain with Gait Adaptation Using Deep Reinforcement Learning and Classification [J] . Azayev Teymur, Zimmerman Karel Journal of Intelligent & Robotic Systems: Theory & Application . 2020,第3a4期

机译：使用深度加强学习和分类，盲目的地形中的盲人六角运动的运动
3. Deep reinforcement learning for automated radiation adaptation in lung cancer [J] . Tseng Huan‐Hsin, Luo Yi, Cui Sunan, Medical Physics . 2017,第12期

机译：肺癌自动辐射适应的深度增强学习
4. Online Adaptation of Deep Architectures with Reinforcement Learning [C] . Thushan Ganegedara, Lionel Ott, Fabio Ramos European Conference on Artificial Intelligence . 2016

机译：在线适应深层建筑与强化学习
5. On Deep Reinforcement Learning for Games: Generalization of Deep Q-Learning with Multiple Policy Heads [D] . Boucher, Mathieu. 2020

机译：关于游戏的深度加固学习：多重政策头部深度Q学的泛化
6. Deep Reinforcement Learning for Automated Radiation Adaptation in Lung Cancer [O] . Huan-Hsin Tseng, Yi Luo, Sunan Cui, -1

机译：深度强化学习用于肺癌的自动辐射适应。
7. Adaptive Online-Learning Volt-Var Control for Smart Inverters Using Deep Reinforcement Learning [O] . Kirstin Beyer, Robert Beckmann, Stefan Geißendörfer, 2021

机译：使用深度加强学习的智能逆变器自适应在线学习电压控制

Online Adaptation of Deep Architectures with Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅