首页> 外文OA文献 >Generic Online Learning for Partial Visible Dynamic Environment with Delayed Feedback
【2h】

Generic Online Learning for Partial Visible Dynamic Environment with Delayed Feedback

机译:通用在线学习部分可见和动态环境,具有延迟反馈

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Reinforcement learning (RL) has been applied to robotics and many other domains which a system must learn in real-time and interact with a dynamic environment. In most studies the state-action space that is the key part of RL is predefined. Integration of RL with deep learning method has however taken a tremendous leap forward to solve novel challenging problems such as mastering a board game of Go. The surrounding environment to the agent may not be fully visible, the environment can change over time, and the feedbacks that agent receives for its actions can have a fluctuating delay. In this paper, we propose a Generic Online Learning (GOL) system for such environments. GOL is based on RL with a hierarchical structure to form abstract features in time and adapt to the optimal solutions. The proposed method has been applied to load balancing in 5G cloud random access networks. Simulation results show that GOL successfully achieves the system objectives of reducing cache-misses and communication load, while incurring only limited system overhead in terms of number of high-level patterns needed. We believe that the proposed GOL architecture is significant for future online learning of dynamic, partially visible environments, and would be very useful for many autonomous control systems.
机译:强化学习(RL)已应用于机器人和许多系统必须实时学习并与动态环境进行交互的域。在大多数研究中,预定义的状态行动空间是R1的关键部分。然而,利用深度学习方法的RL整合越来越大跃发,以解决新颖的挑战性问题,如掌握棋盘游戏。对代理的周围环境可能无法完全可见,环境可以随时间变化,并且代理接收其动作的反馈可以具有波动的延迟。在本文中,我们提出了一种用于此类环境的通用在线学习(GOL)系统。 GOL基于RL,具有层次结构,以便及时形成抽象特征,并适应最佳解决方案。所提出的方法已应用于5G云随机接入网络中的负载平衡。仿真结果表明,GOL成功实现了减少缓存失误和通信负载的系统目标,同时仅在所需的高级模式的数量方面产生有限的系统开销。我们认为,拟议的GOL架构对于未来的动态,部分可见环境的在线学习具有重要意义,并且对于许多自主控制系统来说非常有用。

著录项

  • 作者

    Behrooz Shahriari;

  • 作者单位
  • 年度 -1
  • 总页数
  • 原文格式 PDF
  • 正文语种
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号