Generic Online Learning for Partial Visible Dynamic Environment with Delayed Feedback

Abstract

Reinforcement learning (RL) has been applied to robotics and many other domains in which a system must learn in real time and interact with a dynamic environment. In most studies, the state-action space, which is the key part of RL, is predefined. The integration of RL with deep learning methods has, however, taken a tremendous leap forward in solving novel, challenging problems such as mastering the board game Go. The environment surrounding the agent may not be fully visible, the environment can change over time, and the feedback that the agent receives for its actions can arrive with a fluctuating delay. In this paper, we propose a Generic Online Learning (GOL) system for such environments. GOL is based on RL with a hierarchical structure that forms abstract features over time and adapts to the optimal solutions. The proposed method has been applied to load balancing in 5G cloud random access networks. Simulation results show that GOL achieves the system objectives of reducing cache misses and communication load, while incurring only limited system overhead in terms of the number of high-level patterns needed. We believe that the proposed GOL architecture is significant for future online learning in dynamic, partially visible environments, and would be very useful for many autonomous control systems.
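The abstract does not describe how GOL handles delayed feedback, so the following is only a minimal illustrative sketch of the general problem, not the GOL architecture. It shows plain tabular Q-learning in which each update is deferred until the reward for that action arrives. The class `DelayedFeedbackQLearner`, the helper `run_demo`, the single-state toy environment, and the delay range of 1 to 5 steps are all hypothetical choices made for illustration.

```python
import random
from collections import defaultdict


class DelayedFeedbackQLearner:
    """Tabular Q-learning that defers each update until its reward arrives.

    Illustrative only: this is NOT the GOL system from the abstract.
    """

    def __init__(self, actions, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)   # (state, action) -> value estimate
        self.pending = {}             # ticket -> (state, action, next_state)
        self.next_ticket = 0
        self.rng = random.Random(seed)

    def act(self, state):
        """Epsilon-greedy action selection."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def record(self, state, action, next_state):
        """Remember a transition whose reward is not known yet."""
        ticket = self.next_ticket
        self.next_ticket += 1
        self.pending[ticket] = (state, action, next_state)
        return ticket

    def feedback(self, ticket, reward):
        """Apply the Q-learning update once the delayed reward arrives."""
        state, action, next_state = self.pending.pop(ticket)
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


def run_demo(steps=500, seed=0):
    """Hypothetical one-state environment: action 1 pays 1.0, action 0 pays 0.0.

    Each reward is delivered after a random delay of 1 to 5 steps.
    """
    rng = random.Random(seed)
    agent = DelayedFeedbackQLearner(actions=[0, 1], seed=seed)
    in_flight = []  # (due_step, ticket, reward)
    for t in range(steps):
        action = agent.act("s")
        ticket = agent.record("s", action, "s")
        reward = 1.0 if action == 1 else 0.0
        in_flight.append((t + rng.randint(1, 5), ticket, reward))

        # Deliver every reward whose delay has elapsed; keep the rest waiting.
        still_waiting = []
        for due, tk, r in in_flight:
            if due <= t:
                agent.feedback(tk, r)
            else:
                still_waiting.append((due, tk, r))
        in_flight = still_waiting
    return agent


if __name__ == "__main__":
    agent = run_demo()
    print("Q(s, 0) =", agent.q[("s", 0)])
    print("Q(s, 1) =", agent.q[("s", 1)])
```

The ticket returned by `record` ties each late reward back to the transition that produced it, so rewards may arrive out of order without being credited to the wrong action. Partial observability and a changing environment, which the abstract also names, are outside the scope of this sketch.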

Bibliographic record

Author: Behrooz Shahriari
Original format: PDF

Similar literature

1. Delaying elaborated feedback within computer-based learning environments: The role of summative and question-based feedback [J]. Candel Carmen, Manez Ignacio, Cerdan Raquel. Journal of Computer Assisted Learning, 2021, No. 4.
2. Global Stabilization of Uncertain SISO Dynamical Systems Using a Multiple Delayed Partial State Feedback Sliding Mode Control [J]. Soni Sandeep, Kamal Shyam, Yu Xinghuo. IEEE Transactions on Circuits and Systems II: Express Briefs, 2020, No. 7.
3. Online Learning with Local Permutations and Delayed Feedback [J]. Ohad Shamir, Liran Szlak. JMLR: Workshop and Conference Proceedings, 2017, No. 2009.
4. Online learning classifiers in dynamic environments with incomplete feedback [C]. Behdad Mohammad, French Tim. IEEE Congress on Evolutionary Computation, 2013.
5. The Effect of Delayed Feedback and Visual Hints Within a Gaming Environment to Facilitate Achievement of Different Learning Objectives [D]. Zeglen, Eric. 2015.
6. Online Visual Feedback during Error-Free Channel Trials Leads to Active Unlearning of Movement Dynamics: Evidence for Adaptation to Trajectory Prediction Errors [O]. Angel Lago-Rodriguez, R. Chris Miall. 2016.