首页> 外文会议>International Conference on Complexity, Future Information Systems and Risk >Using the Ornstein-Uhlenbeck Process for Random Exploration
【24h】

Using the Ornstein-Uhlenbeck Process for Random Exploration

机译:使用Ornstein-Uhlenbeck进程进行随机探索

获取原文
获取外文期刊封面目录资料

摘要

In model-based Reinforcement Learning, an agent aims to learn a transition model between attainable states. Since the agent initially has zero knowledge of the transition model, it needs to resort to random exploration in order to learn the model. In this work, we demonstrate how the Ornstein-Uhlenbeck process can be used as a sampling scheme to generate exploratory Brownian motion in the absence of a transition model. Whereas current approaches rely on knowledge of the transition model to generate the steps of Brownian motion, the Ornstein-Uhlenbeck process does not. Additionally, the Ornstein-Uhlenbeck process naturally includes a drift term originating from a potential function. We show that this potential can be controlled by the agent itself, and allows executing non-equilibrium behavior such as ballistic motion or local trapping.
机译:在基于模型的强化学习中,代理人旨在学习可达到的状态之间的过渡模型。由于代理商最初对转换模型进行零知识,因此需要采取随机探索以便学习模型。在这项工作中,我们展示了Ornstein-Uhlenbeck进程如何用作采样方案,以在不存在转换模型的情况下产生探索性布朗运动。虽然目前的方法依赖于转换模型的知识来生成布朗运动的步骤,但是ornstein-uhlenbeck过程没有。另外,Ornstein-Uhlenbeck过程自然地包括源自潜在功能的漂移项。我们表明该潜力可以由代理本身控制,并允许执行非平衡行为,例如弹道运动或局部捕获。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号