Using the Ornstein-Uhlenbeck Process for Random Exploration

Abstract

In model-based Reinforcement Learning, an agent aims to learn a transition model between attainable states. Since the agent initially has zero knowledge of the transition model, it needs to resort to random exploration in order to learn the model. In this work, we demonstrate how the Ornstein-Uhlenbeck process can be used as a sampling scheme to generate exploratory Brownian motion in the absence of a transition model. Whereas current approaches rely on knowledge of the transition model to generate the steps of Brownian motion, the Ornstein-Uhlenbeck process does not. Additionally, the Ornstein-Uhlenbeck process naturally includes a drift term originating from a potential function. We show that this potential can be controlled by the agent itself, and allows executing non-equilibrium behavior such as ballistic motion or local trapping.
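As a rough illustration of the kind of process the abstract refers to, the sketch below simulates a textbook one-dimensional Ornstein-Uhlenbeck process, dx = -theta (x - mu) dt + sigma dW, with an Euler-Maruyama step. This is not the authors' sampling scheme: the function name `simulate_ou`, the parameters `theta`, `mu`, `sigma`, and the quadratic potential U(x) = theta/2 (x - mu)^2 behind the drift term are all assumptions chosen for illustration.

```python
import math
import random


def simulate_ou(theta, mu, sigma, x0, dt, n_steps, seed=0):
    """Euler-Maruyama path of dx = -theta * (x - mu) dt + sigma dW.

    Hypothetical helper for illustration only; the parameter names are
    assumptions and do not come from the paper.
    """
    rng = random.Random(seed)
    path = [x0]
    for _ in range(n_steps):
        x = path[-1]
        # Drift is the negative gradient of the assumed quadratic
        # potential U(x) = theta / 2 * (x - mu) ** 2.
        drift = -theta * (x - mu)
        # A Brownian increment over a step of length dt has variance dt.
        dw = rng.gauss(0.0, math.sqrt(dt))
        path.append(x + drift * dt + sigma * dw)
    return path


# theta = 0 removes the drift, leaving plain (scaled) Brownian motion.
free_path = simulate_ou(theta=0.0, mu=0.0, sigma=1.0, x0=0.0, dt=0.01, n_steps=2000)

# A positive theta pulls the sample back toward mu at every step.
trapped_path = simulate_ou(theta=5.0, mu=0.0, sigma=1.0, x0=0.0, dt=0.01, n_steps=2000)
```

In this simplified setting, a larger `theta` keeps the sample close to `mu`, which loosely corresponds to the local trapping mentioned in the abstract. How the agent shapes the potential in the paper itself is not stated here.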


Bibliographic record

Source
International Conference on Complexity, Future Information Systems and Risk | 2019 | 1 (CD-ROM) | 8 pages
Authors
Johannes Nauta; Yara Khaluf; Pieter Simoens
Original format: PDF
Chinese Library Classification: TP393-53
Keywords
Brownian motion; Exploration; Ornstein-Uhlenbeck process

