Exploration in Metric State Spaces

Abstract

We present metric-E³, a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows the construction of accurate local models. The algorithm is a generalization of the E³ algorithm of Kearns and Singh, and assumes a black box for approximate planning. Unlike the original E³, metric-E³ finds a near-optimal policy in an amount of time that does not directly depend on the size of the state space, but instead depends on the covering number of the state space. Informally, the covering number is the number of neighborhoods required for accurate local modeling.
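Since the covering number is the quantity that drives the running time, a rough illustration may help. The following minimal Python sketch is not from the paper: the function greedy_cover, the toy state set, and the metric are illustrative assumptions. It greedily builds an ε-cover, and the size of that cover witnesses the covering number at resolution ε.

```python
def greedy_cover(states, dist, eps):
    """Greedily build an eps-cover: a set of centers such that every
    state lies within distance eps of some center. The size of the
    returned cover is one witness for the covering number at
    resolution eps (the true covering number is the minimum size
    over all eps-covers)."""
    centers = []
    for s in states:
        # Only open a new neighborhood if s is not already covered.
        if all(dist(s, c) > eps for c in centers):
            centers.append(s)
    return centers

# Toy example: 101 states on the interval [0, 10] with the usual metric.
states = [i / 10 for i in range(101)]
cover = greedy_cover(states, dist=lambda x, y: abs(x - y), eps=0.5)
print(f"{len(cover)} neighborhoods of radius 0.5 cover the state space")
```

In this toy setting, metric-E³'s guarantees would scale with a quantity like len(cover) rather than len(states), which is what makes the dependence on the covering number, rather than the raw state count, attractive for large or continuous state spaces.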
