A Unified Analysis of Value-Function-Based Reinforcement-Learning Algorithms

Csaba Szepesvári; Michael L. Littman


Abstract

Reinforcement learning is the problem of generating optimal behavior in a sequential decision-making environment given the opportunity of interacting with it. Many algorithms for solving reinforcement-learning problems work by computing improved estimates of the optimal value function. We extend prior analyses of reinforcement-learning algorithms and present a powerful new theorem that can provide a unified analysis of such value-function-based reinforcement-learning algorithms.

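Bibliographic record

Source: Neural Computation, 1999, no. 8, pp. 2017-2060 (44 pages)
Authors: Csaba Szepesvári; Michael L. Littman
Indexed in: Science Citation Index (SCI); Chemical Abstracts (CA)
Original format: PDF
Language: English
Chinese Library Classification: Artificial intelligence theory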

Similar literature

1. A Unified Analysis of Value-Function-Based Reinforcement-Learning Algorithms [J]. Szepesvári C, Littman M. Neural Computation. 1999, no. 8.
2. Unified Algorithms for Online Learning and Competitive Analysis [J]. Buchbinder Niv, Chen Shahar, Naor Joseph (Seffi). Mathematics of Operations Research. 2016, no. 2.
3. A Unified Framework of Online Learning Algorithms for Training Recurrent Neural Networks [J]. Owen Marschall, Kyunghyun Cho, Cristina Savin. Journal of Machine Learning Research. 2020, no. a.
4. Bayesian ying-yang system and theory as a unified statistical learning approach: (III) models and algorithms for dependence reduction, data dimension reduction, ICA and supervised learning [C]. Lei Xu. International workshop on theoretical aspects of neural computation: A multidisciplinary perspective. 1998.
5. Submodular Optimization and Machine Learning: Theoretical Results, Unifying and Scalable Algorithms, and Applications [D]. Iyer, Rishabh. 2015.
6. Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM) [O]. Gregory R. Grant, Michael H. Farkas, Angel D. Pizarro.
7. A Unified Analysis of Value-Function-Based Reinforcement-Learning Algorithms [O]. Csaba Szepesvári, Michael L. Littman. 1998.
