Markov decision processes are the most popular stochastic sequential decision processes in reinforcement learning, used to represent the framework of interaction between an agent and an environment. A Markov decision process is frequently assumed to be stationary and ergodic, but most stochastic sequential decision processes arising in reinforcement learning are, in fact, not necessarily Markovian, stationary, or ergodic. In this paper, we show that an information-spectrum property plays an important role in return maximization in processes more general than stationary and ergodic Markov decision processes. We also present a class of stochastic sequential decision processes satisfying a necessary condition for return maximization, and we provide several examples of best sequences, in the sense of return maximization, within this class.