RTP-Q: a reinforcement learning system with time constraints exploration planning for accelerating the learning rate

Gang Zhao; Shoji Tatsumi; Ruoying Sun

首页> 外文期刊>IEICE Transactions on fundamentals of electronics, communications & computer sciences >RTP-Q: a reinforcement learning system with time constraints exploration planning for accelerating the learning rate

【24h】

RTP-Q: a reinforcement learning system with time constraints exploration planning for accelerating the learning rate

机译：RTP-Q: a reinforcement learning system with time constraints exploration planning for accelerating the learning rate

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相关主题

摘要

Reinforcement learning is an efficient method for solving MarkovDecision Processes that an agent improves its performance by usingscalar reward value with higher capa- bility of reactive and adaptivebehaviors. Q-learning is repre- sentative reinforcement learningmethod which is guaranteed to obtain an optimal policy needs numeroustrials to achieve it. k-Certainty Exploration Learning Systemrealizes active ex- rated into two phases and estimate values are notderived during the process of identifying the environment.

著录项

来源
《IEICE Transactions on fundamentals of electronics, communications & computer sciences 》 |1999年第10期| 2266-2273| 共8页
作者
Gang Zhao; Shoji Tatsumi; Ruoying Sun;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类无线电电子学、电信技术 ;
关键词
reinforcement learning; planning; reacting;

RTP-Q: a reinforcement learning system with time constraints exploration planning for accelerating the learning rate

摘要

著录项

相关主题

期刊订阅