Journal: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
RTP-Q: a reinforcement learning system with time constraints exploration planning for accelerating the learning rate


Abstract

Reinforcement learning is an efficient method for solving Markov Decision Processes, in which an agent improves its performance using a scalar reward value and gains a higher capability for reactive and adaptive behaviors. Q-learning is a representative reinforcement learning method that is guaranteed to obtain an optimal policy but needs numerous trials to achieve it. The k-Certainty Exploration Learning System realizes active exploration [...] separated into two phases, and estimated values are not derived during the process of identifying the environment.
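
To make concrete what "numerous trials" means for Q-learning, here is a minimal sketch of the standard tabular Q-learning update. It is not the RTP-Q system or the k-Certainty Exploration Learning System from the paper. The five-state chain environment, the function names `q_learning` and `greedy_policy`, and all parameter values (`alpha`, `gamma`, `epsilon`, the episode count) are assumptions chosen purely for illustration.

```python
import random


def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9,
               epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain (illustrative only).

    Actions: 0 = move left, 1 = move right. The agent starts in state 0
    and receives a scalar reward of 1.0 on reaching the last state.
    """
    rng = random.Random(seed)
    goal = n_states - 1
    # q[s][a] is the estimated value of taking action a in state s.
    q = [[0.0, 0.0] for _ in range(n_states)]

    for _ in range(episodes):
        s = 0
        while s != goal:
            # Epsilon-greedy action selection; ties are broken at random.
            if rng.random() < epsilon or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1

            # Deterministic transition; the left wall keeps the agent in state 0.
            s_next = max(0, s - 1) if a == 0 else s + 1
            r = 1.0 if s_next == goal else 0.0

            # Q-learning update. q[goal] is never updated, so it stays
            # at zero and the goal acts as a terminal state.
            target = r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
    return q


def greedy_policy(q):
    """Greedy action for each non-terminal state."""
    return [0 if left > right else 1 for left, right in q[:-1]]
```

In this sketch the value estimates only settle after many repeated episodes, even on a five-state problem, which illustrates the slow learning rate that the abstract attributes to plain Q-learning.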
