首页> 外国专利> SMOOTHED SARSA: REINFORCEMENT LEARNING FOR ROBOT DELIVERY TASKS

SMOOTHED SARSA: REINFORCEMENT LEARNING FOR ROBOT DELIVERY TASKS

机译:SMOOTHED SARSA:机器人交付任务的强化学习

摘要

The present invention provides a method for learning a policy used by a computing system to perform a task, such delivery of one or more objects by the computing system. During a first time interval, the computing system determines a first state, a first action and a first reward value. As the computing system determines different states, actions and reward values during subsequent time intervals, a state description identifying the current sate, the current action, the current reward and a predicted action is stored. Responsive to a variance of a stored state description falling below a threshold value, the stored state description is used to modify one or more weights in the policy associated with the first state.
机译:本发明提供了一种用于学习由计算系统用来执行任务的策略的方法,例如由计算系统传递一个或多个对象。在第一时间间隔期间,计算系统确定第一状态,第一动作和第一奖励值。当计算系统在随后的时间间隔期间确定不同的状态,动作和奖励值时,存储标识当前状态,当前动作,当前奖励和预测动作的状态描述。响应于所存储的状态描述的方差下降到阈值以下,所存储的状态描述用于修改与第一状态相关联的策略中的一个或多个权重。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号