首页> 外国专利> SEQUENTIAL LEARNING OF CONSTRAINTS FOR HIERARCHICAL REINFORCEMENT LEARNING

SEQUENTIAL LEARNING OF CONSTRAINTS FOR HIERARCHICAL REINFORCEMENT LEARNING

机译:分层学习的约束的顺序学习

摘要

A computer-implemented method, computer program product, and computer processing system are provided for Hierarchical Reinforcement Learning (HRL) with a target task. The method includes obtaining, by a processor device, a sequence of tasks based on hierarchical relations between the tasks, the tasks constituting the target task. The method further includes learning, by a processor device, a sequence of constraints corresponding to the sequence of tasks by repeating, for each of the tasks in the sequence, reinforcement learning and supervised learning with a set of good samples and a set of bad samples and by applying an obtained constraint for a current task to a next task.
机译:提供了一种计算机实现的方法,计算机程序产品和计算机处理系统,用于具有目标任务的分层强化学习(HRL)。该方法包括由处理器设备基于任务之间的层次关系获得任务序列,该任务构成目标任务。该方法进一步包括由处理器设备通过对序列中的每个任务重复用一组好的样本和一组不良的样本对强化学习和监督学习来学习与任务序列相对应的约束序列。通过将获得的当前任务的约束应用于下一个任务。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号