首页> 外国专利> SEQUENTIAL LEARNING OF CONSTRAINTS FOR HIERARCHICAL REINFORCEMENT LEARNING

SEQUENTIAL LEARNING OF CONSTRAINTS FOR HIERARCHICAL REINFORCEMENT LEARNING

机译：分层学习的约束的顺序学习

页面导航

摘要
著录项
相似文献

摘要

A computer-implemented method, computer program product, and computer processing system are provided for Hierarchical Reinforcement Learning (HRL) with a target task. The method includes obtaining, by a processor device, a sequence of tasks based on hierarchical relations between the tasks, the tasks constituting the target task. The method further includes learning, by a processor device, a sequence of constraints corresponding to the sequence of tasks by repeating, for each of the tasks in the sequence, reinforcement learning and supervised learning with a set of good samples and a set of bad samples and by applying an obtained constraint for a current task to a next task.

机译：提供了一种计算机实现的方法，计算机程序产品和计算机处理系统，用于具有目标任务的分层强化学习（HRL）。该方法包括由处理器设备基于任务之间的层次关系获得任务序列，该任务构成目标任务。该方法进一步包括由处理器设备通过对序列中的每个任务重复用一组好的样本和一组不良的样本对强化学习和监督学习来学习与任务序列相对应的约束序列。通过将获得的当前任务的约束应用于下一个任务。

著录项

公开/公告号US2020034704A1

专利类型
公开/公告日2020-01-30

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号US201816048569
发明设计人 DON JOVEN RAVOY AGRAVANTE;GIOVANNI DE DE MAGISTRIS;TU-HOA PHAM;RYUKI TACHIBANA;
展开▼

申请日2018-07-30
分类号G06N3/08;G06N3/04;
国家 US
入库时间 2022-08-21 11:20:07

相似文献

专利
外文文献
中文文献