机译:终身加固学习中的零射精政策生成
Univ Chinese Acad Sci UCAS Sch Artificial Intelligence Beijing 100049 Peoples R China|Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China;
Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China|Meituan Beijing Peoples R China;
Univ Chinese Acad Sci UCAS Sch Artificial Intelligence Beijing 100049 Peoples R China|Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China|Chinese Acad Sci CAS Ctr Excellence Brain Sci & Intelligence Techn Shanghai 200031 Peoples R China;
Lifelong reinforcement learning; Generalization policy; Task domain;
机译:终身加固学习中的政策和价值转移
机译:欧洲联盟终身学习政策:增强竞争力与增强社会稳定之间
机译:欧洲联盟终身学习政策:增强竞争力与增强社会稳定之间
机译:基于鲁棒控制的自动驾驶零散深强化学习驾驶策略传递
机译:不确定系统的最优跟踪控制:基于策略和基于策略的强化学习方法
机译:利用等级强化学习的多意图对话的情感对话策略学习
机译:基于鲁棒控制的自主车辆零击零钢筋学习驾驶政策转移