首页>
外国专利>
SAFE AND FAST EXPLORATION FOR REINFORCEMENT LEARNING USING CONSTRAINED ACTION MANIFOLDS
SAFE AND FAST EXPLORATION FOR REINFORCEMENT LEARNING USING CONSTRAINED ACTION MANIFOLDS
展开▼
机译:使用约束动作集的安全性和快速探索,用于强化学习
展开▼
页面导航
摘要
著录项
相似文献
摘要
According to an aspect of the present invention, a computer-implemented method is provided for reinforcement learning. The method includes reading, by a processor device, an action manifold which is described as a n-polytope, at least one physical action limit, and at least one safety constraint. The method further includes updating, by the processor device, the action manifold based on the at least one physical action limit and the at least one safety constraint. The method also includes performing, by the processor device, the reinforcement learning by selecting a constrained action from among a set of constrained actions in the action manifold.
展开▼