首页> 外国专利> AUTOMATED REINFORCEMENT-LEARNING-BASED APPLICATION MANAGER THAT LEARNS AND IMPROVES A REWARD FUNCTION

AUTOMATED REINFORCEMENT-LEARNING-BASED APPLICATION MANAGER THAT LEARNS AND IMPROVES A REWARD FUNCTION

机译：基于自动学习的学习管理和奖励功能的应用程序管理器

页面导航

摘要
著录项
相似文献

摘要

The current document is directed to automated reinforcement-learning-based application managers that learn and improve the reward function that steers reinforcement-learning-based systems towards optimal or near-optimal policies. Initially, when the automated reinforcement-learning-based application manager is first installed and launched, the automated reinforcement-learning-based application manager may rely on human-application-manager action inputs and resulting state/action trajectories to accumulate sufficient information to generate an initial reward function. During subsequent operation, when it is determined that the automated reinforcement-learning-based application manager is no longer following a policy consistent with the type of management desired by human application managers, the automated reinforcement-learning-based application manager may use accumulated trajectories to improve the reward function.

机译：当前文档针对基于增强学习的自动化应用程序管理器，该应用程序管理器可以学习和改进奖励功能，从而使基于增强学习的系统转向最佳或接近最优的策略。最初，当首次安装和启动基于自动增强学习的应用程序管理器时，基于自动增强学习的应用程序管理器可能会依赖于人类应用程序管理器的动作输入和结果状态/动作轨迹来积累足够的信息以生成一个初始奖励功能。在后续操作期间，当确定基于自动增强学习的应用程序管理器不再遵循与人类应用程序管理器期望的管理类型一致的策略时，基于自动增强学习的应用程序管理器可以使用累积的轨迹来完善奖励功能。

著录项

公开/公告号US2020065157A1

专利类型
公开/公告日2020-02-27

原文格式PDF
申请/专利权人 VMWARE INC.;
展开▼

申请/专利号US201916518763
发明设计人 DEV NAG;YANISLAV YANKOV;DONGNI WANG;GREGORY T. BURK;NICHOLAS MARK GRANT STEPHEN;
展开▼

申请日2019-07-22
分类号G06F9/50;G06N20;G06N3/02;G06F17/16;
国家 US
入库时间 2022-08-21 11:20:51

相似文献

专利
外文文献
中文文献