AUTOMATED EXPLAINER OF REINFORCEMENT LEARNING ACTIONS USING OCCUPATION MEASURES

Abstract

Automatic identification of features that drive a reinforcement learning model to recommend an action of interest. The identification is based on a calculation of occupation measures of state-action pairs associated with the reinforcement learning model. High occupation measures of certain state-action pairs indicate that the states of these pairs likely include the sought-after features.
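
The abstract does not say how the occupation measures are calculated. As a minimal sketch only, the Python below uses one common definition, the discounted occupation measure of a fixed policy, mu(s, a) = (1 - gamma) * sum over t of gamma^t * Pr(s_t = s, a_t = a), and ranks states by that value for an action of interest. The toy transition table `P`, the policy, the discount `gamma`, and the function names `occupation_measures` and `top_states_for_action` are all hypothetical and are not taken from the patent.

```python
def occupation_measures(P, policy, start, gamma=0.9, horizon=500):
    """Discounted occupation measure mu[s][a] of a fixed policy.

    P[s][a][s2]  : transition probability (hypothetical toy model)
    policy[s][a] : probability that the policy picks action a in state s
    start[s]     : initial state distribution
    """
    n_states, n_actions = len(P), len(P[0])
    mu = [[0.0] * n_actions for _ in range(n_states)]
    d = list(start)        # state distribution at time t
    weight = 1.0 - gamma   # equals (1 - gamma) * gamma**t at step t
    for _ in range(horizon):
        nxt = [0.0] * n_states
        for s in range(n_states):
            for a in range(n_actions):
                p_sa = d[s] * policy[s][a]   # Pr(s_t = s, a_t = a)
                mu[s][a] += weight * p_sa
                for s2 in range(n_states):
                    nxt[s2] += p_sa * P[s][a][s2]
        d = nxt
        weight *= gamma
    return mu


def top_states_for_action(mu, action, k=2):
    """States ranked by occupation measure for the action of interest."""
    ranked = sorted(range(len(mu)), key=lambda s: mu[s][action], reverse=True)
    return ranked[:k]


# Toy problem (assumed): 3 states, 2 actions.
P = [
    [[0.1, 0.9, 0.0], [1.0, 0.0, 0.0]],   # transitions from state 0
    [[1.0, 0.0, 0.0], [0.0, 0.5, 0.5]],   # transitions from state 1
    [[1.0, 0.0, 0.0], [0.5, 0.0, 0.5]],   # transitions from state 2
]
# Deterministic policy: state 0 takes action 0; states 1 and 2 take action 1.
policy = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
start = [1.0, 0.0, 0.0]

mu = occupation_measures(P, policy, start)
candidates = top_states_for_action(mu, action=1)
```

Under these assumptions, `candidates` holds the states whose pairing with action 1 carries the most occupation mass. In the abstract's terms, those are the states most likely to contain the features that drive the recommendation.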

Bibliographic details

Publication number: US2021073674A1

Patent type:
Publication date: 2021-03-11

Original format: PDF
Applicant/assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Application number: US201916566907
Inventors: ALEXANDER ZADOROJNIY; MICHAEL MASIN

Filing date: 2019-09-11
Classification: G06N20; G06N7
Country: US
Date added to database: 2022-08-24 17:38:16
