首页> 外国专利> AUTOMATED EXPLAINER OF REINFORCEMENT LEARNING ACTIONS USING OCCUPATION MEASURES

AUTOMATED EXPLAINER OF REINFORCEMENT LEARNING ACTIONS USING OCCUPATION MEASURES

机译:使用职业措施自动解释器强化学习行动

摘要

Automatic identification of features that drive a reinforcement learning model to recommend an action of interest. The identification is based on a calculation of occupation measures of state-action pairs associated with the reinforcement learning model. High occupation measures of certain action-state pairs indicate that the states of these pairs likely include the sought-after features.
机译:自动识别驱动加强学习模型以推荐感兴趣的行为。该识别基于与加强学习模型相关的状态动作对的占用测量的计算。某些动作状态对的高占用措施表明这些对的状态可能包括追捧的特征。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号