SYSTEMS AND METHODS FOR RISK-SENSITIVE REINFORCEMENT LEARNING

Abstract

Systems and methods for risk-sensitive reinforcement learning are disclosed. In one embodiment, a method for training a risk-sensitive reinforcement learning policy may include: (1) receiving, from a data source, a plurality of sets of training data for a plurality of time steps; (2) receiving a training budget comprising a plurality of episodes, a risk aversion coefficient, and an end state; and (3) calculating a correction factor using the training data. The correction factor may minimize stochasticity based on the risk aversion coefficient.
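
The abstract does not say how the correction factor is computed. As a purely illustrative sketch, one common textbook way to make a return estimate risk-sensitive is a mean-variance penalty, where the average episode return is reduced by the risk aversion coefficient times the variance of the returns. The function name `risk_adjusted_return`, the sample data, and the mean-variance form itself are assumptions made for illustration and are not taken from the patent.

```python
from statistics import mean, pvariance


def risk_adjusted_return(episode_returns, risk_aversion):
    """Score a set of episode returns as mean minus a variance penalty.

    Hypothetical helper: the mean-variance form is an assumption,
    not the correction factor defined in the patent.
    """
    if not episode_returns:
        raise ValueError("episode_returns must not be empty")
    avg = mean(episode_returns)
    # Population variance of the returns measures their stochasticity.
    var = pvariance(episode_returns, mu=avg)
    # Assumed correction: a larger risk aversion coefficient penalizes variance more.
    correction = risk_aversion * var
    return avg - correction


# Two made-up return samples with the same mean but different spread.
steady = [1.0, 1.0, 1.0, 1.0]
volatile = [4.0, -2.0, 4.0, -2.0]

print(risk_adjusted_return(steady, 0.5))
print(risk_adjusted_return(volatile, 0.5))
```

With a risk aversion coefficient of zero the two samples score the same, since only the mean matters. Any positive coefficient ranks the steady sample above the volatile one, which is the qualitative behavior the abstract attributes to the correction factor.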