MAKING A FAILURE SCENARIO USING ADVERSARIAL REINFORCEMENT LEARNING

Abstract

Failure scenarios are made using adversarial reinforcement learning by storing, in a first storage, a plurality of first experiences of failures of a player agent due to an adversarial agent, and by performing a simulation of an environment that includes the player agent and the adversarial agent. The method also includes calculating a similarity between a second experience of a failure of the player agent in the simulation and each of the plurality of first experiences in the first storage, and updating the first storage by adding the second experience as a new first experience in response to the similarity being less than a threshold. Additionally, the adversarial agent can be trained using at least one of the plurality of first experiences in the first storage, to generate an adversarial agent having diverse experiences.
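The storage-update rule described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation. It assumes each failure experience is encoded as a fixed-length feature vector, uses cosine similarity as the similarity measure, and treats a new experience as novel only when its similarity to every stored experience is below the threshold. The names `FailureStorage`, `add_if_novel`, and `sample` are hypothetical, and the adversarial agent's training update itself is left out.

```python
import math
import random
from typing import List, Sequence

# Assumption: a failure experience is a fixed-length feature vector.
Experience = Sequence[float]


def cosine_similarity(a: Experience, b: Experience) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


class FailureStorage:
    """Hypothetical 'first storage' that keeps only dissimilar failures."""

    def __init__(self, threshold: float) -> None:
        self.threshold = threshold
        self.experiences: List[List[float]] = []

    def add_if_novel(self, candidate: Experience) -> bool:
        """Store the candidate if it is unlike every stored experience.

        Returns True when the candidate was added. An empty storage
        accepts any candidate, because all() over no items is True.
        """
        is_novel = all(
            cosine_similarity(candidate, stored) < self.threshold
            for stored in self.experiences
        )
        if is_novel:
            self.experiences.append(list(candidate))
        return is_novel

    def sample(self, k: int, rng: random.Random) -> List[List[float]]:
        """Draw up to k stored failures, e.g. to train the adversary."""
        return rng.sample(self.experiences, min(k, len(self.experiences)))


if __name__ == "__main__":
    storage = FailureStorage(threshold=0.9)
    # Stand-ins for failures observed in successive simulation runs.
    for failure in ([1.0, 0.0], [0.99, 0.05], [0.0, 1.0]):
        added = storage.add_if_novel(failure)
        print(failure, "added" if added else "skipped (too similar)")
    batch = storage.sample(2, random.Random(0))
    print("training batch size:", len(batch))
```

Gating on novelty keeps the storage from filling with near-duplicate failures, which is what lets the adversarial agent be trained on diverse experiences. Other similarity measures or experience encodings would fit the same structure.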

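Bibliographic data

  • Publication number: US2021064915A1
  • Patent type: (not stated)
  • Publication date: 2021-03-04
  • Original format: PDF
  • Applicant/Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
  • Application number: US201916557453
  • Inventor: AKIFUMI WACHI
  • Filing date: 2019-08-30
  • Classification: G06K9/62; G06N20; G05D1
  • Country: US
  • Date added to database: 2022-08-24 17:29:54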
