首页> 外国专利> Reinforcement learning model and model weight reduction and optimization method for esports strategy optimization

Reinforcement learning model and model weight reduction and optimization method for esports strategy optimization

机译:电子竞技策略优化的强化学习模型和模型减重优化方法

摘要

It relates to a method of optimizing an e-sports strategy for e-sports education, and a reinforcement learning model for strategy analysis and a method for lightening and optimizing the model are provided, and in this case, a reinforcement learning model algorithm for real-time game analysis is provided. In the Observation process of the reinforcement learning process, real-time observations are acquired from e-sports matches, the acquired observations are processed by the Deep RL Agent, and the input values of the Deep Neural Network are generated as a single batch. And the state value s(t) from the observation value, and the state value s(t) come out as a(t) action value through the inference process. ) to generate a value. The generated state, action, and reward values are stored in the Experience Buffer to minimize external memory access, and the Policy Network updates the weights of the Deep Neural Network and receives input values in Multi Batch In order to accelerate the operation, all environments are allocated and calculated in parallel in the simulator.
机译:本发明涉及一种用于电子体育教育的电子体育策略优化方法,提供了一种用于策略分析的强化学习模型以及简化和优化该模型的方法,在这种情况下,提供了一种用于实时游戏分析的强化学习模型算法。在强化学习过程的观察过程中,从电子竞技比赛中获取实时观察结果,由Deep RL Agent处理获得的观察结果,并将Deep神经网络的输入值作为单个批次生成。状态值s(t)来自观察值,状态值s(t)通过推理过程作为(t)动作值出来。)产生价值。生成的状态、动作和奖励值存储在经验缓冲区中,以最小化外部内存访问,策略网络更新深度神经网络的权重并多批次接收输入值,以加速操作,所有环境在模拟器中并行分配和计算。

著录项

  • 公开/公告号KR20220027624A

    专利类型

  • 公开/公告日2022-03-08

    原文格式PDF

  • 申请/专利权人 (주)에이엄;

    申请/专利号KR1020200108718

  • 发明设计人 김민서;이용수;

    申请日2020-08-27

  • 分类号G06N3/08;G06N3/04;

  • 国家 KR

  • 入库时间 2022-08-25 00:05:45

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号