NON-ZERO-SUM GAME SYSTEM FRAMEWORK WITH TRACTABLE NASH EQUILIBRIUM SOLUTION

Abstract

A computer-implemented device and corresponding method are provided for processing a multi-agent system input to form an at least partially optimised output indicative of an action policy. The method comprises receiving the multi-agent system input, the multi-agent system input comprising a definition of a multi-agent system and defining behaviour patterns of a plurality of agents based on system states; receiving an indication of an input system state; performing an iterative machine learning process to estimate a single aggregate function representing the behaviour patterns of the plurality of agents over a set of system states; and iteratively processing the single aggregate function for the input system state to estimate an at least partially optimised set of actions for each of the plurality of agents in the input system state. This may allow policies corresponding to the Nash equilibrium to be learned.
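
The two stages named in the abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes a single system state, two agents, a tabular aggregate function, and a toy reward that forms an exact potential game. All names (`reward`, `fit_aggregate`, `best_response_on_phi`, `is_nash`) and all numeric values are hypothetical and chosen only for illustration.

```python
import itertools
import random

ACTIONS = [0, 1, 2]   # hypothetical action set shared by both agents
N_AGENTS = 2
PRIVATE = [[0.5, 0.0, 0.2], [0.0, 0.3, 0.6]]  # toy per-agent action bonuses


def reward(agent, joint):
    """Toy reward: a shared matching term plus a private term (assumption)."""
    return -abs(joint[0] - joint[1]) + PRIVATE[agent][joint[agent]]


def deviate(joint, agent, action):
    """Joint action in which only `agent` switches to `action`."""
    new = list(joint)
    new[agent] = action
    return tuple(new)


def fit_aggregate(steps=5000, lr=0.05, seed=0):
    """Stage 1: iteratively fit one table phi whose differences match each
    agent's reward change under a unilateral deviation."""
    rng = random.Random(seed)
    phi = {j: 0.0 for j in itertools.product(ACTIONS, repeat=N_AGENTS)}
    for _ in range(steps):
        joint = tuple(rng.choice(ACTIONS) for _ in range(N_AGENTS))
        agent = rng.randrange(N_AGENTS)
        alt = deviate(joint, agent, rng.choice(ACTIONS))
        target = reward(agent, alt) - reward(agent, joint)
        err = (phi[alt] - phi[joint]) - target
        phi[alt] -= lr * err      # gradient step on the squared error
        phi[joint] += lr * err
    return phi


def best_response_on_phi(phi, start, max_sweeps=100):
    """Stage 2: agents take turns improving phi until no one can."""
    joint = start
    for _ in range(max_sweeps):
        changed = False
        for agent in range(N_AGENTS):
            best = max(ACTIONS, key=lambda a: phi[deviate(joint, agent, a)])
            if phi[deviate(joint, agent, best)] > phi[joint] + 1e-9:
                joint = deviate(joint, agent, best)
                changed = True
        if not changed:
            break
    return joint


def is_nash(joint, tol=1e-6):
    """Check against the true rewards: no profitable unilateral deviation."""
    return all(reward(i, deviate(joint, i, a)) <= reward(i, joint) + tol
               for i in range(N_AGENTS) for a in ACTIONS)


phi = fit_aggregate()
joint = best_response_on_phi(phi, start=(0, 2))
print(joint, is_nash(joint))
```

In this toy setting, any joint action at which the fitted function cannot be improved by a single agent is also a Nash equilibrium of the original rewards, which is why optimising one aggregate function can stand in for solving each agent's problem separately. A system with many states would need a learned function approximator in place of the table; that extension is not shown here.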

Bibliographic data

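  • Publication number: US2022147847A1

  • Patent type:

  • Publication date: 2022-05-12

  • Original format: PDF

  • Applicant/Assignee: HUAWEI TECHNOLOGIES CO. LTD.

  • Application number: US202217568493

  • Inventors: DAVID MGUNI; YAODONG YANG

  • Filing date: 2022-01-04

  • Classification: G06N5/04; G06N20; G06N7

  • Country: US

  • Date indexed: 2022-08-25 00:56:45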
