首页> 外国专利> A NON-ZERO-SUM GAME SYSTEM FRAMEWORK WITH TRACTABLE NASH EQUILIBRIUM SOLUTION

A NON-ZERO-SUM GAME SYSTEM FRAMEWORK WITH TRACTABLE NASH EQUILIBRIUM SOLUTION

机译:具有易纳什均衡解决方案的非零游戏系统框架

摘要

Described is a computer-implemented device (1200) and method (1000) for processing a multi-agent system input to form an at least partially optimised output indicative of an action policy. The method (1000) comprises receiving (1001) the multi-agent system input, the multi-agent system input comprising a definition of a multi-agent system and defining behaviour patterns of a plurality of agents based on system states; receiving (1002) an indication of an input system state; performing (1003) an iterative machine learning process to estimate a single aggregate function representing the behaviour patterns of the plurality of agents over a set of system states; and iteratively processing (1004) the single aggregate function for the input system state to estimate an at least partially optimised set of actions for each of the plurality of agents in the input system state. This may allow policies corresponding to the Nash equilibrium to be learned.
机译:描述是一种计算机实现的设备(1200)和方法(1000),用于处理多代理系统输入以形成指示动作策略的至少部分优化的输出。 该方法(1000)包括接收(1001)多代理系统输入,包括基于系统状态的多种子体系统的定义和定义多个代理的行为模式的多代理系统输入; 接收(1002)输入系统状态的指示; 执行(1003)迭代机器学习过程,以估计表示在一组系统状态下的多个代理的行为模式的单个聚合函数; 并且迭代地处理(1004)输入系统状态的单个聚合函数,以估计输入系统状态中的多个代理中的每一个的至少部分优化的一组动作。 这可能允许与纳什均衡对应的策略学习。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号