
A Multi-Agent Deep Reinforcement Learning Method, System, and Application


Abstract

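The invention discloses a multi-agent deep reinforcement learning algorithm based on partitioned experience and multi-threaded interaction. First, the algorithm uses experience replay with a partitioned buffer: it divides the reward space to distinguish positive, negative, and neutral experiences, and draws these experiences during training by stratified random sampling. Second, the algorithm uses multi-threaded interaction to speed up the agent's trial-and-error process with the environment: multiple clones of the agent learn in parallel, and their learning experience is combined to train the parameters of the network model. Advantages: the proposed algorithm, based on buffer replay and multi-threaded interaction, brings the strengths of the partitioned experience buffer and of multi-threaded interaction into multi-agent deep reinforcement learning. It outperforms existing models in both convergence speed and training efficiency, has higher usability in multi-agent environments, and can be used to solve the problem of multiple agents cooperatively tracking a target.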
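The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class and function names, the reward thresholds `low` and `high`, the proportional allocation used for stratified sampling, and the toy random rewards standing in for a real environment are all assumptions made for illustration.

```python
import math
import random
import threading
from collections import deque


class PartitionedReplayBuffer:
    """Replay buffer split into positive, negative, and neutral partitions."""

    def __init__(self, capacity_per_partition=1000, low=-0.1, high=0.1):
        # `low` and `high` are assumed reward thresholds, not values from the patent.
        self.low, self.high = low, high
        self.partitions = {
            name: deque(maxlen=capacity_per_partition)
            for name in ("positive", "negative", "neutral")
        }
        self.lock = threading.Lock()  # lets several worker threads add safely

    def partition_of(self, reward):
        """Map a reward to the name of the partition that stores it."""
        if reward > self.high:
            return "positive"
        if reward < self.low:
            return "negative"
        return "neutral"

    def add(self, transition):
        # transition = (state, action, reward, next_state, done)
        reward = transition[2]
        with self.lock:
            self.partitions[self.partition_of(reward)].append(transition)

    def sample(self, batch_size, rng=random):
        """Stratified random sampling, proportional to partition size (assumed).

        Returns at most `batch_size` transitions.
        """
        with self.lock:
            pools = [list(p) for p in self.partitions.values() if p]
        total = sum(len(pool) for pool in pools)
        if total == 0:
            return []
        batch = []
        for pool in pools:
            quota = math.ceil(batch_size * len(pool) / total)
            batch.extend(rng.sample(pool, min(quota, len(pool))))
        rng.shuffle(batch)
        return batch[:batch_size]


def run_worker(buffer, worker_id, steps):
    """Stand-in for one agent clone interacting with its own environment copy."""
    rng = random.Random(worker_id)
    for step in range(steps):
        reward = rng.choice([-1.0, 0.0, 1.0])  # toy reward, not a real environment
        buffer.add((step, worker_id, reward, step + 1, False))


def collect_in_parallel(buffer, num_workers=4, steps=50):
    """Run several clones in threads and pool their experience in one buffer."""
    threads = [
        threading.Thread(target=run_worker, args=(buffer, i, steps))
        for i in range(num_workers)
    ]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
```

In this sketch the clones only fill a shared buffer. A training loop would call `sample` on that buffer to obtain mixed batches of positive, negative, and neutral transitions for updating the network parameters.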

Bibliographic Data

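Publication/Announcement No.: CN112801290B
Patent type: Invention patent
Publication/Announcement date: 2021-11-05
Original format: PDF
Applicant/Patentee: Army Engineering University of PLA (中国人民解放军陆军工程大学)
Application/Patent No.: CN202110216405.9
Inventors: Zhang Tingting (张婷婷); Dong Hui (董会); Zhang Sainan (张赛男)
Filing date: 2021-02-26
Classification: G06N3/08(20060101)
Agency: 32224 Nanjing Zongheng Intellectual Property Agency Co., Ltd. (南京纵横知识产权代理有限公司)
Agent: He Chunting (何春廷)
Address: No. 88 Houbiaoying Road, Qinhuai District, Nanjing, Jiangsu Province 210014
Database entry time: 2022-08-23 12:44:52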
