SYSTEM AND METHOD FOR COLLABORATIVE DECENTRALIZED PLANNING USING DEEP REINFORCEMENT LEARNING AGENTS IN AN ASYNCHRONOUS ENVIRONMENT

Abstract

A method (300), and corresponding systems (400) and computer-readable mediums (426), for implementing a hierarchical multi-agent control system for an environment. A method (300) includes generating (306) an observation of an environment (104) by a first agent process (206) and sending (308) a first message (464) that includes the observation to a meta-agent process (202). The method includes receiving (312) a second message (464) that includes a goal (454), by the first agent process (206) and from the meta-agent process (202). The method includes evaluating (314) a plurality of actions (456), by the first agent process (206) and based on the goal (454), to determine a selected action (456). The method includes applying (316) the selected action (456) to the environment (104) by the first agent process (206).
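
The five steps in the abstract (observe, send, receive a goal, evaluate actions, apply) can be sketched as a minimal message loop. Everything named here is hypothetical and not taken from the patent: the `Message`, `Environment`, `MetaAgent`, and `Agent` classes, the integer-valued toy environment, and the fixed-target goal. The action scoring is a simple greedy distance rule standing in for whatever learned policy the deep reinforcement learning agents would use, and the meta-agent is called synchronously here, whereas the title describes an asynchronous setting with separate processes.

```python
from dataclasses import dataclass
from queue import Queue


@dataclass
class Message:
    """Hypothetical envelope exchanged between an agent and the meta-agent."""
    sender: str
    payload: object


class Environment:
    """Toy environment (assumption): one integer state that actions add to."""

    def __init__(self, state: int = 0):
        self.state = state

    def apply(self, action: int) -> None:
        self.state += action


class MetaAgent:
    """Answers each observation with a goal; here the goal is a fixed target."""

    def __init__(self, target: int):
        self.target = target
        self.inbox: Queue = Queue()

    def step(self, outbox: Queue) -> None:
        self.inbox.get()  # consume the observation; a learned policy would use it
        outbox.put(Message("meta", self.target))  # second message: the goal


class Agent:
    """First agent process: observes, reports, receives a goal, then acts."""

    def __init__(self, env: Environment, actions: list[int]):
        self.env = env
        self.actions = actions
        self.inbox: Queue = Queue()

    def evaluate(self, goal: int) -> int:
        # Greedy stand-in for a learned value function: pick the action whose
        # predicted next state is closest to the goal.
        return min(self.actions, key=lambda a: abs(self.env.state + a - goal))

    def run_once(self, meta: MetaAgent) -> int:
        observation = self.env.state                    # generate observation
        meta.inbox.put(Message("agent", observation))   # send first message
        meta.step(self.inbox)                           # meta-agent replies
        goal = self.inbox.get().payload                 # receive second message
        action = self.evaluate(goal)                    # evaluate actions
        self.env.apply(action)                          # apply selected action
        return action
```

With an environment starting at 0, actions `[-1, 0, 1]`, and a meta-agent target of 3, four calls to `run_once` step the state up to the target and then hold it there.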

Bibliographic data

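Publication number: WO2019183195A1
Patent type:
Publication date: 2019-09-26
Original format: PDF
Applicant/assignee: SIEMENS CORPORATION
Application number: WO2019US23125
Inventors: CHALUPKA KRZYSZTOF; SRIVASTAVA SANJEEV
Filing date: 2019-03-20
Classification: G06Q10/06
Country: WO
Database entry time: 2022-08-21 11:53:03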
