Local Utility Estimation in Model-Free, Multi-Agent Environments

机译：无模型，多智能体环境中的局部效用估计

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Software agents are an enabling technology that supports rapid, automated, distributed decision making. Many joint task environments provide reward that is based on the performance of the collective, making it difficult to assign reward accurately to individual agents based on their performance. Some method is needed to assign the proper amount of credit to each of the agents in a collective, referred to as structural credit assignment, in an effort to maximize global utility. Within the multi-credit assignment problem the objective is to accurately estimate an agent's local utility based only on a global observation or global reward. To achieve an initial local estimate for each agent a Kalman filter technique is employed. The local utility estimates created through this technique however are independent of knowledge held by other agents in the environment. This leads to the intuition that there is room to improve local utility estimation through the sharing of knowledge between agents. Hence, different communication schemes are explored in order to not only improve the local estimates provided by the Kalman filter but in an effort to allow the agents to more rapidly converge to good policies.

著录项

作者
Hudack, J.; Gemelli, N.; Scalzo, M.;
展开▼
作者单位

展开▼
年度 2010
页码 1-13
总页数 13
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Multiagent systems ; Kalman filtering ; Learning machines ; Software programs;

机译：多代理系统;卡尔曼滤波;学习机;软件程序;

相似文献

外文文献
中文文献
专利

1. Utility distribution matters: enabling fast belief propagation for multi-agent optimization with dense local utility function [J] . Deng Yanchen, An Bo Autonomous agents and multi-agent systems . 2021,第2期

机译：公用事业分配事项：使多功能局部函数的多代理优化能够快速信仰传播
2. Decentralized Multi-agent information-theoretic control for target estimation and localization: finding gas leaks [J] . Joseph R Bourne, Matthew N Goodell, Xiang He, The International journal of robotics research . 2020,第13期

机译：分散的多代理信息 - 目标估算和定位信息控制：寻找煤气泄漏
3. Multi-agent Simulation for Promoting Clean Energy Vehicles from the Perspective of Concern for the Environment and Local Interactions [J] . Masashi OKUSHIMA Asian Transport Studies . 2016,第1期

机译：从关注环境和局部相互作用的角度看促进清洁能源汽车的多主体仿真
4. Exploiting Two-Dimensional Symmetry and Unimodality for Model-Free Source Localization in Harsh Environment [C] . Junting Chen IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：在恶劣环境中利用二维对称性和单峰性进行无模型源定位
5. The Utility of Using Virtue Locales to Explain Criminogenic Environments [D] . ?Boehme, Hunter Max 2020

机译：使用美德语言环境来解释犯因环境的效用
6. Model-free estimation of COVID-19 transmission dynamics from a complete outbreak [O] . Alex James, Michael J. Plank, Shaun Hendy, 2021

机译：完全爆发的无Covid-19传输动态的无模型估算
7. Distributed cooperative control and estimation with multi-agent robotic systems: stabilization notwithstanding unreliable communications, and optimization in evolving environments [O] . Davide Spinello 2019

机译：多智能体机械系统的分布式协作控制和估算：稳定规定，尽管不可靠的通信，以及不可靠环境的优化

Local Utility Estimation in Model-Free, Multi-Agent Environments

摘要

著录项

相似文献

相关主题

期刊订阅