首页> 外文会议>International Conference on Intelligent Robotics and Applications >Evaluating Q-Learning Policies for Multi-objective Foraging Task in a Multi-agent Environment

【24h】

Evaluating Q-Learning Policies for Multi-objective Foraging Task in a Multi-agent Environment

机译：评估多代理环境中的多目标觅食任务的Q学习策略

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper evaluates the performances of the reported q-learning policies for multi-agent systems. A set of extensively used policies were identified in the open literature namely greedy, e-greedy, Boltzmann Distribution, Simulated Annealing and Probabiliy Matching. Five agents are modeled to search and retrieve pucks back to a home location in the environment under specified constraints. A number of simulation-based experiments was conducted and based on the numerical results that was obtained, the performances of the learning policies are discussed.

机译：本文评估了Multi-Agent系统报告的Q学习策略的表演。在开放文献中确定了一系列广泛的使用政策，即贪婪，电子贪婪，Boltzmann分布，模拟退火和概率匹配。在指定的约束下，五个代理被建模以搜索和检索冰球回到环境中的归属位置。进行了许多基于仿真的实验，并基于获得的数值结果，讨论了学习政策的性能。

著录项

来源
《International Conference on Intelligent Robotics and Applications 》|2010年||共12页
会议地点
作者
Yogeswaran M.; Ponnambalam S. G.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18－53;
关键词

相似文献

外文文献
中文文献
专利

1. Q-learning by the nth step state and multi-agent negotiation in unknown environment [J] . Job Josip, Jovi? Franjo, Livada ?aslav Technical Gazette . 2012 ,第3期

机译：在未知环境中通过第n步状态进行Q学习和多主体协商
2. Pesticide taxation and multi-objective policy-Making: farm Modelling to evaluate profit/environment trade-offs [J] . Katherine Falconer, Ian Hodge Ecological Economics . 2001 ,第2期

机译：农药税收和多目标政策制定：用于评估利润/环境平衡的农场模型
3. Low load DIDS task scheduling based on Q-learning in edge computing environment [J] . Zhao Xu, Huang Guangqiu, Gao Ling, Journal of network and computer applications . 2021 ,第Auga期

机译：低负载基于边缘计算环境中的Q学习完成任务调度
4. Evaluating Q-Learning Policies for Multi-objective Foraging Task in a Multi-agent Environment [C] . Yogeswaran M, Ponnambalam S.G. ICIRA 2010;International conference on intelligent robotics and applications . 2010

机译：在多主体环境中评估多目标搜寻任务的Q学习策略
5. Task Planning for Heterogeneous Multi-Agent Systems in Dynamic Environments [D] . Dadvar, Mehdi. 2020

机译：动态环境中异构多代理系统的任务规划
6. Policies to Create Healthier Food Environments in Canada: Experts’ Evaluation and Prioritized Actions Using the Healthy Food Environment Policy Index (Food-EPI) [O] . Lana Vanderlee, Sahar Goorang, Kimiya Karbasy, 2019

机译：在加拿大创造更健康的食品环境的政策：使用健康食品环境政策指数（Food-EPI）进行的专家评估和优先行动
7. Task Allocation on Layered Multi-Agent Systems: When Evolutionary Many-Objective Optimization Meets Deep Q-Learning [O] . Mincan Li, Zidong Wang, Kenli Li, 2021

机译：分层多助理系统上的任务分配：当进化的多目标优化符合深度Q学习时
8. Quicker Q-Learning in Multi-Agent Systems [R] . Agogino, Adrian K., Tumer, Kagan 2005

机译：多代理系统中的快速Q-Learning

Evaluating Q-Learning Policies for Multi-objective Foraging Task in a Multi-agent Environment

摘要

著录项

相似文献

相关主题

期刊订阅