An actor-Critic algorithm for multi-agent learning in queue-based stochastic games

D. Krishna Sundar; K. Ravikumar

首页> 外文期刊>Neurocomputing >An actor-Critic algorithm for multi-agent learning in queue-based stochastic games

【24h】

An actor-Critic algorithm for multi-agent learning in queue-based stochastic games

机译：基于队列的随机博弈中多主体学习的actor-Critic算法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We consider state-dependent pricing in a two-player service market stochastic game where state of the game and its transition dynamics are modeled using a semi-Markovian queue. We propose a multi-time scale actor-critic based reinforcement algorithm for multi-agent learning under self-play and provide experimental results on Nash convergence.

机译：我们在两人服务市场随机游戏中考虑基于状态的定价，其中使用半马尔可夫排队对游戏状态及其过渡动态进行建模。我们提出了一种基于时间尺度行为者批评的自增强下的多智能体学习强化算法，并提供了关于纳什收敛的实验结果。

著录项

来源
《Neurocomputing》 |2014年第15期|258-265|共8页
作者
D. Krishna Sundar; K. Ravikumar;
展开▼
作者单位

Indian Institute of Management Bangalore, Bangalore-560076, India;

D-103. Marsh Palm Retreat Outer Ring Road, Bangalore-560103, India;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Service markets; Queues; Dynamic pricing; Stochastic games; Learning in games; Reinforcement learning;

机译：服务市场;队列;动态定价;随机游戏;在游戏中学习;强化学习;

相似文献

外文文献
中文文献
专利

1. A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning [J] . Wesley Suttle, Zhuoran Yang, Kaiqing Zhang, IFAC PapersOnLine . 2020,第2期

机译：用于分布式强化学习的多功能脱机演员 - 批评算法
2. An algorithm of pretrained fuzzy actor-critic learning applying in fixed-time space differential game [J] . Wang Xiao, Shi Peng, Schwartz Howard, Proceedings of the Institution of Mechanical Engineers . 2021,第14期

机译：固定时间空间差异游戏申请普里雷普雷斯模糊演员 - 评论家算法
3. Multi-agent Inverse Reinforcement Learning for Certain General-Sum Stochastic Games [J] . Lin Xiaomin, Adams Stephen C., Beling Peter A. The Journal of Artificial Intelligence Research . 2019,第期

机译：用于某一般性加速游戏的多功能逆钢筋学习
4. DISTRIBUTED MULTI-AGENT ACTOR-CRITIC ALGORITHMS WITH APPLICATIONS TO STOCHASTIC PATH FINDING PROBLEMS [C] . Paris Pennesi, Yimin Yu, Ioannis Ch. Paschalidis IFAC Symposium on Robust Control Design . 2009

机译：分布式多功能演员 - 批评算法，应用于随机路径发现问题
5. A Bounded Actor-Critic Algorithm for Reinforcement Learning [D] . Lawhead, Ryan Jacob. 2017

机译：一种有限于钢筋学习的批评算法
6. Multi-agent reinforcement learning with approximate model learning for competitive games [O] . Young Joon Park, Yoon Sang Cho, Seoung Bum Kim 2012

机译：多主体强化学习和近似模型学习的竞技游戏
7. A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning [O] . Wesley Suttle, Zhuoran Yang, Kaiqing Zhang, 2020

机译：用于分布式强化学习的多功能脱机演员 - 批评算法

An actor-Critic algorithm for multi-agent learning in queue-based stochastic games

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅