Interactive Dynamic Influence Diagrams and Their Exact Solution Algorithm

Li Bo (李波); Cao Langcai (曹浪财); Zhuang Jinfa (庄进发)

Abstract

To represent the dynamic structural relationships among agents that make decisions in a partially observable Markov environment shared with other agents, influence diagrams (IDs) are extended over both time and structure. The result is the interactive dynamic influence diagram (I-DID), a decision model that can also model the other agents. I-DIDs are graphical models for sequential multi-agent decision making under uncertainty. Solving an I-DID gives an agent its optimal policy given its belief, which includes a predicted probability distribution over the other agents' behavior, as the agent acts and observes in the environment; this makes multi-agent decision problems more tractable to solve. Exact algorithms for I-DIDs, however, must solve every candidate model of the other agents and then update all of those models at each time step. Because the space of candidate models grows exponentially with the number of time steps, the computation quickly becomes expensive. An exact solution method based on a minimal set of behaviorally equivalent models is therefore proposed: it shrinks the space of the other agents' candidate models and updates only the selected models, which limits model growth and reduces the computational cost. Simulation experiments on model instances demonstrate the effectiveness of the algorithm.
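The idea of collapsing behaviorally equivalent models can be illustrated with a minimal Python sketch. It is not the authors' algorithm, which the abstract does not spell out. It assumes that two candidate models are behaviorally equivalent exactly when solving them yields the same policy. The names `minimal_model_set`, `solve`, and `toy_policy`, along with the threshold values, are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def minimal_model_set(models, belief, solve):
    """Keep one representative per behavioral-equivalence class.

    models: list of hashable candidate models of the other agent (hypothetical)
    belief: dict mapping each model to the probability assigned to it
    solve:  function returning a model's policy as a hashable value
    """
    classes = defaultdict(list)
    for m in models:
        # Assumption: identical policies mean behaviorally equivalent models.
        classes[solve(m)].append(m)

    representatives = {}
    for members in classes.values():
        rep = members[0]  # any member predicts the same behavior
        # The representative inherits the probability mass of its whole class.
        representatives[rep] = sum(belief[m] for m in members)
    return representatives


def toy_policy(p):
    """Hypothetical stand-in for solving a model: p is the other agent's
    belief that a reward lies on the right; thresholds are illustrative."""
    if p < 0.2:
        return "open-left"
    if p > 0.8:
        return "open-right"
    return "listen"


candidates = [0.1, 0.3, 0.5, 0.7, 0.9]
prior = {m: 0.2 for m in candidates}
reduced = minimal_model_set(candidates, prior, toy_policy)
print(reduced)  # three representatives instead of five candidate models
```

In this toy setting the three candidates that all prescribe `listen` are merged into one representative, so only three models would need to be updated at the next time step while the predicted behavior distribution is unchanged.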

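Bibliographic record

Source
Journal of PLA University of Science and Technology (Natural Science Edition) (《解放军理工大学学报（自然科学版）》) | 2011, No. 2 | pp. 119-124 | 6 pages
Authors
Li Bo (李波); Cao Langcai (曹浪财); Zhuang Jinfa (庄进发)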
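Author affiliations

School of Information Science and Technology, Xiamen University, Xiamen, Fujian 361005;
School of Information Science and Technology, Xiamen University, Xiamen, Fujian 361005;
厦门东南融通系统工程有限公司, Xiamen, Fujian 361005;
School of Communication and Information, PLA Information Engineering University, Zhengzhou, Henan 450002
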
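Original format: PDF
Language of full text: Chinese
CLC classification: Artificial intelligence theory
Keywords
Multi-agent decision making; interactive dynamic influence diagrams; behavioral equivalence; minimal model update set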
