Fast MCVI Based on Improved NSGA2


Abstract

Partially observable Markov decision processes (POMDPs) are now widely used in many fields. Solving a POMDP is computationally prohibitive because of the curse of dimensionality, and Monte Carlo value iteration (MCVI) for POMDPs is regarded as a promising approach to breaking that curse. Although MCVI is a major step toward solving this problem, it still has shortcomings, such as a slow convergence rate and continual growth in the number of policy-graph nodes. This paper therefore proposes a fast MCVI based on an improved NSGA2. Unlike the general NSGA2, the improved NSGA2 initializes the population from experiential knowledge and uses a self-adjusting value as the probability of crossover and mutation. Before executing MCVI, the algorithm sets a series of thresholds. When the algorithm obtains a temporary policy graph that reaches one of the thresholds, it applies a discount operator to update the threshold and uses the improved NSGA2 to update the policy graph. It then executes MCVI again and repeats this process until termination. Numerical experiments on the classic corridor problem show that the fast MCVI achieves an increase of about 8% in convergence rate over the original MCVI and a decrease of about 60% in the number of policy-graph nodes.
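
The control flow described in the abstract (run MCVI, check thresholds, discount the triggered threshold, update the policy graph with the improved NSGA2, resume MCVI) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say what quantity the thresholds measure or how the discount operator is defined, so the sketch assumes the thresholds are node counts and that the discount operator multiplies the triggered threshold by a constant factor. The names `fast_mcvi_loop`, `mcvi_step`, `nsga2_update`, `discount`, and `max_iters` are hypothetical, and the MCVI backup and the NSGA2 update are passed in as placeholder callables.

```python
from typing import Callable, List

# Stand-in type: a policy graph is represented only by a list of node ids.
PolicyGraph = List[int]


def fast_mcvi_loop(
    mcvi_step: Callable[[PolicyGraph], PolicyGraph],
    nsga2_update: Callable[[PolicyGraph], PolicyGraph],
    thresholds: List[float],
    discount: float,
    max_iters: int,
) -> PolicyGraph:
    """Alternate MCVI steps with NSGA2-based updates of the policy graph.

    Assumptions (not stated in the abstract): thresholds are node counts,
    a threshold is "reached" when len(graph) >= threshold, and the discount
    operator multiplies the triggered threshold by `discount`.
    """
    graph: PolicyGraph = []
    limits = sorted(thresholds)  # copy, so the caller's list is not mutated
    for _ in range(max_iters):
        graph = mcvi_step(graph)  # hypothetical MCVI step; may add nodes
        for i, limit in enumerate(limits):
            if len(graph) >= limit:
                limits[i] = limit * discount  # assumed discount operator
                graph = nsga2_update(graph)   # hypothetical improved-NSGA2 update
                break                         # then resume MCVI
    return graph
```

With toy stand-ins such as `mcvi_step=lambda g: g + [len(g)]` (add one node per step) and `nsga2_update=lambda g: g[::2]` (keep every other node), the loop can be exercised without any POMDP solver; a real use would replace both callables with an actual MCVI backup and an NSGA2 search over candidate policy graphs.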


Bibliographic record

Source: International Conference on Intelligent Human-Machine Systems and Cybernetics, 2014, pp. 123-126 (4 pages)
Authors: Liu Yin; Zhou Yingping; Chen Shuai
Original format: PDF
Keywords: Algorithm design and analysis; Convergence; Genetic algorithms; Robots; Sociology; Sorting; Statistics; MCVI; NSGA2; POMDPs


Similar literature

1. An improved NSGA2 to solve a bi-objective optimization problem of multi-state electronic transaction network [J]. Yeh Cheng-Ta. Reliability Engineering & System Safety, 2019, Issue Nova.
2. Adaptive directed evolved NSGA2 based node placement optimization for wireless sensor networks [J]. Zhang Yijie, Liu Mandan. Wireless Networks, 2020, Issue 5.
3. A Scheduling Method Based on NSGA2 for Steelmaking and Continuous Casting Production Process [J]. Qing Li, Xiuying Wang, Xiaofeng Zhang. IFAC PapersOnLine, 2018, Issue 18.
4. Fast MCVI Based on Improved NSGA2 [C]. Liu Yin, Zhou Yingping, Chen Shuai. International Conference on Intelligent Human-Machine Systems and Cybernetics, 2014.
5. Improved framework for fast and efficient memory-based frame data reconfiguration for multi-row spanning designs on field programmable gate arrays [D]. Sreeram, Rohan. 2010.
6. Pneumonia Detection Using an Improved Algorithm Based on Faster R-CNN [O]. Shangjie Yao, Yaowu Chen, Xiang Tian. 2021.
7. Detection and Classification of Lung Nodule in Diagnostic CT: A TsDN method based on Improved 3D-Faster R-CNN and Multi-Scale Multi-Crop Convolutional Neural Network [O]. Muhammad Bilal Zia, Juan Juan Zhao, Xiao Ning. 2020.
