首页> 外文会议>IEEE/WIC/ACM International Conference on Intelligent Agent Technology >Rough Sets-Based Prototype Optimization in Kanerva-Based Function Approximation
【24h】

Rough Sets-Based Prototype Optimization in Kanerva-Based Function Approximation

机译:基于Kanerva的函数逼近中基于粗糙集的原型优化

获取原文

摘要

Problems involving multi-agent systems can be complex and involve huge state-action spaces, making such problems difficult to solve. Function approximation schemes such as Kanerva coding with dynamic, frequency-based prototype selection can improve performance. However, selecting the number of prototypes is difficult and the approach often still gives poor performance. In this paper, we solve a collection of hard instances of the predator-prey pursuit problem and argue that poor performance is caused by inappropriate selection of the prototypes for Kanerva coding, including the number and allocation of these prototypes. We use rough sets theory to reformulate the selection of prototypes and their implementation in Kanerva coding. We introduce the equivalence class structure to explain how prototype collisions occur, use a reduct of the set of prototypes to eliminate unnecessary prototypes, and generate new prototypes to split the equivalence classes causing prototype collisions. The Rough Sets-based approach increases the fraction of predator-prey test instances solved by up to 24.5% over frequency-based Kanerva coding. We conclude that prototype optimization based on rough set theory can adaptively explore the optimal number of prototypes and greatly improve a Kanerva-based reinforcement learner's ability to solve large-scale multi-agent problems.
机译:涉及多主体系统的问题可能很复杂,并且涉及巨大的状态作用空间,这使得此类问题难以解决。函数逼近方案(例如具有基于频率的动态原型选择的Kanerva编码)可以提高性能。但是,选择原型的数量很困难,并且该方法通常仍会产生较差的性能。在本文中,我们解决了捕食者—猎物追捕问题的一系列难题,并认为性能差是由于对Kanerva编码的原型选择不当造成的,包括这些原型的数量和分配。我们使用粗糙集理论来重构原型的选择及其在Kanerva编码中的实现。我们介绍了等价类结构来解释原型冲突是如何发生的,使用一组原型的简化来消除不必要的原型,并生成新的原型以拆分导致原型冲突的等价类。与基于频率的Kanerva编码相比,基于粗糙集的方法将求解的捕食者-猎物测试实例的比例提高了多达24.5%。我们得出的结论是,基于粗糙集理论的原型优化可以自适应地探索最优数量的原型,并极大地提高了基于Kanerva的强化学习者解决大规模多主体问题的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号