...
首页> 外文期刊>Frontiers of computer science >Parallel exploration via negatively correlated search
【24h】

Parallel exploration via negatively correlated search

机译:通过负相关搜索并行探索

获取原文
获取原文并翻译 | 示例
           

摘要

Effective exploration is key to a successful search process. The recently proposed negatively correlated search (NCS) tries to achieve this by coordinated parallel exploration, where a set of search processes are driven to be negatively correlated so that different promising areas of the search space can be visited simultaneously. Despite successful applications of NCS, the negatively correlated search behaviors were mostly devised by intuition, while deeper (e.g., mathematical) understanding is missing. In this paper, a more principled NCS, namely NCNES, is presented, showing that the parallel exploration is equivalent to a process of seeking probabilistic models that both lead to solutions of high quality and are distant from previous obtained probabilistic models. Reinforcement learning, for which exploration is of particular importance, are considered for empirical assessment. The proposed NCNES is applied to directly train a deep convolution network with 1.7 million connection weights for playing Atari games. Empirical results show that the significant advantages of NCNES, especially on games with uncertain and delayed rewards, can be highly owed to the effective parallel exploration ability.
机译:有效的探索是成功搜索过程的关键。最近提出的负相关搜索(NCS)尝试通过协调并行探索来实现这一目标,其中驱动了一组搜索过程,以否定相关,从而可以同时访问搜索空间的不同承诺区域。尽管成功地申请了NCS,但绝对相关的搜索行为主要由直觉设计,而更深(例如,数学)理解缺失。在本文中,提出了一种更原则的NCS,即NCNES,表明并行探索等同于寻求概率模型的过程,这些模型都导致高质量的解决方案,并且与之前获得的概率模型远离。勘探学习特别重要的强化学习被认为是为了实证评估。拟议的NCNES应用于直接培训一个深度卷积网络,为参加atari游戏的170万个连接权重训练。经验结果表明,NCNES的显着优势,尤其是在不确定和延迟奖励的游戏中,可以高度归功于有效的并行探索能力。

著录项

  • 来源
    《Frontiers of computer science》 |2021年第5期|155333.1-155333.13|共13页
  • 作者单位

    Guangdong Provincial Key Laboratory of Brain-inspired Intelligent Computation Department of Computer Science and Engineering Southern University of Science and Technology Shenzhen 518055 China;

    Guangdong Provincial Key Laboratory of Brain-inspired Intelligent Computation Department of Computer Science and Engineering Southern University of Science and Technology Shenzhen 518055 China;

    Guangdong Provincial Key Laboratory of Brain-inspired Intelligent Computation Department of Computer Science and Engineering Southern University of Science and Technology Shenzhen 518055 China;

    Guangdong Provincial Key Laboratory of Brain-inspired Intelligent Computation Department of Computer Science and Engineering Southern University of Science and Technology Shenzhen 518055 China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    evolutionary computation; reinforcement learn-ing; exploration;

    机译:进化计算;强化学习;勘探;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号