...
首页> 外文期刊>Neural computation >Discovering the Structure of a Reactive Environment by Exploration
【24h】

Discovering the Structure of a Reactive Environment by Exploration

机译:通过探索发现反应性环境的结构

获取原文
获取原文并翻译 | 示例

摘要

Consider a robot wandering around an unfamiliar environment, performing actions and observing the consequences. The robot's task is to construct an internal model of its environment, a model that will allow it to predict the effects of its actions and to determine what sequences of actions to take to reach particular goal states. Rivest and Schapire (1987a,b; Schapire 1988) have studied this problem and have designed a symbolic algorithm to strategically explore and infer the structure of “finite state” environments. The heart of this algorithm is a clever representation of the environment called an update graph. We have developed a connectionist implementation of the update graph using a highly specialized network architecture. With backpropagation learning and a trivial exploration strategy — choosing random actions — the connectionist network can outperform the Rivest and Schapire algorithm on simple problems. Our approach has additional virtues, including the fact that the network can accommodate stochastic environments and that it suggests generalizations of the update graph representation that do not arise from a traditional, symbolic perspective.
机译:考虑一个机器人在陌生的环境中游荡,执行动作并观察后果。机器人的任务是构建其环境的内部模型,该模型将使其能够预测其动作的效果并确定为达到特定目标状态应采取的动作顺序。 Rivest和Schapire(1987a,b; Schapire 1988)研究了这个问题,并设计了一种符号算法来从策略上探索和推断“有限状态”环境的结构。该算法的核心是对环境的巧妙表示,称为更新图。我们已经使用高度专业的网络体系结构开发了更新图的连接器实现。通过反向传播学习和简单的探索策略(选择随机动作),在简单的问题上,连接主义网络可以胜过Rivest和Schapire算法。我们的方法还有其他优点,包括网络可以适应随机环境的事实,并且它建议对更新图表示形式进行概括,而不是从传统的符号角度出发。

著录项

  • 来源
    《Neural computation》 |1990年第4期|447-457|共11页
  • 作者

    Mozer M; Bachrach J;

  • 作者单位

    Department of Computer Science and Institute of Cognitive Science, University of Colorado, Boulder, CO 80309-0430 USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《化学文摘》(CA);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号