首页> 美国政府科技报告 >Using Neural Networks and Dyna Algorithm for Integrated Planning, Reacting andLearning in Systems

【24h】

Using Neural Networks and Dyna Algorithm for Integrated Planning, Reacting andLearning in Systems

机译：用神经网络和Dyna算法进行系统集成规划，反应和学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The traditional AI answer to the decision making problem for a robot is planning.However, planning is usually CPU-time consuming, depending on the availability and accuracy of a world model. The Dyna system generally described in earlier work, uses trial and error to learn a world model which is simultaneously used to plan reactions resulting in optimal action sequences. It is an attempt to integrate planning, reactive, and learning systems. The architecture of Dyna is presented. The different blocks are described. There are three main components of the system. The first is the world model used by the robot for internal world representation. The input of the world model is the current state and the action taken in the current state. The output is the corresponding reward and resulting state. The second module in the system is the policy. The policy observes the current state and outputs the action to be executed by the robot. At the beginning of program execution, the policy is stochastic and through learning progressively becomes deterministic. The policy decides upon an action according to the output of an evaluation function, which is the third module of the system. The evaluation function takes the following as input: the current state of the system, the action taken in that state, the resulting state, and a reward generated by the world which is proportional to the current distance from the goal state. Originally, the work proposed was as follows: (1) to implement a simple 2-D world where a 'robot' is navigating around obstacles, to learn the path to a goal, by using lookup tables; (2) to substitute the world model and Q estimate function Q by neural networks; and (3) to apply the algorithm to a more complex world where the use of a neural network would be fully justified. In this paper, the system design and achieved results will be described. First we implement the world model with a neural network and leave Q implemented as a look up table. Next, we use a lookup table for the world model and implement the Q function with a neural net. Time limitations prevented the combination of these two approaches. The final section discusses the results and gives clues for future work.

著录项

作者
Lima, P.; Beard, R.;
展开▼
作者单位

展开▼
年度 1992
页码
总页数 27
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Algorithms; Autonomous navigation; Decision making; Machine learning; Neuralnets; Planning; Robots; Artificial intelligence; Errors; Robotics; Sequencing; Stochastic processes;

机译：算法;自主导航;决策;机器学习;神经网络;规划;机器人;人工智能;错误;机器人;测序;随机过程;

相似文献

外文文献
中文文献
专利

1. Cascading artificial neural networks optimized by genetic algorithms and integrated with global navigation satellite system to offer accurate ubiquitous positioning in urban environment [J] . Hamid Mehmood, Nitin K. Tripathi Computers，environment and urban systems . 2013,第jana期

机译：通过遗传算法优化并与全球导航卫星系统集成的级联人工神经网络，可在城市环境中提供准确的普遍定位
2. Cell loading and shipment optimisation in a cellular manufacturing system: an integrated genetic algorithms and neural network approach [J] . Gokhan Egilmez, Can Celikbilek, Melih Altun, International journal of industrial and systems engineering . 2016,第3期

机译：细胞制造系统中的细胞装载和运输优化：集成的遗传算法和神经网络方法
3. Dyna-H: A heuristic planning reinforcement learning algorithm applied to role-playing game strategy decision systems [J] . Matilde Santos, Jose Antonio Martin H., Victoria Lopez, Knowledge-Based Systems . 2012,第期

机译：Dyna-H：一种启发式计划强化学习算法，应用于角色扮演游戏策略决策系统
4. Applying adaptive structured genetic algorithm to reasoning andlearning method for fuzzy rules using neural networks [C] . Ichimura T., Tazaki E. 1998 IEEE Symposium on Advances in Digital Filtering and Signal Processing, 1998 . 1998

机译：自适应结构遗传算法在神经网络模糊规则推理学习中的应用
5. Integrated neural systems and algorithms for analysis of population activity during dexterous hand movements [D] . Mollazadeh, Mohsen 2011

机译：集成的神经系统和算法，用于在灵巧的手运动过程中分析种群活动
6. Analysis of Landscape Ecological Planning Based on the High-Order Multiwavelet Neural Network Algorithm [O] . ChuanDong Yu, Nan Du 2021

机译：基于高阶多小波神经网络算法的景观生态规划分析
7. Dyna, an Integrated Architecture for Learning, Planning, and Reacting [O] . Richard S. Sutton 1991

机译：Dyna，学习，规划和反应的集成架构
8. Using neural networks and Dyna algorithm for integrated planning, reacting and learning in systems [R] . Lima, Pedro, Beard, Randal 1992

机译：使用神经网络和Dyna算法进行系统中的综合规划，反应和学习

Using Neural Networks and Dyna Algorithm for Integrated Planning, Reacting andLearning in Systems

摘要

著录项

相似文献

相关主题

期刊订阅