首页> 外文会议>IEEE EUROCON;EUROCON '09 >Concept extraction using temporal-difference network EUROCON2009

【24h】

Concept extraction using temporal-difference network EUROCON2009

机译：使用时差网络EUROCON2009进行概念提取

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose a novel framework to extract temporally extended concepts in a grid world environment using a probable data structure named temporal-difference network. First a reinforcement-learning agent tries to learn its environment for the task of wall following. After that we train a newly introduced temporal-difference network (TDN) in the brain of the agent in order to gain a predictive model of the environment. At last the most promising sequences of action-observation of the given environment will be sorted out based on their probability.

机译：在本文中，我们提出了一种新颖的框架，该框架使用称为时差网络的可能数据结构来提取网格世界环境中的时间扩展概念。首先，强化学习代理尝试学习其环境以完成墙面跟踪任务。之后，我们在代理人的大脑中训练了一个新引入的时差网络（TDN），以获取环境的预测模型。最后，将根据概率对给定环境进行最有希望的行动观察序列。

著录项

来源
《IEEE EUROCON;EUROCON '09》|2009年|1888-1894|共7页
会议地点 Saint Petersburg(RU);Saint Petersburg(RU)
作者
Karbasian, H.; Ahmadabadi, M.N.; Araabi, B.N.;
展开▼
作者单位

Control & Intell. Process. Center of Excellence Univ. of Tehran Tehran Iran;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Markov processes; decision theory; intelligent robots; learning (artificial intelligence); mobile robots; predictive control; probability; tree data structures; POMDP problem; action-observation sequence; concept extraction; grid world environment; partially-observable Markov decision process; predictive model; probable tree data structure; reinforcement-learning agent; robot wall following task; temporal-difference network; Concept; MDP; POMDP; Reinforcement Learning;

机译：马尔可夫过程；决策理论智能机器人学习（人工智能）；移动机器人；预测控制；可能性;树数据结构； POMDP问题；动作观察顺序；概念提取；网格世界环境；可部分观察的马尔可夫决策过程；预测模型可能的树数据结构；强化学习剂机器人墙跟随任务；时差网络概念; MDP； POMDP；强化学习;

相似文献

外文文献
中文文献
专利

1. A 1-mW CMOS Temporal-Difference AER Sensor for Wireless Sensor Networks [J] . Kim D., Fu Z., Park J. H., Electron Devices, IEEE Transactions on . 2009,第11期

机译：用于无线传感器网络的1mW CMOS时差AER传感器
2. VNE-TD: A virtual network embedding algorithm based on temporal-difference learning [J] . Wang Sen, Bi Jun, Wu Jianping, Computer networks . 2019,第Octa9期

机译：VNE-TD：基于时差学习的虚拟网络嵌入算法
3. VNE-TD: A virtual network embedding algorithm based on temporal-difference learning [J] . Wang Sen, Bi Jun, Wu Jianping, Computer networks . 2019,第OCTa9期

机译：VNE-TD：基于时差学习的虚拟网络嵌入算法
4. CONCEPT EXTRACTION USING TEMPORAL-DIFFERENCE NETWORK EUROCON2009 [C] . Habib Karbasian, Majid N. Ahmadabadi, Babak N. Araabi International Conference Devoted to the Anniversary of Alexander Popov . 2009

机译：使用时间差异网络欧洲欧洲欧洲差异网络提取概念提取
5. Temporal-difference networks. [D] . Tanner, Brian Timothy. 2005

机译：时差网络。
6. A Visual Sensing Concept for Robustly Classifying House Types through a Convolutional Neural Network Architecture Involving a Multi-Channel Features Extraction [O] . Vahid Tavakkoli, Kabeh Mohsenzadegan, Kyandoghere Kyamakya 2020

机译：通过卷积神经网络架构涉及多通道特征提取的卷积神经网络架构的视觉感应概念
7. Using decision trees as the answer network in temporal-difference networks [O] . Antanas Laura, Driessens Kurt, Croonenborghs Tom, 2008

机译：在时差网络中将决策树用作答案网络
8. US long distance fiber optic networks: Technology, evolution and advanced concepts. Volume 2: Fiber optic technology and long distance networks [R] . 1986

机译：美国长途光纤网络：技术，发展和先进概念。第2卷：光纤技术和长途网络

Concept extraction using temporal-difference network EUROCON2009

摘要

著录项

相似文献

相关主题

期刊订阅