Employing Automatic Temporal Abstractions to Accelerate Utile Suffix Memory Algorithm

机译：利用自动时间抽象来加速Utile后缀存储算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The main objective of the memory based reinforcement learning algorithms for hidden state problems is to overcome the state aliasing issue using a form of short term memory during learning. Extended sequence tree method, on the other hand, is a sequence based automated temporal abstraction mechanism that can be appended to a reinforcement learning algorithm. Assuming a fully observable problem setting, it tries to find useful sub-policies in solution space that can be reused as timed actions, providing significant savings in terms of learning time. This paper presents a way to expand a well known memory based model-free reinforcement learning algorithm, namely Utile Suffix Memory, by using a modified version of extended sequence tree method. By this way, learning speed of the algorithm is increased under certain conditions. Enhancement is shown empirically via experimentation on some benchmark problems.

机译：基于存储器的用于隐藏状态问题的强化学习算法的主要目标是在学习过程中使用短期记忆的形式来克服状态混叠问题。另一方面，扩展序列树方法是一种基于序列的自动时间抽象机制，可以附加到强化学习算法中。假设问题的解决方案是完全可观察的，它会尝试在解决方案空间中找到有用的子策略，这些策略可以作为定时操作重用，从而在学习时间方面节省了很多时间。本文提出了一种使用扩展序列树方法的改进版本来扩展基于众所周知的基于内存的无模型增强学习算法的方法，即Utile Suffix Memory。通过这种方式，在一定条件下提高了算法的学习速度。通过对一些基准问题进行实验，经验表明了这种增强。

著录项

来源
《German conference on multiagent system technologies》|2014年|156-169|共14页
会议地点
作者
Erkin Cilden; Faruk Polat;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Reinforcement Learning; Utile Suffix Memory; Partially Observable Markov Decision Process; Extended Sequence Tree;

机译：强化学习; Utile后缀内存;部分可观察的马尔可夫决策过程;扩展序列树;
入库时间 2022-08-26 15:17:53

相似文献

外文文献
中文文献
专利

1. A survey of practical algorithms for suffix tree construction in external memory [J] . M. Barsky, U. Stege, A. Thomo Software . 2010,第11期

机译：外部存储器中后缀树构建的实用算法综述
2. Toward optimizing the cache performance of suffix trees for sequence analysis algorithms suffix tree cache performance optimization. [J] . Lee C, Huang CH Advances in Experimental Medicine and Biology . 2010,第Null期

机译：为了优化后缀树的缓存性能，以进行序列分析算法后缀树的缓存性能优化。
3. High-abstraction level complexity analysis and memory architecture simulations of multimedia algorithms [J] . Ravasi M., Mattavelli M. IEEE Transactions on Circuits and Systems for Video Technology . 2005,第5期

机译：多媒体算法的高抽象层次复杂度分析和存储架构仿真
4. Employing Automatic Temporal Abstractions to Accelerate Utile Suffix Memory Algorithm [C] . Erkin Cilden, Faruk Polat German Conference on Multiagent System Technologies . 2014

机译：采用自动时间抽象加速UTIle后缀内存算法
5. Convicted by memory: Automatically recovering spatial-temporal evidence from memory images. [D] . Saltaformaggio, Brendan D. 2016

机译：被记忆定罪：自动从记忆图像中恢复时空证据。
6. GHOSTX: An Improved Sequence Homology Search Algorithm Using a Query Suffix Array and a Database Suffix Array [O] . Shuji Suzuki, Masanori Kakuta, Takashi Ishida, -1

机译：GHOSTX：使用查询后缀数组和数据库后缀数组的改进的序列同源性搜索算法
7. High-abstraction level complexity analysis and memory architecture simulations of multimedia algorithms [O] . Massimo Ravasi, Marco Mattavelli 2005

机译：多抽象级复杂性分析和多媒体算法的内存架构模拟

Employing Automatic Temporal Abstractions to Accelerate Utile Suffix Memory Algorithm

摘要

著录项

相似文献

相关主题

期刊订阅