A Conclusive Analysis of the Finite-Time Behavior of the Discretized Pursuit Learning Automaton

首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >A Conclusive Analysis of the Finite-Time Behavior of the Discretized Pursuit Learning Automaton

【24h】

A Conclusive Analysis of the Finite-Time Behavior of the Discretized Pursuit Learning Automaton

机译：离散追踪学习自动机有限时间行为的结论性分析

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper deals with the finite-time analysis (FTA) of learning automata (LA), which is a topic for which very little work has been reported in the literature. This is as opposed to the asymptotic steady-state analysis for which there are, probably, scores of papers. As clarified later, unarguably, the FTA of Markov chains, in general, and of LA, in particular, is far more complex than the asymptotic steady-state analysis. Such an FTA provides rigid bounds for the time required for the LA to attain to a given convergence accuracy. We concentrate on the FTA of the Discretized Pursuit Automaton (DPA), which is probably one of the fastest and most accurate reported LA. Although such an analysis was carried out many years ago, we record that the previous work is flawed. More specifically, in all brevity, the flaw lies in the wrongly "derived" monotonic behavior of the LA after a certain number of iterations. Rather, we claim that the property should be invoked is the submartingale property. This renders the proof to be much more involved and deep. In this paper, we rectify the flaw and reestablish the FTA based on such a submartingale phenomenon. More importantly, from the derived analysis, we are able to discover and clarify, for the first time, the underlying dilemma between the DPA's exploitation and exploration properties. We also nontrivially confirm the existence of the optimal learning rate, which yields a better comprehension of the DPA itself.

机译：本文涉及学习自动机（LA）的有限时间分析（FTA），这是一个文献报道很少的工作。这与可能有数十篇论文的渐近稳态分析相反。正如后面将要阐明的，毫无疑问，一般来说，马尔可夫链的自由贸易区，特别是洛杉矶的自由贸易区，要比渐进稳态分析复杂得多。这样的FTA为LA达到给定的收敛精度所需的时间提供了严格的界限。我们专注于离散追踪自动机（DPA）的FTA，它可能是最快，最准确的LA报告之一。尽管这种分析是在很多年前进行的，但我们记录到以前的工作是有缺陷的。更具体地说，简而言之，缺陷在于经过一定数量的迭代后，LA的错误“推导”单调行为。相反，我们声称应该调用的属性是submartingale属性。这使得证据更加复杂和深入。在本文中，我们纠正了这一缺陷，并基于这种子市场现象重新建立了FTA。更重要的是，从派生的分析中，我们能够首次发现和澄清DPA的开采和勘探性质之间的潜在困境。我们还毫不费力地确认了最佳学习率的存在，这可以更好地理解DPA本身。

著录项

来源
《Neural Networks and Learning Systems, IEEE Transactions on》 |2020年第1期|284-294|共11页
作者

展开▼
作者单位

Univ Agder Ctr Artificial Intelligence Res N-4879 Grimstad Norway|Confirmit AS N-4878 Grimstad Norway;

Univ Agder Dept ICT N-4879 Grimstad Norway;

Univ Agder Dept ICT N-4879 Grimstad Norway|Carleton Univ Sch Comp Sci Ottawa ON K1S 5B6 Canada;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Convergence; Markov processes; Learning automata; Eigenvalues and eigenfunctions; Pursuit algorithms; Maximum likelihood estimation; Discretized pursuit automaton (DPA); finite-time analysis (FTA); learning automaton; pursuit algorithms (PAs);

机译：收敛;马尔可夫过程;学习自动机;特征值和特征函数;追踪算法;最大似然估计;离散追踪自动机（DPA）;有限时间分析（FTA）;学习自动机追踪算法（PA）;

相似文献

外文文献
中文文献
专利

1. Solutions for Multiagent Pursuit-Evasion Games on Communication Graphs: Finite-Time Capture and Asymptotic Behaviors [J] . Lopez Victor G., Lewis Frank L., Wan Yan, IEEE Transactions on Automatic Control . 2020,第5期

机译：用于通信图的多验追求逃避游戏的解决方案：有限时间捕获和渐近行为
2. Finite-time boundedness and finite-time l_2 gain analysis of discrete-time switched linear systems with average dwell time [J] . Xiangze Lin, Haibo Du, Shihua Li, Journal of the Franklin Institute . 2013,第4期

机译：具有平均停留时间的离散时间切换线性系统的有限时间有界性和有限时间l_2增益分析
3. Generalized pursuit learning schemes: new families of continuous and discretized learning automata [J] . Agache M., Oommen B.J. IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics . 2002,第6期

机译：广义追求学习计划：连续和离散学习自动机的新家族
4. An Improved Carbon Trading Behavioral Modelling Method Combining Discretized Statistical Analysis and Extreme Learning Machine [C] . Bang Jin, Yusheng Xue, Jie Huang, IEEE Innovative Smart Grid Technologies - Asia . 2018

机译：离散统计分析与极限学习机相结合的改进碳交易行为建模方法
5. A learning automaton approach to trajectory learning and control system design using dynamic recurrent neural networks. [D] . Condarcure, Thomas A. 1993

机译：一种使用动态递归神经网络进行轨迹学习和控制系统设计的学习自动机方法。
6. Enhanced robust finite-time passivity for Markovian jumping discrete-time BAM neural networks with leakage delay [O] . C Sowmiya, R Raja, Jinde Cao, -1

机译：具有泄漏延迟的马尔可夫跳跃离散时间BAM神经网络的增强鲁棒有限时间无源性
7. Beyond single discrete responses: An integrative and multidimensional analysis of behavioral dynamics assisted by Machine Learning [O] . Alejandro Leon, Varsovia Hernandez-Eslava, Juan Lopez, 2021

机译：除了单一离散的反应之外：对机器学习辅助的行为动力学的一体化和多维分析
8. Finite-Time Lagrangian Transport Analysis: Stable and Unstable Manifolds of Hyperbolic Trajectories and Finite-Time Lyapunov Exponents [R] . Branicki, M., Wiggins, S. 2009

机译：有限时滞拉格朗日输运分析：双曲线轨迹和有限时间Lyapunov指数的稳定和不稳定流形

A Conclusive Analysis of the Finite-Time Behavior of the Discretized Pursuit Learning Automaton

摘要

著录项

相似文献

相关主题

期刊订阅