首页> 外文期刊>JMLR: Workshop and Conference Proceedings >On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration
【24h】

On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration

机译:线性随机近似:细粒聚鲁普特和非渐近浓度

获取原文
       

摘要

We undertake a precise study of the asymptotic and non-asymptotic properties of stochastic approximation procedures with Polyak-Ruppert averaging for solving a linear system $ar{A} heta = ar{b}$. When the matrix $ar{A}$ is Hurwitz, we prove a central limit theorem (CLT) for the averaged iterates with fixed step size and number of iterations going to infinity. The CLT characterizes the exact asymptotic covariance matrix, which is the sum of the classical Polyak-Ruppert covariance and a correction term that scales with the step size. Under assumptions on the tail of the noise distribution, we prove a non-asymptotic concentration inequality whose main term matches the covariance in CLT in any direction, up to universal constants. When the matrix $ar{A}$ is not Hurwitz but only has non-negative real parts in its eigenvalues, we prove that the averaged LSA procedure actually achieves an $O(1/T)$ rate in mean-squared error. Our results provide a more refined understanding of linear stochastic approximation in both the asymptotic and non-asymptotic settings. We also show various applications of the main results, including the study of momentum-based stochastic gradient methods as well as temporal difference algorithms in reinforcement learning.
机译:我们对Colyak-ruppert平均进行了对随机近似程序的渐近和非渐近性能的精确研究,用于求解线性系统$ bar {a} theta = bar {b} $。当Matrix $ Bar {A} $ urwitz时,我们证明了一个中央限位定理(CLT),以固定的步长和迭代的迭代迭代到无穷大。 CLT表征了精确的渐近协方差矩阵,这是经典的Polyak-ruppert协方差和校正术语的总和,该校正术语与阶梯尺寸缩放。在噪声分布尾的假设下,我们证明了非渐近浓度不平等,其主要术语与任何方向的CLT中的协方差相匹配,直至普遍常数。当矩阵$ {a} $不是赫尔维茨时,但只有其特征值中的非负实零件,我们证明了平均的LSA程序实际上实现了平均误差的$ O(1 / t)$率。我们的结果在渐近和非渐近设置中提供了更加精致的线性随机近似的理解。我们还显示了主要结果的各种应用,包括对动力基于随机梯度方法的研究以及加固学习中的时间差分算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号