...
首页> 外文期刊>IEEE Transactions on Signal Processing >Decentralized Online Convex Optimization With Event-Triggered Communications
【24h】

Decentralized Online Convex Optimization With Event-Triggered Communications

机译:与事件触发通信分散在线凸优化

获取原文
获取原文并翻译 | 示例

摘要

Decentralized multi-agent optimization usually relies on information exchange between neighboring agents, which can incur unaffordable communication overhead in practice. To reduce the communication cost, we apply event-triggering technique to the decentralized multi-agent online convex optimization problem, where each agent is associated with a time-varying local loss function and the goal is to minimize the accumulated total loss (the sum of all local loss functions) by choosing appropriate actions sequentially. We first develop an event-triggered decentralized online subgradient descent algorithm for the full information case, where the local loss function is fully revealed to each agent at each time. We establish an upper bound for the regret of each agent in terms of the event-triggering thresholds. It is shown that the regret is sublinear provided that the event-triggering thresholds converge to zero as time goes to infinity. The algorithm and analysis are further extended to the scenario of bandit feedback, where only the values of the local loss function at two random points close to the current action are disclosed to each agent. We show that the two-point bandit feedback does not degrade the performance of the proposed algorithm in order sense and a regret bound similar to the full information case can be established. Finally, numerical results on the problem of decentralized online least squares are presented to validate the proposed algorithms.
机译:分散的多代理优化通常依赖于邻近代理之间的信息交换,这在实践中可能会产生不适算的通信开销。为了降低通信成本,我们将事件触发技术应用于分散的多代理在线凸优化问题,其中每个代理与时变的本地丢失功能相关联,目标是最小化累积的总损失(总和)所有本地丢失功能)通过顺序选择适当的行动。我们首先开发一个事件触发的分散的在线子缩放性缩减算法,用于完整的信息情况,其中当地丢失功能每次都会完全显示给每个代理。在事件触发阈值方面,我们建立了每个代理的遗憾的上限。结果表明,随着时间触发到无穷大,事件触发阈值会聚到零。该算法和分析进一步扩展到强盗反馈的场景,其中仅向每个代理公开了靠近当前动作的两个随机点的局部丢失功能的值。我们表明,两点强盗反馈不会降低所提出的算法在订单意义上的性能,并且可以建立类似于完整信息情况的遗憾。最后,提出了对分散的在线最小二乘问题的数值结果来验证所提出的算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号