【24h】

The Shortest Path Problem Under Partial Monitoring

机译:部分监视下的最短路径问题

获取原文
获取原文并翻译 | 示例

摘要

The on-line shortest path problem is considered under partial monitoring scenarios. At each round, a decision maker has to choose a path between two distinguished vertices of a weighted directed acyclic graph whose edge weights can change in an arbitrary (adversarial) way such that the loss of the chosen path (defined as the sum of the weights of its composing edges) be small. In the multi-armed bandit setting, after choosing a path, the decision maker learns only the weights of those edges that belong to the chosen path. For this scenario, an algorithm is given whose average cumulative loss in n rounds exceeds that of the best path, matched off-line to the entire sequence of the edge weights, by a quantity that is proportional to 1 / (n~(1/2)) and depends only polynomially on the number of edges of the graph. The algorithm can be implemented with linear complexity in the number of rounds n and in the number of edges. This result improves earlier bandit-algorithms which have performance bounds that either depend exponentially on the number of edges or converge to zero at a slower rate than O(1 / (n~(1/2))). An extension to the so-called label efficient setting is also given, where the decision maker is informed about the weight of the chosen path only with probability ε < 1. Applications to routing in packet switched networks along with simulation results are also presented.
机译:在部分监视方案下考虑了在线最短路径问题。在每个回合中,决策者必须在加权有向无环图的两个不同顶点之间选择一条路径,其有向边的权重可以以任意(对抗性)方式变化,从而使所选路径的损失(定义为权重之和)组成边缘的宽度)很小。在多臂匪徒设置中,选择路径后,决策者仅了解属于所选路径的那些边的权重。对于这种情况,给出了一种算法,其算法在n轮中的平均累积损耗超过了最佳路径的平均损耗,并与边缘权重的整个序列进行离线匹配,其数量与1 /(n〜(1 / 2)),并且仅取决于多项式的边数。可以在回合数n和边数中以线性复杂度实现该算法。该结果改进了较早的匪徒算法,其性能范围要么以指数方式取决于边的数量,要么以比O(1 /(n〜(1/2))更低的速率收敛到零。还扩展了所谓的标签有效设置,其中仅以概率ε<1通知决策者有关所选路径的权重。还介绍了在分组交换网络中路由的应用以及仿真结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号