首页> 外文会议>Conference on Uncertainty in Artificial Intelligence >Efficient Online Learning for Optimizing Value of Information: Theory and Application to Interactive Troubleshooting
【24h】

Efficient Online Learning for Optimizing Value of Information: Theory and Application to Interactive Troubleshooting

机译:高效在线学习,以优化信息价值:理论和应用于交互式故障排除

获取原文

摘要

We consider the optimal value of information problem, where the goal is to sequentially select a set of tests with a minimal cost, so that one can efficiently make the best decision based on the observed outcomes. Existing algorithms are either heuristics with no guarantees, or scale poorly (with exponential run time in terms of the number of available tests). Moreover, these methods assume a known distribution over the test outcomes, which is often not the case in practice. We propose a sampling-based online learning framework to address the above issues. First, assuming the distribution over hypotheses is known, we propose a dynamic hypothesis enumeration strategy, which allows efficient information gathering with strong theoretical guarantees. We show that with sufficient amount of samples, one can identify a near-optimal decision with high probability. Second, when the parameters of the hypotheses distribution are unknown, we propose an algorithm which learns the parameters progressively via posterior sampling in an online fashion. We further establish a rigorous bound on the expected regret. We demonstrate the effectiveness of our approach on a real-world interactive troubleshooting application, and show that one can efficiently make high-quality decisions with low cost.
机译:我们考虑信息问题的最佳价值,其中目标是以最小的成本顺序地选择一组测试,因此可以基于观察结果有效地做出最佳决策。现有的算法是具有保证的启发式,或者没有保证,或比例差(具有可用测试数量的指数运行时间)。此外,这些方法在测试结果上假设已知分布,这通常不是实践中的情况。我们提出了一种基于采样的在线学习框架来解决上述问题。首先,假设已知假设的分布,我们提出了一种动态假设枚举策略,这允许有效地收集具有强烈的理论担保。我们表明,具有足够量的样品,可以识别具有高概率的近乎最佳决策。其次,当假设分布的参数未知时,我们提出了一种算法,该算法通过以在线方式通过后部采样逐渐了解参数。我们进一步建立了预期遗憾的严格束缚。我们展示了我们对真实互动故障排除应用程序的方法的有效性,并表明人们可以有效地具有低成本的高质量决策。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号