Selecting Computations: Theory and Applications

机译：选择计算：理论与应用

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Sequential decision problems are often approximately solvable by simulating possible future action sequences. Metalevel decision procedures have been developed for selecting which action sequences to simulate, based on estimating the expected improvement in decision quality that would result from any particular simulation; an example is the recent work on using bandit algorithms to control Monte Carlo tree search in the game of Go. In this paper we develop a theoretical basis for metalevel decisions in the statistical framework of Bayesian selection problems, arguing (as others have done) that this is more appropriate than the bandit framework. We derive a number of basic results applicable to Monte Carlo selection problems, including the first finite sampling bounds for optimal policies in certain cases; we also provide a simple counterexample to the intuitive conjecture that an optimal policy will necessarily reach a decision in all cases. We then derive heuristic approximations in both Bayesian and distribution-free settings and demonstrate their superiority to bandit-based heuristics in one-shot decision problems and in Go.

机译：通过模拟可能的未来动作序列，顺序决策问题通常可以大致解决。已经开发出了元级决策程序，用于基于对任何特定模拟所导致的决策质量的预期改进的估计，来选择要模拟的动作序列;一个例子是最近在Go游戏中使用强盗算法来控制Monte Carlo树搜索的工作。在本文中，我们为贝叶斯选择问题的统计框架中的元级决策开发了理论基础，并认为（如其他人所做的那样）这比强盗框架更合适。我们得出了许多适用于蒙特卡洛选择问题的基本结果，包括在某些情况下最优政策的第一个有限采样界限;我们还为直觉猜想提供了一个简单的反例，即在所有情况下最优策略都必定会做出决定。然后，我们在贝叶斯和无分布环境中得出启发式近似值，并在单发决策问题和Go中证明它们优于基于强盗的启发式方法。

著录项

来源
《Conference on uncertainty in artificial intelligence》|2012年|346-355|共10页
会议地点
作者
Nicholas Hay; Stuart Russell; David Tolpin; Solomon Eyal Shimony;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. PROCEEDINGS OF THE THIRD CONFERENCE ON COMPUTATIONAL ALGEBRA, COMPUTATIONAL NUMBER THEORY AND APPLICATIONS [J] . Ali Reza Ashrafi, Hassan Daghigh Facta Universitatis. Series Mathematics and Informatics . 2019,第4期

机译：第三次计算代数，计算数理论和应用会议的诉讼程序
2. Towards a Unified Modeling and Knowledge-Representation Based on Lattice Theory: Computational Intelligence and Soft Computing Applications (Studies in Computational Intelligence) (Kaburlasos, V.G.; 2006) [book review] [J] . Kaburlasos V. G. IEEE Transactions on Neural Networks . 2007,第3期

机译：迈向基于格子理论的统一建模和知识表示：计算智能和软计算应用（计算智能研究）（Kaburlasos，V.G。； 2006年）[书评]
3. Some novel classification and learning methods and applications for neural networks-Selected papers from the Second International Conference on Bio-Inspired Computing: Theories and Applications [J] . Xiaopeng Wei, Qiang Zhang, Guangzhao Cui Neurocomputing . 2010,第4a6期

机译：神经网络的一些新颖的分类和学习方法及其应用-第二届国际生物启发计算国际会议论文选集：理论与应用
4. Selecting Computations: Theory and Applications [C] . Nicholas Hay, Stuart Russell, David Tolpin, Conference on Uncertainty in Artificial Intelligence . 2012

机译：选择计算：理论和应用
5. An efficient stochastic-based approach for biased Brownian motion: Fundamental theory and selected applications. [D] . Golbayani, Parvin. 2014

机译：一种有效的基于随机的偏布朗运动方法：基本理论和所选应用。
6. Chemical Theory and Computation Special Feature: Ab initio quantum chemistry: Methodology and applications [O] . Richard A. Friesner 2005

机译：化学理论与计算专题：从头算量子化学：方法论与应用
7. Estimation of the Computational Cost of Super-Droplet Method (Fast Algorithms in Computational Fluids : theory and applications) [O] . Shima Shin-ichiro 2008

机译：超小滴法的计算成本估算（计算流体中的快速算法：理论和应用）
8. Two Selected Topics Involving Theory and Applications of Infinite Arrays ofMicrostrip Elements [R] . Targonski, S. D. 1995

机译：涉及微带元素无限阵列理论与应用的两个专题

Selecting Computations: Theory and Applications

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅