Given a Markov decision process (MDP) with expressed prior uncertainties in the process transition probabilities, we consider the problem of computing a policy that optimizes expected total (finite-horizon) reward. Implicitly, such a policy would effectively resolve the "exploration-versus-exploitation tradeoff" faced, for example, by an agent that seeks to optimize total reinforcement obtained over the entire duration of its interaction with an uncertain world. A Bayesian formulation leads to an associated MDP defined over a set of generalized process "hyperstates" whose cardinality grows exponentially with the planning horizon. Here we retain the full Bayesian framework, but sidestep intractability by applying techniques from reinforcement learning theory. We apply our resulting actor-critic algorithm to a problem of "optimal probing," in which the task is to identify unknown transition probabilities of an MDP using online experience.
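As a concrete illustration of the Bayesian setup described above (a minimal sketch, not the paper's actor-critic algorithm), the following Python fragment maintains Dirichlet posteriors over the unknown transition probabilities of a small MDP; the pair (physical state, current Dirichlet counts) constitutes the hyperstate, and each observed transition updates the counts conjugately. All names (`n_states`, `alpha`, etc.) and the uniformly random stand-in for the probing policy are illustrative assumptions.

```python
# Minimal sketch: Dirichlet hyperstate updates for an MDP with unknown
# transition probabilities. Not the paper's algorithm; illustrative only.
import numpy as np

n_states, n_actions = 3, 2
rng = np.random.default_rng(0)

# Unknown true transition model -- what an optimal probe would identify.
true_P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))

# Dirichlet prior: one vector of pseudo-counts per (state, action) pair.
# Together with the physical state, these counts form the "hyperstate".
alpha = np.ones((n_states, n_actions, n_states))

def posterior_mean(s, a):
    """Current Bayes estimate of P(s' | s, a)."""
    return alpha[s, a] / alpha[s, a].sum()

s = 0
for t in range(1000):
    a = rng.integers(n_actions)           # stand-in for a probing policy
    s_next = rng.choice(n_states, p=true_P[s, a])
    alpha[s, a, s_next] += 1              # conjugate hyperstate update
    s = s_next

# Posterior means drift toward the unknown transition probabilities.
print(np.abs(posterior_mean(0, 0) - true_P[0, 0]).max())
```

The exponential blow-up the abstract refers to arises because planning must consider every reachable configuration of these counts; the sketch only tracks the single trajectory actually experienced, which is what makes the reinforcement-learning approximation tractable.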