
Q-learning for estimating optimal dynamic treatment rules from observational data



Abstract

The area of dynamic treatment regimes (DTR) aims to make inference about adaptive, multistage decision-making in clinical practice. A DTR is a set of decision rules, one per interval of treatment, where each decision is a function of treatment and covariate history that returns a recommended treatment. Q-learning is a popular method from the reinforcement learning literature that has recently been applied to estimate DTRs. While, in principle, Q-learning can be used for both randomized and observational data, the focus in the literature thus far has been exclusively on the randomized treatment setting. We extend the method to incorporate measured confounding covariates, using direct adjustment and a variety of propensity score approaches. The methods are examined under various settings, including non-regular scenarios. We illustrate the methods by examining the effect of breastfeeding on vocabulary testing, based on data from the Promotion of Breastfeeding Intervention Trial.
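The two-stage Q-learning procedure the abstract refers to can be sketched as follows. This is a minimal illustration, not the authors' implementation: the generative model, variable names, and coefficient values are all hypothetical, and confounding is handled here by the direct-adjustment approach (including the measured confounders as covariates in each stage's Q-function regression), which is one of the strategies the abstract mentions.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5000

# Hypothetical observational data: X1 confounds stage-1 treatment A1,
# X2 confounds stage-2 treatment A2 (treatments are NOT randomized).
X1 = rng.normal(size=n)
A1 = (rng.uniform(size=n) < 1.0 / (1.0 + np.exp(-X1))).astype(float)
X2 = 0.5 * X1 + 0.3 * A1 + rng.normal(size=n)
A2 = (rng.uniform(size=n) < 1.0 / (1.0 + np.exp(-X2))).astype(float)
# Outcome with a stage-2 treatment effect ("blip") of 0.5 - X2.
Y = X1 + A1 * (1.0 + 0.5 * X1) + A2 * (0.5 - X2) + rng.normal(size=n)

def ols(M, y):
    """Least-squares fit of y on design matrix M."""
    beta, *_ = np.linalg.lstsq(M, y, rcond=None)
    return beta

# Stage 2: regress Y on history (with confounders adjusted directly)
# plus treatment-by-covariate interactions, Q2(H2, a2).
H2 = np.column_stack([np.ones(n), X1, A1, X2])   # main-effect terms
B2 = np.column_stack([np.ones(n), X2])           # blip (interaction) terms
b2 = ols(np.column_stack([H2, A2[:, None] * B2]), Y)

def q2hat(a2):
    return H2 @ b2[:4] + a2 * (B2 @ b2[4:])

# Pseudo-outcome: value of following the estimated optimal stage-2 rule,
# i.e. the maximum of the fitted Q-function over a2 in {0, 1}.
pseudo = np.maximum(q2hat(0.0), q2hat(1.0))

# Stage 1: regress the pseudo-outcome on stage-1 history and treatment.
H1 = np.column_stack([np.ones(n), X1])
b1 = ols(np.column_stack([H1, A1[:, None] * H1]), pseudo)

# Estimated stage-2 rule: treat (a2 = 1) when b2[4] + b2[5]*X2 > 0;
# here b2[4:] should approximate the true blip coefficients (0.5, -1.0).
```

Because the confounders X1 and X2 enter the Q-function regressions, the stage-2 blip coefficients are recovered despite the confounded treatment assignment; the propensity-score variants in the paper would instead weight or match on estimated treatment probabilities. The maximization step that forms the pseudo-outcome is also where the non-regular scenarios mentioned in the abstract arise, since the max is non-smooth when the blip is near zero.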

