首页> 美国卫生研究院文献>Computational and Mathematical Methods in Medicine >Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction
【2h】

Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction

机译:随机森林生存模型在提高决策树通用性中的应用:以急性心肌梗死为例

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Background. Tree models provide easily interpretable prognostic tool, but instable results. Two approaches to enhance the generalizability of the results are pruning and random survival forest (RSF). The aim of this study is to assess the generalizability of saturated tree (ST), pruned tree (PT), and RSF. Methods. Data of 607 patients was randomly divided into training and test set applying 10-fold cross-validation. Using training sets, all three models were applied. Using Log-Rank test, ST was constructed by searching for optimal cutoffs. PT was selected plotting error rate versus minimum sample size in terminal nodes. In construction of RSF, 1000 bootstrap samples were drawn from the training set. C-index and integrated Brier score (IBS) statistic were used to compare models. Results. ST provides the most overoptimized statistics. Mean difference between C-index in training and test set was 0.237. Corresponding figure in PT and RSF was 0.054 and 0.007. In terms of IBS, the difference was 0.136 in ST, 0.021 in PT, and 0.0003 in RSF. Conclusion. Pruning of tree and assessment of its performance of a test set partially improve the generalizability of decision trees. RSF provides results that are highly generalizable.
机译:背景。树模型提供了易于解释的预测工具,但结果不稳定。修剪和随机生存森林(RSF)是提高结果的可推广性的两种方法。这项研究的目的是评估饱和树(ST),修剪树(PT)和RSF的推广性。方法。将607例患者的数据随机分为训练和应用10倍交叉验证的测试集。使用训练集,应用了所有三个模型。使用Log-Rank检验,通过搜索最佳截止值构建ST。选择PT来绘制错误率与终端节点中最小样本量的关系。在构建RSF中,从训练集中抽取了1000个引导程序样本。使用C指数和综合Brier得分(IBS)统计量来比较模型。结果。 ST提供最优化的统计信息。训练中的C指数和测试集之间的平均差为0.237。 PT和RSF的对应数字为0.054和0.007。就IBS而言,ST的差异为0.136,PT的差异为0.021,RSF的差异为0.0003。结论。修剪树并评估其测试集的性能可部分提高决策树的通用性。 RSF提供了高度可推广的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号