首页> 外文期刊>Methods of information in medicine >Censoring weighted separate-and- conquer rule induction from survival data
【24h】

Censoring weighted separate-and- conquer rule induction from survival data

机译:从生存数据中检查加权加权独立规则

获取原文
获取原文并翻译 | 示例
           

摘要

Objectives: Rule induction is one of the major methods of machine learning. Rulebased models can be easily read and interpreted by humans, that makes them particularly useful in survival studies as they can help clinicians to better understand analysed data and make informed decisions about patient treatment. Although of such usefulness, there is still a little research on rule learning in survival analysis. In this paper we take a step towards rule-based analysis of survival data. Methods: We investigate so-called covering or separate-and-conquer method of rule induction in combination with a weighting scheme for handling censored observations. We also focus on rule quality measures being one of the key elements differentiating particular implementations of separate-andconquer rule induction algorithms. We examine 15 rule quality measures guiding rule induction process and reflecting a wide range of different rule learning heuristics. Results: The algorithm is extensively tested on a collection of 20 real survival datasets and compared with the state-of-the-art survival trees and random survival forests algorithms. Most of the rule quality measures outperform Kaplan-Meier estimate and perform at least equally well as tree-based algorithms. Conclusions: Separate-and-conquer rule induction in combination with weighting scheme is an effective technique for building rule-based models of survival data which, according to predictive accuracy, are competitive with tree-based representations.
机译:目标:规则归纳是机器学习的主要方法之一。基于规则的模型可以很容易地被人类阅读和解释,这使得它们在生存研究中特别有用,因为它们可以帮助临床医生更好地理解分析数据并做出有关患者治疗的明智决定。尽管有这样的用处,但是在生存分析中仍然很少有关于规则学习的研究。在本文中,我们朝着基于规则的生存数据分析迈出了一步。方法:我们研究所谓的覆盖或分离征服法则,结合加权方案来处理被检查的观测值。我们还将重点放在规则质量度量上,这是区分单独征服规则归纳算法的特定实现的关键要素之一。我们研究了15条规则质量度量,指导规则归纳过程并反映了各种不同的规则学习启发式方法。结果:该算法在20个真实生存数据集上进行了广泛测试,并与最新的生存树和随机生存森林算法进行了比较。大多数规则质量度量均胜过Kaplan-Meier估计,并且至少与基于树的算法具有相同的性能。结论:分离和征服规则归纳与加权方案相结合是一种有效的技术,可用于建立基于规则的生存数据模型,根据预测准确性,该模型可与基于树的表示形式相竞争。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号