首页> 外文期刊>BMC Bioinformatics >A nonparametric Bayesian method of translating machine learning scores to probabilities in clinical decision support
【24h】

A nonparametric Bayesian method of translating machine learning scores to probabilities in clinical decision support

机译:一种将机器学习分数转换为临床决策支持概率的非参数贝叶斯方法

获取原文
           

摘要

Background Probabilistic assessments of clinical care are essential for quality care. Yet, machine learning, which supports this care process has been limited to categorical results. To maximize its usefulness, it is important to find novel approaches that calibrate the ML output with a likelihood scale. Current state-of-the-art calibration methods are generally accurate and applicable to many ML models, but improved granularity and accuracy of such methods would increase the information available for clinical decision making. This novel non-parametric Bayesian approach is demonstrated on a variety of data sets, including simulated classifier outputs, biomedical data sets from the University of California, Irvine (UCI) Machine Learning Repository, and a clinical data set built to determine suicide risk from the language of emergency department patients. Results The method is first demonstrated on support-vector machine (SVM) models, which generally produce well-behaved, well understood scores. The method produces calibrations that are comparable to the state-of-the-art Bayesian Binning in Quantiles (BBQ) method when the SVM models are able to effectively separate cases and controls. However, as the SVM models’ ability to discriminate classes decreases, our approach yields more granular and dynamic calibrated probabilities comparing to the BBQ method. Improvements in granularity and range are even more dramatic when the discrimination between the classes is artificially degraded by replacing the SVM model with an ad hoc k-means classifier. Conclusions The method allows both clinicians and patients to have a more nuanced view of the output of an ML model, allowing better decision making. The method is demonstrated on simulated data, various biomedical data sets and a clinical data set, to which diverse ML methods are applied. Trivially extending the method to (non-ML) clinical scores is also discussed.
机译:背景技术临床护理的概率评估对于高质量护理至关重要。但是,支持这种护理过程的机器学习仅限于分类结果。为了最大程度地发挥其作用,重要的是找到一种新颖的方法来用可能性标度校准ML输出。当前最先进的校准方法通常是准确的,并且适用于许多ML模型,但是此类方法的改进的粒度和准确性将增加可用于临床决策的信息。这种新颖的非参数贝叶斯方法在各种数据集上得到了证明,包括模拟分类器输出,加利福尼亚大学欧文分校(UCI)机器学习存储库中的生物医学数据集以及用于确定自杀风险的临床数据集。急诊科患者的语言。结果该方法首先在支持向量机(SVM)模型上得到了证明,该模型通常会产生行为良好,易于理解的分数。当SVM模型能够有效分离案例和控件时,该方法所产生的校准结果可与最新的贝叶斯分位数分贝(BBQ)方法相媲美。但是,随着SVM模型区分类别的能力下降,与BBQ方法相比,我们的方法产生的粒度和动态校准概率更高。当通过使用ad hoc k-means分类器替换SVM模型来人为地降低类之间的区别时,粒度和范围的改进甚至会更加显着。结论该方法使临床医生和患者对ML模型的输出都有更细微的了解,从而可以做出更好的决策。在模拟数据,各种生物医学数据集和临床数据集上演示了该方法,并对其应用了各种ML方法。还讨论了将该方法简单扩展到(非ML)临床评分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号