首页> 外文会议>Health Informatics Meets Digital Healtd Conference >Evaluating Performance and Interpretability of Machine Learning Methods for Predicting Delirium in Gerontopsychiatric Patients
【24h】

Evaluating Performance and Interpretability of Machine Learning Methods for Predicting Delirium in Gerontopsychiatric Patients

机译:在非洲经济学患者预测谵妄预测机床学习方法的性能和可解释性

获取原文

摘要

Delirium is an acute mental disturbance that particularly occurs during hospital stay. Current clinical assessment instruments include the Delirium Observation Screening Scale (DOSS) or the Confusion Assessment Method (CAM). The aim of this work is to analyze the performance of machine learning approaches to detect delirium based on DOSS and CAM information obtained from two geropsychiatric wards in Tyrol. From a machine learning perspective, the questions of these two assessment instruments represent the features and the ICD 10 diagnoses of delirium (yes/no) is the corresponding class variable. We compare seven popular classification methods and analyze the performance and interpretability of the learning models. As our dataset is highly imbalanced, we also evaluate the effect of common sampling methods including down- and up-sampling methods as well as hybrid methods. Our results indicate a high predictive ability of advanced methods such as Random Forest that can handle even unbalanced datasets. Overall, combining a good performance of a prediction model with the ability of users to understand the prediction is challenging. However, for clinical application in fully electronic settings, a good performance seems to be more important than an easy interpretation of the prediction by the user. On the other hand, explanations of decisions are often needed to assess other criteria such as safety.
机译:谵妄是一种急性心动障碍,特别是在住院期间发生的剧烈障碍。目前的临床评估仪器包括谵妄观察筛分秤(DOS)或混乱评估方法(CAM)。这项工作的目的是分析机器学习方法的性能,以基于从蒂罗尔两位Geropsychiatric病房获得的DOSS和CAM信息来检测谵妄。从机器学习的角度来看,这两个评估仪器的问题代表了谵妄(是/否)的特征和ICD 10诊断是相应的类变量。我们比较七种流行的分类方法,分析学习模型的性能和可解释性。随着我们的数据集高度不平衡,我们还评估了常用采样方法的效果,包括下抽样方法以及混合方法。我们的结果表明了高级方法的高预测能力,例如可以处理甚至不平衡数据集的随机林。总的来说,将预测模型的良好性能与用户理解预测的能力相结合,是挑战性的。然而,对于完全电子设置的临床应用,良好的性能似乎比用户预测的简单解释更重要。另一方面,通常需要解释决策来评估其他标准,如安全性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号