首页> 外文会议>Artificial Intelligence in Medicine >Drifting Concepts as Hidden Factors in Clinical Studies
【24h】

Drifting Concepts as Hidden Factors in Clinical Studies

机译:漂移概念是临床研究中的隐性因素

获取原文

摘要

Most statistical, Machine Learning and Data Mining algorithms assume that the data they use is a random sample drawn from a stationary distribution. Unfortunately, many of the databases available for mining today violate this assumption. They were gathered over months or years, and the underlying processes generating them may have changed during this time, sometimes radically (this is also known as a concept drift). In clinical institutions, where the patients' data are regularly stored in a central computer databases, similar situations may occur. Expert physicians may easily, even unconsciously, adapt to the changed environment, whereas Machine Learning and Data Mining tools may fail due to their underlaying assumptions. It is therefore important to detect and adapt to the changed situation. In the paper we review several techniques for dealing with concept drift in Machine Learning and Data Mining frameworks and evaluate their use in clinical studies with a case study of coronary artery disease diagnostics.
机译:大多数统计,机器学习和数据挖掘算法都假定它们使用的数据是从平稳分布中提取的随机样本。不幸的是,当今许多可用于挖掘的数据库都违反了这一假设。它们是经过数月或数年收集的,在此期间,生成它们的基本过程可能已更改,有时是根本性的更改(这也称为概念漂移)。在临床机构中,患者的数据定期存储在中央计算机数据库中,可能会发生类似情况。专家医生可能会轻松地甚至不自觉地适应变化的环境,而机器学习和数据挖掘工具可能会因其基础假设而失败。因此,重要的是检测并适应变化的情况。在本文中,我们回顾了在机器学习和数据挖掘框架中处理概念漂移的几种技术,并以冠状动脉疾病诊断为例对它们在临床研究中的应用进行了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号