首页> 外文会议>International conference on information technology in bio- and medical informatics >Modelling of Cancer Patient Records: A Structured Approach to Data Mining and Visual Analytics
【24h】

Modelling of Cancer Patient Records: A Structured Approach to Data Mining and Visual Analytics

机译:癌症患者记录建模:数据挖掘和可视化分析的结构化方法

获取原文

摘要

This research presents a methodology for health data analytics through a case study for modelling cancer patient records. Timeline-structured clinical data systems represent a new approach to the understanding of the relationship between clinical activity, disease pathologies and health outcomes. The novel Southampton Breast Cancer Data System contains episode and timeline-structured records on > 17,000 patients who have been treated in University Hospital Southampton and affiliated hospitals since the late 1970s. The system is under continuous development and validation. Modern data mining software and visual analytics tools permit new insights into temporally-structured clinical data. The challenges and outcomes of the application of such software-based systems to this complex data environment are reported here. The core data was anonymised and put through a series of pre-processing exercises to identify and exclude anomalous and erroneous data, before restructuring within a remote data warehouse. A range of approaches was tested on the resulting dataset including multidimensional modelling, sequential patterns mining and classification. Visual analytics software has enabled the comparison of survival times and surgical treatments. The systems tested proved to be powerful in identifying episode sequencing patterns which were consistent with real-world clinical outcomes. It is concluded that, subject to further refinement and selection, modern data mining techniques can be applied to large and heterogeneous clinical datasets to inform decision making.
机译:这项研究通过对癌症患者记录进行建模的案例研究,提出了一种用于健康数据分析的方法。时间轴结构的临床数据系统代表了一种了解临床活动,疾病病理和健康结果之间关系的新方法。新颖的南安普敦乳腺癌数据系统包含情节和时间轴结构的记录,记录了自1970年代后期以来在南安普敦大学附属医院和附属医院接受治疗的17,000多名患者。该系统正在不断开发和验证中。现代数据挖掘软件和可视化分析工具使人们能够对时间结构化的临床数据提供新的见解。本文报告了将这种基于软件的系统应用于这种复杂的数据环境的挑战和结果。在对远程数据仓库进行重组之前,对核心数据进行匿名处理,并进行了一系列预处理操作,以识别和排除异常和错误数据。在所得数据集上测试了一系列方法,包括多维建模,顺序模式挖掘和分类。视觉分析软件可以比较生存时间和手术治疗。被测试的系统被证明在识别与现实世界临床结果一致的情节测序模式方面功能强大。结论是,在进一步完善和选择的基础上,现代数据挖掘技术可以应用于大型且异构的临床数据集,以为决策提供依据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号