首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >Predicting hospital readmission from longitudinal healthcare data using graph pattern mining based temporal phenotypes
【24h】

Predicting hospital readmission from longitudinal healthcare data using graph pattern mining based temporal phenotypes

机译:使用基于图模式挖掘的时间表型从纵向医疗保健数据预测医院的再入院

获取原文

摘要

The rapidly increasing availability of healthcare data from multiple heterogeneous sources has spearheaded the adoption of data-driven approaches for improved clinical research, decision making, and patient management. The patient healthcare data are usually longitudinal and can be expressed as medical event sequences, where the events include clinical diagnosis, medications, laboratory reports, etc. Because healthcare data has both longitudinal and heterogeneous attributes, analyzing healthcare data is an inherently difficult challenge. In this paper, we propose a hospital readmission prediction method using temporal phenotypes, namely the Tephe. Specifically, each patient's medical event sequence is first represented by a temporal graph, which captures temporal relationships of the medical events in each event sequence and makes the raw data more intuitive. Based on graph pattern mining, we define more significant frequent subgraphs as temporal phenotypes. This enables us to better understand the disease evolving patterns and treatment approach. In addition, we designed an improved greedy algorithm to find the optimal expression coefficient of frequent subgraphs for each patient. Finally, based on the optimal expression coefficient of the frequent subgraph, random forests are used to perform prediction tasks. The experimental results show that our proposed method is more accurate in the prediction tasks compared with the baselines.
机译:来自多个异构来源的医疗保健数据的快速增加性已经刺激了采用数据驱动的方法,以改善临床研究,决策和患者管理。患者医疗保健数据通常是纵向的,可以表达为医疗事件序列,其中事件包括临床诊断,药物,实验室报告等。因为医疗保健数据具有纵向和异质属性,分析医疗保健数据是一种本质上难以挑战。在本文中,我们提出了一种使用时间表型的医院入院预测方法,即Tephe。具体地,每个患者的医疗事件序列首先由时间图表示,该时间图表捕获了每个事件序列中医疗事件的时间关系,并使原始数据更直观。基于图形模式挖掘,我们将更重要的频繁子图定义为时间表型。这使我们能够更好地了解不断发展的模式和治疗方法。此外,我们设计了一种改进的贪婪算法,以找到每个患者的频繁子图的最佳表达系数。最后,基于频繁子图的最佳表达系数,使用随机林用于执行预测任务。实验结果表明,与基线相比,我们所提出的方法在预测任务中更准确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号