首页> 外文会议>Biomedical data management and graph online querying >Vector Space Models for Encoding and Retrieving Longitudinal Medical Record Data
【24h】

Vector Space Models for Encoding and Retrieving Longitudinal Medical Record Data

机译:用于纵向医疗记录数据编码和检索的向量空间模型

获取原文
获取原文并翻译 | 示例

摘要

Vector space models (VSMs) are widely used as information retrieval methods and have been adapted to many applications. In this paper, we propose a novel use of VSMs for classification and retrieval of longitudinal electronic medical record data. These data contain sequences of clinical events that are based on treatment decisions, but the treatment plan is not recorded with the events. The goals of our VSM methods are (1) to identify which plan a specific patient treatment sequence best matches and (2) to find patients whose treatment histories most closely follow a specific plan. We first build a traditional VSM that uses standard terms corresponding to the events found in clinical plans and treatment histories. We also consider temporal terms that represent binary relationships of precedence between or co-occurrence of these events. We create four alternative VSMs that use different combinations of standard and temporal terms as dimensions, and we evaluate their performance using manually annotated data on chemotherapy plans and treatment histories for breast cancer patients. In classifying treatment histories, the best approach used temporal terms, which had 87 % accuracy in identifying the correct clinical plan. For information retrieval, our results showed that the traditional VSM performed best. Our results indicate that VSMs have good performance for classification and retrieval of longitudinal electronic medical records, but the results depend on how the model is constructed.
机译:向量空间模型(VSM)被广泛用作信息检索方法,并已适应许多应用。在本文中,我们提出了VSM在纵向电子病历数据分类和检索中的一种新颖用途。这些数据包含基于治疗决策的临床事件序列,但治疗计划未随事件记录。我们的VSM方法的目标是(1)确定最适合特定患者治疗顺序的计划,以及(2)查找治疗史最接近特定计划的患者。我们首先构建一个传统的VSM,它使用与临床计划和治疗历史中发现的事件相对应的标准术语。我们还考虑了表示这些事件之间优先顺序或同时发生的二进制关系的时间术语。我们创建了四个替代VSM,它们使用标准和时间术语的不同组合作为维度,并且我们使用有关乳腺癌患者的化疗计划和治疗历史的人工注释数据来评估其性能。在对治疗史进行分类时,最好的方法是使用时间术语,该术语在识别正确的临床计划中的准确性为87%。对于信息检索,我们的结果表明传统的VSM表现最佳。我们的结果表明,VSM在纵向电子病历的分类和检索方面具有良好的性能,但结果取决于模型的构建方式。

著录项

  • 来源
  • 会议地点 Waikoloa(US)
  • 作者

    Haider Syed; Amar K. Das;

  • 作者单位

    Social Computing Health Informatics Lab, Hanover, USA,Department of Computer Science, Dartmouth College, Hanover, USA;

    Social Computing Health Informatics Lab, Hanover, USA,Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号