首页> 外文会议>International Conference on Informatics, Management, and Technology in Healthcare >Taming Big Data: An Information Extraction Strategy for Large Clinical Text Corpora
【24h】

Taming Big Data: An Information Extraction Strategy for Large Clinical Text Corpora

机译:驯服大数据:大型临床文本语料库的信息提取策略

获取原文

摘要

Concepts of interest for clinical and research purposes are not uniformly distributed in clinical text available in electronic medical records. The purpose of our study was to identify filtering techniques to select 'high yield' documents for increased efficacy and throughput. Using two large corpora of clinical text, we demonstrate the identification of 'high yield' document sets in two unrelated domains: homelessness and indwelling urinary catheters. For homelessness, the high yield set includes homeless program and social work notes. For urinary catheters, concepts were more prevalent in notes from hospitalized patients; nursing notes accounted for a majority of the high yield set. This filtering will enable customization and refining of information extraction pipelines to facilitate extraction of relevant concepts for clinical decision support and other uses.
机译:临床和研究目的的景观概念并不均匀分布在电子医疗记录中可用的临床文本中。我们研究的目的是识别过滤技术,以选择“高收益率”文档,以提高疗效和吞吐量。使用两个大型临床文本语言,我们展示了两个无关域中的“高产”文件集的识别:无家可归者和留置尿导管。对于无家可归,高产套装包括无家可归的计划和社会工作票据。对于尿道导管,概念在住院患者的票据中更为普遍;护理票据占大多数高收益率集。该滤波将启用信息提取管道的定制和精炼,以便于提取相关概念以进行临床决策支持和其他用途。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号