Taming Big Data: An Information Extraction Strategy for Large Clinical Text Corpora



Abstract

Concepts of interest for clinical and research purposes are not uniformly distributed in clinical text available in electronic medical records. The purpose of our study was to identify filtering techniques to select 'high yield' documents for increased efficacy and throughput. Using two large corpora of clinical text, we demonstrate the identification of 'high yield' document sets in two unrelated domains: homelessness and indwelling urinary catheters. For homelessness, the high yield set includes homeless program and social work notes. For urinary catheters, concepts were more prevalent in notes from hospitalized patients; nursing notes accounted for a majority of the high yield set. This filtering will enable customization and refining of information extraction pipelines to facilitate extraction of relevant concepts for clinical decision support and other uses.
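The filtering idea in the abstract can be illustrated with a minimal sketch. It is not the study's actual method: the keyword matching, the note-type labels, the sample notes, and the `min_yield` threshold are all assumptions made here for illustration. The sketch computes, for each note type, the fraction of notes that mention a concept of interest, then keeps the note types whose fraction meets the threshold.

```python
from collections import defaultdict

# Hypothetical sample: (note_type, text) pairs. Not data from the study.
SAMPLE_NOTES = [
    ("nursing", "Foley catheter in place, draining clear urine."),
    ("nursing", "Indwelling urinary catheter removed this morning."),
    ("nursing", "Patient ambulating without assistance."),
    ("radiology", "Chest x-ray shows no acute disease."),
    ("radiology", "No fracture identified."),
]

# Hypothetical keyword list standing in for a real concept lexicon.
KEYWORDS = ["catheter", "foley"]


def yield_by_note_type(notes, keywords):
    """Return {note_type: fraction of notes mentioning any keyword}."""
    totals = defaultdict(int)
    hits = defaultdict(int)
    lowered = [k.lower() for k in keywords]
    for note_type, text in notes:
        totals[note_type] += 1
        body = text.lower()
        # Simple case-insensitive substring match (an assumption).
        if any(k in body for k in lowered):
            hits[note_type] += 1
    return {t: hits[t] / totals[t] for t in totals}


def high_yield_types(notes, keywords, min_yield=0.5):
    """Return note types whose yield is at least min_yield, highest first."""
    rates = yield_by_note_type(notes, keywords)
    selected = [t for t, r in rates.items() if r >= min_yield]
    return sorted(selected, key=lambda t: rates[t], reverse=True)


if __name__ == "__main__":
    print(yield_by_note_type(SAMPLE_NOTES, KEYWORDS))
    print(high_yield_types(SAMPLE_NOTES, KEYWORDS))
```

In a real pipeline the substring match would presumably be replaced by a proper concept extractor, and the threshold would be chosen from the observed distribution rather than fixed in advance.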


Bibliographic details

Source
International Conference on Informatics, Management, and Technology in Healthcare | 2015 | 4 pages
Authors
Adi V. GUNDLAPALLI; Guy DIVITA; Marjorie E. CARTER; Andrew REDD; Matthew H. SAMORE; Kalpana GUPTA; Barbara TRAUTNER
Original format: PDF
Chinese Library Classification: R-058

