首页> 美国卫生研究院文献>other >Recognition and Evaluation of Clinical Section Headings in Clinical Documents Using Token-Based Formulation with Conditional Random Fields
【2h】

Recognition and Evaluation of Clinical Section Headings in Clinical Documents Using Token-Based Formulation with Conditional Random Fields

机译:使用基于令牌的带条件随机字段的公式对临床文档中临床节标题的识别和评估

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Electronic health record (EHR) is a digital data format that collects electronic health information about an individual patient or population. To enhance the meaningful use of EHRs, information extraction techniques have been developed to recognize clinical concepts mentioned in EHRs. Nevertheless, the clinical judgment of an EHR cannot be known solely based on the recognized concepts without considering its contextual information. In order to improve the readability and accessibility of EHRs, this work developed a section heading recognition system for clinical documents. In contrast to formulating the section heading recognition task as a sentence classification problem, this work proposed a token-based formulation with the conditional random field (CRF) model. A standard section heading recognition corpus was compiled by annotators with clinical experience to evaluate the performance and compare it with sentence classification and dictionary-based approaches. The results of the experiments showed that the proposed method achieved a satisfactory F-score of 0.942, which outperformed the sentence-based approach and the best dictionary-based system by 0.087 and 0.096, respectively. One important advantage of our formulation over the sentence-based approach is that it presented an integrated solution without the need to develop additional heuristics rules for isolating the headings from the surrounding section contents.
机译:电子健康记录(EHR)是一种数字数据格式,用于收集有关单个患者或人群的电子健康信息。为了增强EHR的有意义的使用,已经开发了信息提取技术来识别EHR中提到的临床概念。然而,不能仅基于公认的概念而不考虑其上下文信息来了解EHR的临床判断。为了提高电子病历的可读性和可及性,这项工作开发了一种用于临床文件的章节标题识别系统。与将章节标题识别任务表述为句子分类问题相反,这项工作提出了一种带有条件随机场(CRF)模型的基于令牌的表述。注释者由具有临床经验的注释者编写了标准的标题识别语料库,以评估其性能并将其与句子分类和基于字典的方法进行比较。实验结果表明,所提出的方法获得了令人满意的F分数0.942,比基于句子的方法和基于最佳字典的系统分别高0.087和0.096。与基于句子的方法相比,我们的表述的一个重要优点是它提供了一种集成的解决方案,而无需开发其他启发式规则来将标题与周围的章节内容隔离开来。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号