首页> 外文期刊>Journal of the American Medical Informatics Association : >Automated encoding of clinical documents based on natural language processing.
【24h】

Automated encoding of clinical documents based on natural language processing.

机译:基于自然语言处理的临床文档自动编码。

获取原文
获取原文并翻译 | 示例
           

摘要

OBJECTIVE: The aim of this study was to develop a method based on natural language processing (NLP) that automatically maps an entire clinical document to codes with modifiers and to quantitatively evaluate the method. METHODS: An existing NLP system, MedLEE, was adapted to automatically generate codes. The method involves matching of structured output generated by MedLEE consisting of findings and modifiers to obtain the most specific code. Recall and precision applied to Unified Medical Language System (UMLS) coding were evaluated in two separate studies. Recall was measured using a test set of 150 randomly selected sentences, which were processed using MedLEE. Results were compared with a reference standard determined manually by seven experts. Precision was measured using a second test set of 150 randomly selected sentences from which UMLS codes were automatically generated by the method and then validated by experts. RESULTS: Recall of the system for UMLS coding of all terms was .77 (95% CI.72-.81), and for coding terms that had corresponding UMLS codes recall was .83 (.79-.87). Recall of the system for extracting all terms was .84 (.81-.88). Recall of the experts ranged from .69 to .91 for extracting terms. The precision of the system was .89 (.87-.91), and precision of the experts ranged from .61 to .91. CONCLUSION: Extraction of relevant clinical information and UMLS coding were accomplished using a method based on NLP. The method appeared to be comparable to or better than six experts. The advantage of the method is that it maps text to codes along with other related information, rendering the coded output suitable for effective retrieval.
机译:目的:本研究的目的是开发一种基于自然语言处理(NLP)的方法,该方法可自动将整个临床文档映射到带有修饰符的代码并进行定量评估。方法:现有的NLP系统MedLEE被修改为自动生成代码。该方法涉及由MedLEE生成的结构化输出(包括发现和修饰符)的匹配,以获取最具体的代码。在两项单独的研究中,对应用于统一医学语言系统(UMLS)编码的召回率和精度进行了评估。使用150个随机选择的句子的测试集对召回率进行测量,这些句子使用MedLEE处理。将结果与7位专家手动确定的参考标准进行比较。使用包含150个随机选择的句子的第二个测试集测量精度,该方法从该句子中自动生成UMLS代码,然后由专家进行验证。结果:所有术语的UMLS编码系统的召回率为0.77(95%CI.72-.81),具有相应UMLS代码的编码词的召回率为0.83(.79-.87)。提取所有项的系统的召回率为0.84(.81-.88)。召回术语的专家召回范围从.69到.91。系统的精度为0.89(.87-.91),专家的精度范围为0.61至.91。结论:采用基于NLP的方法完成了相关临床信息的提取和UMLS编码。该方法似乎可以媲美或优于六位专家。该方法的优点是它将文本与其他相关信息一起映射到代码,使编码输出适合有效检索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号