首页> 美国卫生研究院文献>Journal of the American Medical Informatics Association : JAMIA >Improved Identification of Noun Phrases in Clinical Radiology Reports Using a High-Performance Statistical Natural Language Parser Augmented with the UMLS Specialist Lexicon
【2h】

Improved Identification of Noun Phrases in Clinical Radiology Reports Using a High-Performance Statistical Natural Language Parser Augmented with the UMLS Specialist Lexicon

机译:使用高性能统计自然语言解析器和UMLS专家词典增强了临床放射学报告中名词短语的识别度

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

>Objective: The aim of this study was to develop and evaluate a method of extracting noun phrases with full phrase structures from a set of clinical radiology reports using natural language processing (NLP) and to investigate the effects of using the UMLS® Specialist Lexicon to improve noun phrase identification within clinical radiology documents.>Design: The noun phrase identification (NPI) module is composed of a sentence boundary detector, a statistical natural language parser trained on a nonmedical domain, and a noun phrase (NP) tagger. The NPI module processed a set of 100 XML-represented clinical radiology reports in Health Level 7 (HL7)® Clinical Document Architecture (CDA)–compatible format. Computed output was compared with manual markups made by four physicians and one author for maximal (longest) NP and those made by one author for base (simple) NP, respectively. An extended lexicon of biomedical terms was created from the UMLS Specialist Lexicon and used to improve NPI performance.>Results: The test set was 50 randomly selected reports. The sentence boundary detector achieved 99.0% precision and 98.6% recall. The overall maximal NPI precision and recall were 78.9% and 81.5% before using the UMLS Specialist Lexicon and 82.1% and 84.6% after. The overall base NPI precision and recall were 88.2% and 86.8% before using the UMLS Specialist Lexicon and 93.1% and 92.6% after, reducing false-positives by 31.1% and false-negatives by 34.3%.>Conclusion: The sentence boundary detector performs excellently. After the adaptation using the UMLS Specialist Lexicon, the statistical parser's NPI performance on radiology reports increased to levels comparable to the parser's native performance in its newswire training domain and to that reported by other researchers in the general nonmedical domain.
机译:>目的:该研究的目的是开发和评估一种使用自然语言处理(NLP)从一组临床放射学报告中提取具有完整短语结构的名词短语的方法,并研究使用该方法的效果UMLS®Specialist Lexicon,以改善临床放射学文档中的名词短语识别。>设计:名词短语识别(NPI)模块由句子边界检测器组成,这是一种在非医学领域训练的统计自然语言解析器,以及名词短语(NP)标记器。 NPI模块以兼容Health 7(HL7)®临床文档架构(CDA)的格式处理了100组以XML表示的临床放射学报告。将计算的输出与由四位医生和一位作者为最大(最长)NP进行的人工标记和由一位作者为基本(简单)NP进行的人工标记进行比较。从UMLS专家词典中创建了扩展的生物医学术语词典,并用于改善NPI性能。>结果:测试集是50个随机选择的报告。句子边界检测器的准确率达到99.0%,召回率达到98.6%。使用UMLS Specialist Lexicon之前,整体最大NPI精度和召回率分别为78.9%和81.5%,之后为82.1%和84.6%。使用UMLS Specialist Lexicon之前,总体基本NPI精度和召回率分别为88.2%和86.8%,之后分别为93.1%和92.6%,从而使误报率降低了31.1%,误报率降低了34.3%。>结论:句子边界检测器表现出色。在使用UMLS专家词典进行改编之后,统计分析器在放射学报告上的NPI性能提高到与该分析器在新闻专线训练领域的本机性能和其他非医学领域的其他研究人员所报告的性能相当的水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号