首页> 外文会议> >Identifying Anatomical Phrases in Clinical Reports by Shallow Semantic Parsing Methods
【24h】

Identifying Anatomical Phrases in Clinical Reports by Shallow Semantic Parsing Methods

机译:浅层语义分析方法识别临床报告中的解剖短语

获取原文

摘要

Natural language processing (NLP) is being applied for several information extraction tasks in the biomedical domain. The unique nature of clinical information requires the need for developing an NLP system designed specifically for the clinical domain. We describe a method to identify semantically coherent phrases within clinical reports. This is an important step towards full syntactic parsing within a clinical NLP system. We use this semantic phrase chunker to identify anatomical phrases within radiology reports related to the genitourinary domain. A discriminative classifier based on support vector machines was used to classify words into one of five phrase classification categories. Training of the classifier was performed using 1000 hand-tagged sentences from a corpus of genitourinary radiology reports. Features used by the classifier include n-grams, syntactic tags and semantic labels. Evaluation was conducted on a blind test set of 250 sentences from the same domain. The system achieved overall performance scores of 0.87 (precision), 0.91 (recall) and 0.89 (balanced f-score). Anatomical phrase extraction can be rapidly and accurately accomplished
机译:自然语言处理(NLP)正在生物医学领域中用于多种信息提取任务。临床信息的独特性质要求开发专门为临床领域设计的NLP系统。我们描述了一种在临床报告中识别语义一致短语的方法。这是迈向临床NLP系统中完整语法分析的重要一步。我们使用此语义短语组块器来识别放射线报告中与泌尿生殖系统领域有关的解剖学短语。基于支持向量机的判别式分类器用于将单词分类为五个短语分类类别之一。使用来自泌尿生殖放射学报告语料库的1000个带有手工标记的句子对分类器进行训练。分类器使用的功能包括n-gram,语法标签和语义标签。对来自相同领域的250个句子的盲测集进行了评估。该系统的整体性能得分分别为0.87(精度),0.91(召回)和0.89(平衡f得分)。解剖短语提取可以快速而准确地完成

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号