首页> 美国卫生研究院文献>other >Identification of Patients with Family History of Pancreatic Cancer - Investigation of an NLP System Portability
【2h】

Identification of Patients with Family History of Pancreatic Cancer - Investigation of an NLP System Portability

机译:鉴定具有胰腺癌家族史的患者-NLP系统便携性调查

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this study we have developed a rule-based natural language processing (NLP) system to identify patients with family history of pancreatic cancer. The algorithm was developed in a Unstructured Information Management Architecture (UIMA) framework and consisted of section segmentation, relation discovery, and negation detection. The system was evaluated on data from two institutions. The family history identification precision was consistent across the institutions shifting from 88.9% on Indiana University (IU) dataset to 87.8% on Mayo Clinic dataset. Customizing the algorithm on the the Mayo Clinic data, increased its precision to 88.1%. The family member relation discovery achieved precision, recall, and F-measure of 75.3%, 91.6% and 82.6% respectively. Negation detection resulted in precision of 99.1%. The results show that rule-based NLP approaches for specific information extraction tasks are portable across institutions; however customization of the algorithm on the new dataset improves its performance.
机译:在这项研究中,我们开发了一种基于规则的自然语言处理(NLP)系统,以识别具有胰腺癌家族史的患者。该算法是在非结构化信息管理体系结构(UIMA)框架中开发的,包括节分段,关系发现和否定检测。该系统是根据来自两个机构的数据进行评估的。整个机构的家族史识别精度均保持一致,从印第安纳大学(IU)数据集的88.9%变为Mayo Clinic数据集的87.8%。根据Mayo Clinic数据自定义算法,其精度提高到88.1%。亲属关系发现的准确率,召回率和F测度分别达到75.3%,91.6%和82.6%。否定检测的精度为99.1%。结果表明,针对特定信息提取任务的基于规则的NLP方法可跨机构移植。但是,在新数据集上自定义算法可提高其性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号