首页> 外文期刊>Health informatics journal >University of California, Irvine-Pathology Extraction Pipeline: The pathology extraction pipeline for information extraction from pathology reports
【24h】

University of California, Irvine-Pathology Extraction Pipeline: The pathology extraction pipeline for information extraction from pathology reports

机译:加州大学尔湾分校病理提取管线:用于从病理报告中提取信息的病理提取管线

获取原文
获取原文并翻译 | 示例
       

摘要

We describe Pathology Extraction Pipeline (PEP)a new Open Health Natural Language Processing pipeline that we have developed for information extraction from pathology reports, with the goal of populating the extracted data into a research data warehouse. Specifically, we have built upon Medical Knowledge Analysis Tool pipeline (MedKATp), which is an extraction framework focused on pathology reports. Our particular contributions include additional customization and development on MedKATp to extract data elements and relationships from cancer pathology reports in richer detail than at present, an abstraction layer that provides significantly easier configuration of MedKATp for extraction tasks, and a machine-learning-based approach that makes the extraction more resilient to deviations from the common reporting format in a pathology reports corpus. We present experimental results demonstrating the effectiveness of our pipeline for information extraction in a real-world task, demonstrating performance improvement due to our approach for increasing extractor resilience to format deviation, and finally demonstrating the scalability of the pipeline across pathology reports for different cancer types.
机译:我们描述了病理提取管道(PEP),这是我们为从病理报告中提取信息而开发的一种新的开放式健康自然语言处理管道,目的是将提取的数据填充到研究数据仓库中。具体来说,我们建立在医学知识分析工具管道(MedKATp)的基础上,该管道是针对病理报告的提取框架。我们的特殊贡献包括:在MedKATp上进行额外的自定义和开发,以从癌症病理学报告中提取比目前更详细的数据元素和关系;为提取任务配置MedKATp的抽象层大大简化;以及基于机器学习的方法,使提取对病理报告语料库中常见报告格式的偏离更具弹性。我们提供的实验结果证明了我们的管道在实际任务中用于信息提取的有效性,表明了由于我们提高提取器对格式偏差的适应性的方法而导致的性能改善,并最终证明了针对不同癌症类型的病理报告中管道的可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号