首页> 外文期刊>Journal of Pathology Informatics >The feasibility of using natural language processing to extract clinical information from breast pathology reports
【24h】

The feasibility of using natural language processing to extract clinical information from breast pathology reports

机译:使用自然语言处理从乳腺病理报告中提取临床信息的可行性

获取原文
           

摘要

Objective:The opportunity to integrate clinical decision support systems into clinical practice is limited due to the lack of structured, machine readable data in the current format of the electronic health record. Natural language processing has been designed to convert free text into machine readable data. The aim of the current study was to ascertain the feasibility of using natural language processing to extract clinical information from >76,000 breast pathology reports.Approach and Procedure:Breast pathology reports from three institutions were analyzed using natural language processing software (Clearforest, Waltham, MA) to extract information on a variety of pathologic diagnoses of interest. Data tables were created from the extracted information according to date of surgery, side of surgery, and medical record number. The variety of ways in which each diagnosis could be represented was recorded, as a means of demonstrating the complexity of machine interpretation of free text.Results:There was widespread variation in how pathologists reported common pathologic diagnoses. We report, for example, 124 ways of saying invasive ductal carcinoma and 95 ways of saying invasive lobular carcinoma. There were >4000 ways of saying invasive ductal carcinoma was not present. Natural language processor sensitivity and specificity were 99.1% and 96.5% when compared to expert human coders.Conclusion:We have demonstrated how a large body of free text medical information such as seen in breast pathology reports, can be converted to a machine readable format using natural language processing, and described the inherent complexities of the task.
机译:目的:由于缺乏电子病历当前格式下的结构化,机器可读数据,因此将临床决策支持系统集成到临床实践中的机会有限。自然语言处理已设计为将自由文本转换为机器可读数据。本研究的目的是确定使用自然语言处理从> 76,000例乳腺病理报告中提取临床信息的可行性。方法和程序:使用自然语言处理软件(Clearforest,Waltham,MA)分析了三个机构的乳腺病理报告)以提取有关各种病理学诊断的信息。根据手术日期,手术部位和病历号从提取的信息中创建数据表。记录了代表各种诊断的各种方式,以此证明对自由文本进行机器解释的复杂性。结果:病理学家报告常见病理诊断的方式存在很大差异。例如,我们报告了124种侵袭性导管癌的说法和95种侵略性小叶癌的说法。有> 4000种说法不存在浸润性导管癌。与专业人类编码员相比,自然语言处理者的敏感性和特异性分别为99.1%和96.5%。结论:我们已经证明了如何使用乳房病理学报告中看到的大量自由文本医学信息可以使用以下方式转换为机器可读格式:自然语言处理,并描述了任务固有的复杂性。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号