...
首页> 外文期刊>BMC Bioinformatics >Natural language processing of radiology reports for the detection of thromboembolic diseases and clinically relevant incidental findings
【24h】

Natural language processing of radiology reports for the detection of thromboembolic diseases and clinically relevant incidental findings

机译:放射学报告的自然语言处理,用于检测血栓栓塞性疾病和临床相关的偶然发现

获取原文

摘要

Background Natural Language Processing (NLP) has been shown effective to analyze the content of radiology reports and identify diagnosis or patient characteristics. We evaluate the combination of NLP and machine learning to detect thromboembolic disease diagnosis and incidental clinically relevant findings from angiography and venography reports written in French. We model thromboembolic diagnosis and incidental findings as a set of concepts, modalities and relations between concepts that can be used as features by a supervised machine learning algorithm. A corpus of 573 radiology reports was de-identified and manually annotated with the support of NLP tools by a physician for relevant concepts, modalities and relations. A machine learning classifier was trained on the dataset interpreted by a physician for diagnosis of deep-vein thrombosis, pulmonary embolism and clinically relevant incidental findings. Decision models accounted for the imbalanced nature of the data and exploited the structure of the reports. Results The best model achieved an F measure of 0.98 for pulmonary embolism identification, 1.00 for deep vein thrombosis, and 0.80 for incidental clinically relevant findings. The use of concepts, modalities and relations improved performances in all cases. Conclusions This study demonstrates the benefits of developing an automated method to identify medical concepts, modality and relations from radiology reports in French. An end-to-end automatic system for annotation and classification which could be applied to other radiology reports databases would be valuable for epidemiological surveillance, performance monitoring, and accreditation in French hospitals.
机译:背景自然语言处理(NLP)已被证明可有效分析放射学报告的内容并确定诊断或患者特征。我们评估了NLP和机器学习的组合,以检测法文撰写的血管造影和静脉造影报告中的血栓栓塞性疾病诊断和临床相关偶然发现。我们将血栓栓塞性诊断和偶然发现建模为一组概念,模式和概念之间的关系,这些概念,模式和概念之间的关系可以通过有监督的机器学习算法用作特征。取消了573份放射学报告的语料库,并在医师的NLP工具支持下手动注释了相关的概念,方式和关系。在医生解释的数据集上训练了机器学习分类器,以诊断深静脉血栓形成,肺栓塞和临床相关的偶然发现。决策模型考虑了数据的不平衡性,并利用了报告的结构。结果最佳模型的F值用于肺栓塞识别为0.98,深静脉血栓形成为1.00,临床偶然发现为0.80。在所有情况下,概念,模式和关系的使用都提高了绩效。结论本研究证明了开发自动方法以从法语放射学报告中识别医学概念,形态和关系的好处。可以应用于其他放射学报告数据库的端到端自动注释和分类系统,对于法国医院的流行病学监测,性能监测和认证具有重要意义。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号