首页> 美国卫生研究院文献>AMIA Annual Symposium Proceedings >Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation
【2h】

Leveraging existing corpora for de-identification of psychiatric notes using domain adaptation

机译:利用现有语料库通过域自适应来取消对精神病笔记的识别

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

De-identification of clinical notes is a special case of named entity recognition. Supervised machine-learning (ML) algorithms have achieved promising results for this task. However, ML-based de-identification systems often require annotating a large number of clinical notes of interest, which is costly. Domain adaptation (DA) is a technology that enables learning from annotated datasets from different sources, thereby reducing annotation cost required for ML training in the target domain. In this study, we investigate the use of DA methods for deidentification of psychiatric notes. Three state-of-the-art DA methods: instance pruning, instance weighting, and feature augmentation are applied to three source corpora of annotated hospital discharge summaries, outpatient notes, and a mixture of different note types written for diabetic patients. Our results show that DA can increase deidentification performance over the baselines, indicating that it can effectively reduce annotation cost for the target psychiatric notes. Feature augmentation is shown to increase performance the most among the three DA methods. Performance variation among the different types of clinical notes is also observed, showing that a mixture of different types of notes brings the biggest increase in performance.
机译:临床笔记的取消识别是命名实体识别的一种特殊情况。监督式机器学习(ML)算法已为该任务取得了可喜的结果。但是,基于ML的去识别系统通常需要注释大量感兴趣的临床笔记,这很昂贵。域适应(DA)是一项技术,可以从不同来源的带注释的数据集中进行学习,从而降低目标域中ML训练所需的注释成本。在这项研究中,我们调查了使用DA方法对精神病笔记进行身份识别。三种最新的DA方法:实例修剪,实例权重和特征增强被应用到带注释的医院出诊摘要,门诊便笺以及为糖尿病患者编写的不同便笺类型的混合物的三个源语料库。我们的结果表明,DA可以提高基线的身份识别性能,这表明它可以有效降低目标精神病学笔记的注释成本。在三种DA方法中,功能增强可最大程度地提高性能。还观察到不同类型的临床笔记之间的性能差异,这表明不同类型的笔记的混合物带来了最大的性能提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号