首页> 外文会议>Internatioanl Conference for Smart Health >Character-Based Deep Learning Approaches for Clinical Named Entity Recognition: A Comparative Study Using Chinese EHR Texts
【24h】

Character-Based Deep Learning Approaches for Clinical Named Entity Recognition: A Comparative Study Using Chinese EHR Texts

机译:临床名称实体识别的基于性质的深度学习方法:使用中文EHR文本的比较研究

获取原文

摘要

Previous studies on clinical sequence labeling require large amounts of task specific knowledge in the form of handcrafted features. Using latest development in representation learning, this paper introduces BERT embedding as character based pretrained model and incorporates it with three competing deep learning models (CNN-LSTM, Bi-LSTM and Bi-LSTM-CRF) to extract clinical entities from electronic health records. A comparative evaluation based on CCKS-2017 task 2 benchmark dataset reveals that: (1) BERT embedding not only facilitates improving performance of clinical NER tasks but also acts as good candidate for building end-to-end NER model requiring no feature engineering from Chinese EHR. (2) Bi-LSTM-CRF has the highest performance, i.e., 93% F1 scores when it uses BERT embedding. This paper may enhance our understanding of how to use BERT embedding in clinical NER researches.
机译:以前关于临床序列标记的研究需要以手工特征的形式需要大量的任务特异性知识。本文使用最新的发展学习,介绍了伯特嵌入式基于普拉的净化模型,并将其与三个竞争的深度学习模型(CNN-LSTM,Bi-LSTM和Bi-LSTM-CRF)一起用,以从电子健康记录中提取临床实体。基于CCKS-2017任务2的比较评估,基准数据集显示:(1)BERT嵌入不仅促进了提高临床行程的性能,而且还可作为建立终端内部模型的良好候选者,要求没有中文的功能工程EHR。 (2)双LSTM-CRF具有最高性能,即,使用BERT嵌入时的93%F1分数。本文可以提高我们对如何在临床NER研究中使用BERT嵌入的理解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号