Improving the Named Entity Recognition of Chinese Electronic Medical Records by Combining Domain Dictionary and Rules

Abstract

Electronic medical records are an integral part of medical texts. Entity recognition in electronic medical records has prompted many studies and many proposed entity extraction methods. In this paper, an entity extraction model is proposed to extract entities from Chinese Electronic Medical Records (CEMR). In the input layer of the model, we use word embedding and dictionary feature embedding as input vectors, where the word embedding consists of a character representation and a word representation. The input vectors are then fed to a bidirectional long short-term memory (Bi-LSTM) network to capture contextual features. Finally, a conditional random field (CRF) is employed to capture dependencies between neighboring tags. On the body classification task, the F1 value reached 90.65%; on the anatomic region recognition task, it reached 93.89%. On both tasks, our model outperformed state-of-the-art models such as Bi-LSTM-CRF, Bi-LSTM-Attention, and Vote. The experiments also show that our model performs well on low-frequency entities and unknown entities; with a small training dataset, our method showed a 2–4% improvement in F1 value over the basic Bi-LSTM-CRF models. Additionally, on the anatomic region recognition task, we applied 12 rules that we designed and a domain dictionary alongside the proposed entity extraction model. In this task, the weighted F1 value for extraction of the three specific entities reached 84.36%.
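
The abstract does not say how the dictionary features are computed, so the following is only a minimal sketch of one common approach, not the authors' method: forward maximum matching against a domain dictionary, which tags each character with its position (B, I, E, S, or O) inside the longest matching entry. The function name `dictionary_features`, the `max_len` parameter, and the toy lexicon are all hypothetical. In a model like the one described, such tags would typically be mapped to an embedding and concatenated with the character and word representations before the Bi-LSTM layer.

```python
from typing import List, Set


def dictionary_features(sentence: str, lexicon: Set[str], max_len: int = 6) -> List[str]:
    """Tag each character with its position inside the longest dictionary match.

    Tags: B (begin), I (inside), E (end), S (single-character entry), O (no match).
    This is an assumed feature scheme, not one confirmed by the paper.
    """
    tags = ["O"] * len(sentence)
    i = 0
    while i < len(sentence):
        match_len = 0
        # Forward maximum matching: try the longest candidate first.
        for length in range(min(max_len, len(sentence) - i), 0, -1):
            if sentence[i:i + length] in lexicon:
                match_len = length
                break
        if match_len == 0:
            i += 1  # no dictionary entry starts here; leave the tag as O
            continue
        if match_len == 1:
            tags[i] = "S"
        else:
            tags[i] = "B"
            for j in range(i + 1, i + match_len - 1):
                tags[j] = "I"
            tags[i + match_len - 1] = "E"
        i += match_len  # skip past the matched span
    return tags


# Hypothetical toy lexicon and sentence, for illustration only.
# "左肺上叶" = upper lobe of the left lung, "结节" = nodule.
toy_lexicon = {"左肺", "左肺上叶", "胃"}
toy_tags = dictionary_features("左肺上叶结节", toy_lexicon)
print(toy_tags)
```

Because the longest entry wins, "左肺上叶" is tagged as one span rather than as the shorter entry "左肺" followed by unmatched characters; the two characters of "结节" stay O since the toy lexicon does not contain them.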

Bibliographic details

Journal: International Journal of Environmental Research and Public Health
Authors: Xianglong Chen; Chunping Ouyang; Yongbin Liu; Yi Bu
Year (volume), issue: 2020 (17), 8
Total pages: 16
Original format: PDF
Chinese Library Classification: Public health engineering
Keywords: entity recognition; electronic medical records; Bi-LSTM-CRF; rules; domain dictionary
Date indexed: 2022-08-21 11:40:24

Similar literature

1. Named Entity Recognition Over Electronic Health Records Through a Combined Dictionary-based Approach [J]. Alexandra Pomares Quimbaya, Alejandro Sierra Múnera, Rafael Andrés González Rivera. Procedia Computer Science. 2016, No. 1
2. A Hybrid Model for Named Entity Recognition on Chinese Electronic Medical Records [J]. Wang Yu, Sun Yining, Ma Zuchang. ACM Transactions on Asian and Low-Resource Language Information Processing. 2021, No. 2
3. Chinese electronic medical record named entity recognition algorithm based on transfer learning [J]. Li Yi, Liu Jianyi, Zhang Ru. Basic & Clinical Pharmacology & Toxicology. 2020, No. S9
4. Improved deep belief network model and its application in named entity recognition of Chinese electronic medical records [C]. Wusuo Li, Shenghui Shi, Ziqiao Gao. IEEE International Conference on Big Data Analysis. 2018
5. Understanding, evaluating and enhancing electronic medical record adoption in a primary care setting: A programme to improve electronic medical record data quality and its effect on family practice provision of incentivized and enhanced care for chronic disease patients [D]. Bowen, Michael. 2013
6. Deep learning for named entity recognition on Chinese electronic medical records: Combining deep transfer learning with multitask bi-directional LSTM RNN [O]. Xishuang Dong, Shanta Chowdhury, Lijun Qian. 2015
7. Deep learning for named entity recognition on Chinese electronic medical records: Combining deep transfer learning with multitask bi-directional LSTM RNN [O]. Xishuang Dong, Shanta Chowdhury, Lijun Qian. 2019
