Exploring Word Segmentation and Medical Concept Recognition for Chinese Medical Texts

Abstract

Chinese word segmentation (CWS) and medical concept recognition are two fundamental tasks in processing Chinese electronic medical records (EMRs) and play important roles in downstream tasks for understanding Chinese EMRs. One challenge for these tasks is the lack of medical-domain datasets with high-quality annotations, especially medical-related tags that reveal the characteristics of Chinese EMRs. In this paper, we collected a Chinese EMR corpus, namely ACEMR, with human annotations for Chinese word segmentation and EMR-related tags. On the ACEMR corpus, we run well-known models (i.e., BiLSTM, BERT, and ZEN) and existing state-of-the-art systems (e.g., WMSeg and TwASP) for CWS and medical concept recognition. Experimental results demonstrate the necessity of building a dedicated medical dataset and show that models that leverage extra resources achieve the best performance on both tasks, which provides guidance for future studies on model selection in the medical domain.

Bibliographic record

Source
SIGBioMed Workshop on Biomedical Language Processing | 2021 | pp. 213-220 | 8 pages
Authors
Yang Liu; Yuanhe Tian; Tsung-Hui Chang; Song Wu; Xiang Wan; Yan Song
Original format
PDF
Date indexed
2022-08-26 13:58:10
