Category Multi-representation: A Unified Solution for Named Entity Recognition in Clinical Texts

机译：类别多表示：临床文本中命名实体识别的统一解决方案

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Clinical Named Entity Recognition (CNER), the task of identifying the entity boundaries in clinical texts, is essential for many applications. Previous methods usually follow the traditional NER methods that heavily rely on language specific features (i.e. linguistics and lexicons) and high quality annotated data. However, due to the problem of Limited Availability of Annotated Data and Informal Clinical Texts, CNER becomes more challenging. In this paper, we propose a novel method that learn multiple representations for each category, namely category-multi-representation (CMR) that captures the semantic relat-edness between words and clinical categories from different perspectives. CMR is learned based on a large scale unannotated corpus and a small set of annotated data, which greatly alleviates the burden of human effort. Instead of the language specific features, our proposed method uses more evidential features without any additional NLP tools, and enjoys a lightweight adaption among languages. We conduct a series of experiments to verify our new CMR features can further improve the performance of NER significantly without leveraging any external lexicons.

机译：临床命名实体识别（CNER）是在临床文本中标识实体边界的任务，对于许多应用程序来说都是必不可少的。先前的方法通常遵循传统的NER方法，该方法严重依赖于语言的特定功能（即语言学和词典）以及高质量的带注释数据。但是，由于注释数据和非正式临床文本的可用性有限的问题，CNER变得更具挑战性。在本文中，我们提出了一种学习每种类别的多种表示的新颖方法，即类别多表示（CMR），它从不同的角度捕获了单词和临床类别之间的语义相关性。 CMR是基于大规模的未注释语料库和少量注释数据集而学习的，这极大地减轻了人员的负担。代替语言特定的功能，我们提出的方法使用了更多的证据功能，而没有任何其他的NLP工具，并且在语言之间具有轻巧的适应性。我们进行了一系列实验，以验证我们的新CMR功能可以在不利用任何外部词典的情况下进一步显着提高NER的性能。

著录项

来源
《Pacific-Asia conference on knowledge discovery and data mining》|2018年|275-287|共13页
会议地点
作者
Jiangtao Zhang; Juanzi Li; Shuai Wang; Yan Zhang; Yixin Cao; Lei Hou; Xiao-Li Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Conditional random fields for clinical named entity recognition: A comparative study using Korean clinical texts [J] . Lee Wangjin, Kim Kyungmo, Lee Eun Young, Computers in Biology and Medicine . 2018,第期

机译：临床命名实体识别的条件随机字段：韩国临床文本的比较研究
2. DNER Clinical (named entity recognition) from free clinical text to Snomed-CT concept [J] . IGNACIO MARTINEZ SORIANO, JUAN LUIS CASTRO PENA WSEAS Transactions on Computers . 2017,第期

机译：DNER临床（命名实体识别）从免费的临床文本到SNOMED-CT概念
3. Cost-aware active learning for named entity recognition in clinical text [J] . Qiang Wei, Yukun Chen, Mana Salimi, Journal of the American Medical Informatics Association : . 2019,第11期

机译：在临床文本中命名实体识别的成本感知主动学习
4. Category Multi-representation: A Unified Solution for Named Entity Recognition in Clinical Texts [C] . Jiangtao Zhang, Juanzi Li, Shuai Wang, Pacific-Asia Conference on Knowledge Discovery and Data Mining . 2018

机译：类别多表示：临床文本中指定实体识别的统一解决方案
5. Semi-supervised Named Entity Recognition: Learning to recognize 100 entity types with little supervision [D] . Nadeau, David. 2007

机译：半监督的命名实体识别：在很少的监督下学习识别100种实体类型
6. A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text [O] . Yonghui Wu, Jun Xu, Min Jiang, 2015

机译：神经文本嵌入对临床文本中命名实体识别的研究
7. Evaluation of a Concept Mapping Task Using Named Entity Recognition and Normalization in Unstructured Clinical Text [O] . Sapna Trivedi, Roger Gildersleeve, Sandra Franco, 2020

机译：在非结构化临床文本中使用命名实体识别和归一化的概念映射任务的评估

Category Multi-representation: A Unified Solution for Named Entity Recognition in Clinical Texts

摘要

著录项

相似文献

相关主题

期刊订阅