Target Concept Guided Medical Concept Normalization in Noisy User-Generated Texts


Abstract

Medical concept normalization (MCN), i.e., the mapping of colloquial medical phrases to standard concepts, is an essential step in the analysis of medical social media text. The main drawback of the existing state-of-the-art approach (Kalyan and Sangeetha, 2020b) is that it learns target concept vector representations from scratch, which requires more training instances. Our model is based on RoBERTa and target concept embeddings. In our model, we integrate a) target concept information, in the form of target concept vectors generated by encoding target concept descriptions using SRoBERTa, a state-of-the-art RoBERTa-based sentence embedding model, and b) domain lexicon knowledge, by enriching the target concept vectors with synonym relationship knowledge using a retrofitting algorithm. It is the first attempt in MCN to exploit both target concept information and domain lexicon knowledge in the form of retrofitted target concept vectors. Our model outperforms all existing models, with an accuracy improvement of up to 1.36% on three standard datasets. Further, when trained only on mapping lexicon synonyms, our model achieves up to a 4.87% improvement in accuracy.
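
The retrofitting step mentioned in the abstract can be illustrated with a small sketch. The abstract does not state which formulation or hyperparameters are used, so the code below assumes the widely used graph-based update in which each vector is pulled toward the average of its synonyms while staying close to its original value. The function name `retrofit`, the parameters `alpha` and `iterations`, and the toy two-dimensional vectors are all hypothetical; in the setting described above, the input vectors would instead come from encoding target concept descriptions with a sentence embedding model.

```python
from typing import Dict, List

Vector = List[float]


def retrofit(
    vectors: Dict[str, Vector],
    synonyms: Dict[str, List[str]],
    iterations: int = 10,
    alpha: float = 1.0,
) -> Dict[str, Vector]:
    """Pull each vector toward the vectors of its synonyms.

    Assumed update rule (neighbour weights sum to 1 for each node):
        q_i = (alpha * q_hat_i + mean(q_j for j in neighbours)) / (alpha + 1)
    where q_hat_i is the original, un-retrofitted vector.
    """
    # Copy so the caller's vectors are left untouched.
    original = {key: list(vec) for key, vec in vectors.items()}
    current = {key: list(vec) for key, vec in vectors.items()}

    for _ in range(iterations):
        for key, q_hat in original.items():
            # Only neighbours that actually have a vector can contribute.
            neighbours = [
                n for n in synonyms.get(key, []) if n in current and n != key
            ]
            if not neighbours:
                continue  # no lexicon entry: keep the original vector
            dim = len(q_hat)
            mean = [
                sum(current[n][d] for n in neighbours) / len(neighbours)
                for d in range(dim)
            ]
            current[key] = [
                (alpha * q_hat[d] + mean[d]) / (alpha + 1.0) for d in range(dim)
            ]
    return current


# Toy example: two synonymous concepts and one unrelated concept.
toy_vectors = {
    "headache": [1.0, 0.0],
    "cephalalgia": [0.0, 1.0],
    "nausea": [1.0, 1.0],
}
toy_synonyms = {"headache": ["cephalalgia"], "cephalalgia": ["headache"]}
retrofitted = retrofit(toy_vectors, toy_synonyms)
```

In this toy run the two synonymous entries move toward each other, while the entry with no synonyms keeps its original vector. A larger `alpha` keeps vectors closer to their original encodings; a smaller one lets the lexicon dominate.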

Bibliographic details

Source
Workshop on Knowledge Extraction and Integration for Deep Learning Architectures | 2020 | pp. 64-73 | 10 pages
Authors
Katikapalli Subramanyam Kalyan; Sivanesan Sangeetha
Original format
PDF

Similar literature

1. Medical Concept Normalization by Encoding Target Knowledge [J]. Nikhil Pattisapu, Sangameshwar Patil, Girish Palshikar. JMLR: Workshop and Conference Proceedings. 2020
2. Hypothesis Generation From Text Based On Co-Evolution Of Biomedical Concepts [J]. Kishlay Jha, Guangxu Xun, Yaqing Wang. SIGKDD Explorations. 2019
3. Different approaches for identifying important concepts in probabilistic biomedical text summarization [J]. Moradi Milad, Ghadiri Nasser. Artificial Intelligence in Medicine. 2018
4. Medical Concept Normalization in User-Generated Texts by Learning Target Concept Embeddings [C]. Katikapalli Subramanyam Kalyan, Sivanesan Sangeetha. International Workshop on Health Text Mining and Information Analysis. 2020
5. Identification of concepts from emergency department text using natural language processing techniques and the Unified Medical Language System RTM. [D]. Travers, Debbie. 2003
6. Improving the CONTES method for normalizing biomedical text entities with concepts from an ontology with (almost) no training data [O]. Arnaud Ferré, Mouhamadou Ba, Robert Bossy. 2019
7. LexExp: a system for automatically expanding concept lexicons for noisy biomedical texts [O]. Abeed Sarker. 2020
