16th Workshop on Biomedical Natural Language Processing

Adapting Pre-trained Word Embeddings For Use In Medical Coding


Abstract

Word embeddings are a crucial component of modern NLP. Pre-trained embeddings released by different groups have been a major driver of their popularity. However, they are trained on generic corpora, which limits their direct use for domain-specific tasks. In this paper, we propose a method to add task-specific information to pre-trained word embeddings; such information can improve their utility. We add information from medical coding data, as well as the first level of the ICD-10 medical code hierarchy, to different pre-trained word embeddings. We adapt the CBOW algorithm from the word2vec package for this purpose. We evaluated our approach on five different pre-trained word embeddings. Both the original word embeddings and their modified versions (those with added information) were used for automated review of medical coding. The modified word embeddings improve F-score by 1% in a 5-fold evaluation on a private medical claims dataset. Our results show that adding extra information is possible and beneficial for the task at hand.
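The abstract only names the approach (continuing CBOW-style training so that medical-coding text and the first level of the ICD-10 hierarchy flow into existing vectors); the paper's own modified CBOW implementation is not reproduced here. Below is a minimal sketch of the general idea using gensim 4.x: pre-trained vectors seed a CBOW model, and training then continues on a small domain corpus. The corpus, file names, hyperparameters, and the specific gensim calls (intersect_word2vec_format, vectors_lockf) are illustrative assumptions, not the authors' code.

import numpy as np
from gensim.models import Word2Vec

# Hypothetical domain corpus: tokenized medical-coding sentences plus
# first-level (chapter) descriptions from ICD-10. The paper's actual data
# is a private medical claims dataset and is not reproduced here.
domain_sentences = [
    ["acute", "upper", "respiratory", "infection"],
    ["certain", "infectious", "and", "parasitic", "diseases"],   # ICD-10 chapter I
    ["diseases", "of", "the", "respiratory", "system"],          # ICD-10 chapter X
]

model = Word2Vec(vector_size=300, window=5, min_count=1, sg=0)   # sg=0 selects CBOW
model.build_vocab(domain_sentences)

# Seed the model with pre-trained vectors (placeholder file name);
# lockf=1.0 lets the imported vectors keep updating during training.
model.wv.vectors_lockf = np.ones(len(model.wv), dtype=np.float32)
model.wv.intersect_word2vec_format("pretrained_vectors.bin", binary=True, lockf=1.0)

# Continue CBOW training so the embeddings absorb the domain signal.
model.train(domain_sentences, total_examples=model.corpus_count, epochs=10)

model.wv.save("adapted_vectors.kv")

In a setup like this, the adapted vectors would then feed whatever downstream classifier performs the automated review of medical coding.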