Analyzing Multiple Medical Corpora Using Word Embedding

机译：使用词嵌入分析多个医疗语料库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Neural language models, such as word embedding, can effectively embed words into vector spaces and preserve linguistic regularities and semantic relationships. However, few researchers have shown their effectiveness on medical terms and relationships. In this paper, we study the applicability of word2vec, a well-known technique for word embedding, to embed medical terms and relations based on different medical text corpora, including biomedical abstracts of scientific papers, health-related discussion forums, and a commonly available general-purpose information resource. We empirically evaluate the applicability of this approach by studying how the word embedding projects certain classes of medical terms and relations to the word space and analyzing the differences between the three corpora for embedding medical terms and relations. Results show that the corpus of health-related discussion forum posts, authored by lay persons and medical novices, trains a comparable word embedding for popular medical terms, when compared against a professionally authored corpus of published biomedical abstracts.

机译：诸如词嵌入之类的神经语言模型可以有效地将词嵌入向量空间中，并保留语言规律性和语义关系。但是，很少有研究者在医学术语和人际关系上证明其有效性。在本文中，我们研究了word2vec（一种著名的词嵌入技术）是否适用于基于不同医学文本语料库（包括科学论文的生物医学摘要，与健康相关的讨论论坛以及常见的医学文献）来嵌入医学术语和关系的适用性。通用信息资源。我们通过研究单词嵌入如何将某些类别的医学术语和关系投射到单词空间并分析三种语料库之间嵌入医学术语和关系的差异，从经验上评估这种方法的适用性。结果表明，与专业出版的生物医学摘要语料库相比，由非专业人士和医学新手撰写的健康相关讨论论坛帖子的语料库可以训练出类似的词来嵌入流行医学术语。

著录项

来源
《IEEE International Conference on Healthcare Informatics》|2016年|527-533|共7页
会议地点
作者
Jian Huang; Keyang Xu; V. G. Vinod Vydiswaran;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Semantics; Biomedical imaging; Encyclopedias; Electronic publishing; Internet; Diseases;

机译：语义学;生物医学成像;百科全书;电子出版;互联网;疾病;

相似文献

外文文献
中文文献
专利

1. Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records [J] . Qingyu Chen, Jingcheng Du, Sun Kim, BMC Medical Informatics and Decision Making . 2020,第1期

机译：与生物医学Corpora预先培训的句子嵌入的深度学习提高了在电子病历中找到类似句子的表现
2. Deep learning in law: early adaptation and legal word embeddings trained on large corpora [J] . Ilias Chalkidis, Dimitrios Kampas Artificial Intelligence and Law . 2019,第2期

机译：法律深度学习：在大型语料库上进行早期适应和法律单词嵌入训练
3. Deep learning in law: early adaptation and legal word embeddings trained on large corpora [J] . Ilias Chalkidis, Dimitrios Kampas Artificial Intelligence and Law . 2019,第2期

机译：在法律中深入学习：在大公司训练的早期适应和法律词嵌入
4. Analyzing Multiple Medical Corpora Using Word Embedding [C] . Jian Huang, Keyang Xu, V. G. Vinod Vydiswaran International Conference on Healthcare Informatics . 2016

机译：使用Word Embedding分析多个医疗Corpora
5. Hypernym Discovery over WordNet and English Corpora - Using Hearst Patterns and Word Embeddings [D] . Vallabhajosyula, Manikya Swathi 2018

机译：通过WordNet和英语语料库发现Hypernym-使用赫斯特模式和单词嵌入
6. Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records [O] . Qingyu Chen, Jingcheng Du, Sun Kim, 2020

机译：在生物医学语料库上预先训练的带有句子嵌入的深度学习可提高在电子病历中查找相似句子的性能
7. Entity Extraction in Biomedical Corpora: An Approach to Evaluate Word Embedding Features with PSO based Feature Selection [O] . Shweta Yadav, Asif Ekbal, Sriparna Saha, 2017

机译：生物医学技术中的实体提取：一种评估基于PSO的特征选择词嵌入功能的方法

Analyzing Multiple Medical Corpora Using Word Embedding

摘要

著录项

相似文献

相关主题

期刊订阅