首页> 外文会议>Workshop on biomedical natural language processing 2015 >Evaluating distributed word representations for capturing semantics of biomedical concepts
【24h】

Evaluating distributed word representations for capturing semantics of biomedical concepts

机译:评估分布式词表示形式以捕获生物医学概念的语义

获取原文
获取原文并翻译 | 示例

摘要

Recently there is a surge in interest in learning vector representations of words using huge corpus in unsupervised manner. Such word vector representations, also known as word embedding, have been shown to improve the performance of machine learning models in several NLP tasks. However efficiency of such representation has not been systematically evaluated in biomedical domain. In this work our aim is to compare the performance of two state-of-the-art word embedding methods, namely word2vec and GloVe on a basic task of reflecting semantic similarity and relatedness of biomedical concepts. For this, vector representations of all unique words in the corpus of more than 1 million full-length research articles in biomedical domain are obtained from the two methods. We observe that parameters of these models do affect their ability to capture lexico-semantic properties and word2vec with particular language modeling seems to perform better than others.
机译:最近,人们对以无人监督的方式使用巨大语料库学习单词的向量表示形式的兴趣激增。此类单词向量表示法(也称为单词嵌入)已显示在若干NLP任务中提高了机器学习模型的性能。但是,这种表示的效率尚未在生物医学领域进行系统评价。在这项工作中,我们的目的是比较两种最先进的词嵌入方法(即word2vec和GloVe)在反映生物医学概念的语义相似性和相关性这一基本任务上的性能。为此,可以通过两种方法获得生物医学领域超过一百万篇全长研究文章语料库中所有唯一单词的向量表示。我们观察到,这些模型的参数确实会影响其捕获词汇语义属性的能力,并且使用特定语言建模的word2vec似乎比其他模型表现更好。

著录项

  • 来源
  • 会议地点 Beijing(CA)
  • 作者单位

    Department of Computer Science and Engineering, Indian Institute of Technology Guwahati, Assam - 781039, India;

    Department of Computer Science and Engineering, Indian Institute of Technology Guwahati, Assam - 781039, India;

    Department of Computer Science and Engineering, Indian Institute of Technology Guwahati, Assam - 781039, India;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-26 14:23:26

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号