首页> 外文会议>16th workshop on biomedical natural language processing >Insights into Analogy Completion from the Biomedical Domain
【24h】

Insights into Analogy Completion from the Biomedical Domain

机译:从生物医学领域洞察类比完成

获取原文
获取原文并翻译 | 示例

摘要

Analogy completion has been a popular task in recent years for evaluating the semantic properties of word embeddings, but the standard methodology makes a number of assumptions about analogies that do not always hold, either in recent benchmark datasets or when expanding into other domains. Through an analysis of analogies in the biomedical domain, we identify three assumptions: that of a Single Answer for any given analogy, that the pairs involved describe the Same Relationship, and that each pair is Informative with respect to the other. We propose modifying the standard methodology to relax these assumptions by allowing for multiple correct answers, reporting MAP and MRR in addition to accuracy, and using multiple example pairs. We further present BMASS, a novel dataset for evaluating linguistic regularities in biomedical embeddings, and demonstrate that the relationships described in the dataset pose significant semantic challenges to current word embedding methods.
机译:近年来,类比完成是评估词嵌入的语义特性的一项流行任务,但是标准方法对最近在基准数据集中或扩展到其他领域时并不总是成立的类比做出了许多假设。通过对生物医学领域中类比的分析,我们确定了三个假设:对于任何给定类比的单个答案,所涉及的对描述了相同的关系,并且每个对相对于彼此都是信息性的。我们建议修改标准方法,以通过允许多个正确答案,除了准确性以外还报告MAP和MRR并使用多个示例对来放宽这些假设。我们进一步介绍了BMASS,这是一种用于评估生物医学嵌入中语言规则性的新颖数据集,并证明了数据集中描述的关系对当前的词嵌入方法构成了重大的语义挑战。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号