首页> 外文会议>Conference on empirical methods in natural language processing >Firearms and Tigers are Dangerous, Kitchen Knives and Zebras are Not: Testing whether Word Embeddings Can Tell
【24h】

Firearms and Tigers are Dangerous, Kitchen Knives and Zebras are Not: Testing whether Word Embeddings Can Tell

机译:枪支和老虎是危险的,厨房刀和斑马不是:测试嵌入词是否可以说明

获取原文

摘要

This paper presents an approach for investigating the nature of semantic information captured by word embeddings. We propose a method that extends an existing human-elicited semantic property dataset with gold negative examples using crowd judgments. Our experimental approach tests the ability of supervised classifiers to identify semantic features in word embedding vectors and compares this to a feature-identification method based on full vector cosine similarity. The idea behind this method is that properties identified by classifiers, but not through full vector comparison are captured by embeddings. Properties that cannot be identified by either method are not. Our results provide an initial indication that semantic properties relevant for the way entities interact (e.g. dangerous) are captured, while perceptual information (e.g. colors) is not represented. We conclude that, though preliminary, these results show that our method is suitable for identifying which properties are captured by embeddings.
机译:本文介绍了一种调查Word Embeddings捕获的语义信息的性质的方法。我们提出了一种方法,该方法将现有的人类引出语义属性数据集扩展,使用人群判断将存在于金负例。我们的实验方法测试了监督分类器识别Word嵌入向量中语义特征的能力,并将其与基于完整矢量余弦相似性的特征识别方法进行比较。这种方法背后的想法是由分类器识别的属性,但不是通过全矢量比较被嵌入捕获。无法通过任一方法识别的属性不是。我们的结果提供了初始指示,即捕获了与实体交互的方式相关的语义特性(例如,危险),而感知信息(例如颜色)未被表示。我们得出结论,虽然初步,但这些结果表明,我们的方法适用于识别嵌入物捕获的属性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号