首页> 外文会议>1st EMNLP workshop blackboxNLP: analyzing and interpreting neural networks for NLP 2018 >Firearms and Tigers are Dangerous, Kitchen Knives and Zebras are Not: Testing whether Word Embeddings Can Tell
【24h】

Firearms and Tigers are Dangerous, Kitchen Knives and Zebras are Not: Testing whether Word Embeddings Can Tell

机译:枪支和虎具危险性,而厨刀和斑马具具危险性:测试单词嵌入是否可以分辨

获取原文
获取原文并翻译 | 示例

摘要

This paper presents an approach for investigating the nature of semantic information captured by word embeddings. We propose a method that extends an existing human-elicited semantic property dataset with gold negative examples using crowd judgments. Our experimental approach tests the ability of supervised classifiers to identify semantic features in word embedding vectors and compares this to a feature-identification method based on full vector cosine similarity. The idea behind this method is that properties identified by classifiers, but not through full vector comparison are captured by embeddings. Properties that cannot be identified by either method are not. Our results provide an initial indication that semantic properties relevant for the way entities interact (e.g. dangerous) are captured, while perceptual information (e.g. colors) is not represented. We conclude that, though preliminary, these results show that our method is suitable for identifying which properties are captured by embeddings.
机译:本文提出了一种方法,用于研究词嵌入捕获的语义信息的性质。我们提出了一种使用人群判断来扩展带有金负面示例的现有人为语义属性数据集的方法。我们的实验方法测试了监督分类器识别词嵌入向量中语义特征的能力,并将其与基于全向量余弦相似度的特征识别方法进行了比较。该方法背后的思想是,通过嵌入而不是通过全向量比较来识别分类器所标识的属性。不能通过任何一种方法识别的属性都不会。我们的结果初步表明,与实体交互方式(例如危险)有关的语义属性已被捕获,而感知信息(例如颜色)未得到表示。我们得出的结论是,尽管初步,但这些结果表明,我们的方法适用于识别通过嵌入捕获的属性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号