首页> 外文会议>International conference on computational linguistics >Assessing Composition in Sentence Vector Representations
【24h】

Assessing Composition in Sentence Vector Representations

机译:评估句子向量表示中的组成

获取原文

摘要

An important component of achieving language understanding is mastering the composition of sentence meaning, but an immediate challenge to solving this problem is the opacity of sentence vector representations produced by current neural sentence composition models. We present a method to address this challenge, developing tasks that directly target compositional meaning information in sentence vector representations with a high degree of precision and control. To enable the creation of these controlled tasks, we introduce a specialized sentence generation system that produces large, annotated sentence sets meeting specified syntactic, semantic and lexical constraints. We describe the details of the method and generation system, and then present results of experiments applying our method to probe for compositional information in embeddings from a number of existing sentence composition models. We find that the method is able to extract useful information about the differing capacities of these models, and we discuss the implications of our results with respect to these systems' capturing of sentence information. We make available for public use the datasels used for these experiments, as well as the generation system.1
机译:实现语言理解的重要组成部分是掌握句子含义的组成,但是解决此问题的直接挑战是当前神经句子组成模型所产生的句子矢量表示的不透明性。我们提出了一种解决这一挑战的方法,开发了以高精度和可控制性直接针对句子矢量表示中的成分含义信息的任务。为了能够创建这些受控任务,我们引入了一种特殊的句子生成系统,该系统可以生成满足指定句法,语义和词汇约束的大型带注释句子集。我们描述了该方法和生成系统的细节,然后介绍了使用我们的方法从许多现有的句子组成模型中探查嵌入中的组成信息的实验结果。我们发现该方法能够提取有关这些模型的不同能力的有用信息,并且我们讨论了关于这些系统捕获句子信息的结果的含义。我们将这些实验所使用的数据集以及生成系统公开提供给公众。1

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号