首页> 外文会议>Workshop on Knowledge Extraction and Integration for Deep Learning Architectures >Attention vs non-attention for a Shapley-based explanation method
【24h】

Attention vs non-attention for a Shapley-based explanation method

机译:基于Shapley解释方法的注意与不注意

获取原文

摘要

The field of explainable AI has recently seen an explosion in the number of explanation methods for highly non-linear deep neural networks. The extent to which such methods - that are often proposed and tested in the domain of computer vision - are appropriate to address the explainability challenges in NLP is yet relatively unexplored. In this work, we consider Contextual Decomposition (CD) - a Shapley-based input feature attribution method that has been shown to work well for recurrent NLP models - and we test the extent to which it is useful for models that contain attention operations. To this end, we extend CD to cover the operations necessary for attention-based models. We then compare how long distance subject-verb relationships are processed by models with and without attention, considering a number of different syntactic structures in two different languages: English and Dutch. Our experiments confirm that CD can successfully be applied for attention-based models as well, providing an alternative Shapley-based attribution method for modem neural networks. In particular, using CD, we show that the English and Dutch models demonstrate similar processing behaviour, but that under the hood there are consistent differences between our attention and non-attention models.
机译:在可解释人工智能领域,针对高度非线性深层神经网络的解释方法的数量最近出现了爆炸式增长。这些方法——通常在计算机视觉领域被提出和测试——在多大程度上适合于解决NLP中的可解释性挑战,这一点还相对未知。在这项工作中,我们考虑上下文分解(CD)-基于Shapley的输入特征归因方法,已被证明是很好地用于经常性的NLP模型-我们测试的程度,它是有用的模型包含注意力操作。为此,我们将CD扩展到基于注意力的模型所需的操作。然后,考虑到英语和荷兰语两种不同语言中的许多不同句法结构,我们比较了有注意和无注意的模型处理远距离主谓关系的程度。我们的实验证实,CD也可以成功地应用于基于注意的模型,为现代神经网络提供了另一种基于Shapley的归因方法。特别是,使用CD,我们证明了英语和荷兰语模型表现出相似的加工行为,但在我们的注意模型和非注意模型之间存在一致的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号