首页> 外文会议>Conference on empirical methods in natural language processing >Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models
【24h】

Unsupervised Token-wise Alignment to Improve Interpretation of Encoder-Decoder Models

机译:无监督的令牌 - 明智的对齐,以改善编码器 - 解码器模型的解释

获取原文

摘要

Developing a method for understanding the inner workings of black-box neural methods is an important research endeavor. Conventionally, many studies have used an attention matrix to interpret how Encoder-Decoder-based models translate a given source sentence to the corresponding target sentence. However, recent studies have empirically revealed that an attention matrix is not optimal for token-wise translation analyses. We propose a method that explicitly models the token-wise alignment between the source and target sequences to provide a better analysis. Experiments show that our method can acquire token-wise alignments that are superior to those of an attention mechanism.
机译:制定一种了解黑匣子神经方法的内部工作的方法是一个重要的研究努力。传统上,许多研究使用了注意力矩阵来解释编码器 - 解码器的模型如何将给定的源句转换为相应的目标句子。然而,最近的研究已经经验揭示了注意力矩阵对令牌的翻译分析不适。我们提出了一种方法,该方法明确地模拟了源和目标序列之间的令牌方向对准,以提供更好的分析。实验表明,我们的方法可以获得优于注意机制的令牌的对准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号