...
首页> 外文期刊>Neural Networks and Learning Systems, IEEE Transactions on >Evaluating the Visualization of What a Deep Neural Network Has Learned
【24h】

Evaluating the Visualization of What a Deep Neural Network Has Learned

机译:评估深度神经网络所学知识的可视化

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Deep neural networks (DNNs) have demonstrated impressive performance in complex machine learning tasks such as image classification or speech recognition. However, due to their multilayer nonlinear structure, they are not transparent, i.e., it is hard to grasp what makes them arrive at a particular classification or recognition decision, given a new unseen data sample. Recently, several approaches have been proposed enabling one to understand and interpret the reasoning embodied in a DNN for a single test image. These methods quantify the “importance” of individual pixels with respect to the classification decision and allow a visualization in terms of a heatmap in pixel/input space. While the usefulness of heatmaps can be judged subjectively by a human, an objective quality measure is missing. In this paper, we present a general methodology based on region perturbation for evaluating ordered collections of pixels such as heatmaps. We compare heatmaps computed by three different methods on the SUN397, ILSVRC2012, and MIT Places data sets. Our main result is that the recently proposed layer-wise relevance propagation algorithm qualitatively and quantitatively provides a better explanation of what made a DNN arrive at a particular classification decision than the sensitivity-based approach or the deconvolution method. We provide theoretical arguments to explain this result and discuss its practical implications. Finally, we investigate the use of heatmaps for unsupervised assessment of the neural network performance.
机译:在复杂的机器学习任务(例如图像分类或语音识别)中,深度神经网络(DNN)表现出出色的性能。但是,由于它们的多层非线性结构,它们不是透明的,即,在给定一个新的看不见的数据样本的情况下,很难掌握使它们做出特定分类或识别决策的原因。最近,已经提出了几种方法,使一种方法能够理解和解释DNN中针对单个测试图像的推理。这些方法量化了各个像素相对于分类决策的“重要性”,并允许根据像素/输入空间中的热图进行可视化。虽然可以由人主观地判断热图的有用性,但缺少客观的质量度量。在本文中,我们提出了一种基于区域扰动的通用方法,用于评估像素(如热图)的有序集合。我们在SUN397,ILSVRC2012和MIT Places数据集上比较了通过三种不同方法计算出的热图。我们的主要结果是,与基于灵敏度的方法或解卷积方法相比,最近提出的分层相关性传播算法在质量和数量上提供了更好的解释,使DNN能够做出特定的分类决策。我们提供理论论据来解释此结果并讨论其实际含义。最后,我们研究了使用热图对神经网络性能进行无监督评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号