IEEE Winter Conference on Applications of Computer Vision

Towards Visually Explaining Video Understanding Networks with Perturbation



Abstract

"Making black box models explainable " is a vital problem that accompanies the development of deep learning networks. For networks taking visual information as input, one basic but challenging explanation method is to identify and visualize the input pixels/regions that dominate the network’s prediction. However, most existing works focus on explaining networks taking a single image as input and do not consider the temporal relationship that exists in videos. Providing an easy-to-use visual explanation method that is applicable to diversified structures of video understanding networks still remains an open challenge. In this paper, we investigate a generic perturbation-based method for visually explaining video understanding networks. Besides, we propose a novel loss function to enhance the method by constraining the smoothness of its results in both spatial and temporal dimensions. The method enables the comparison of explanation results between different network structures to become possible and can also avoid generating the pathological adversarial explanations for video inputs. Experimental comparison results verified the effectiveness of our method.
