首页> 外文会议>International conference on digital image processing >Explore Fine-grained Discriminative Visual Explanation When Making Classification Decision
【24h】

Explore Fine-grained Discriminative Visual Explanation When Making Classification Decision

机译:在做出分类决策时探索细粒度的辨别视觉解释

获取原文

摘要

Language and image are two most important media for describing surrounding world. Fine-grained visual explanations are helpful for people to understand the reasons or motivation of vision system when it makes classification decision. Base on the pioneer work of Lisa, this paper proposes a new model for discriminative visual explanation generation. It extracts res5c image features from deep residual network and uses multimodal compact bilinear strategy for multimodal information fusion. Selective attention mechanism is introduced to focus on visual parts that are most related to the predicted category information. The proposed network both considers spatial distribution of image content and fusion strategy that better model different modal information. The result on CUB Bird Dataset shows that our model can improve the quality of the explanation statement, which indicates that our proposed network is effective.
机译:语言和图像是描述周围世界的两种最重要的媒体。细粒度的视觉解释有助于人们理解视觉系统做出分类决策的原因或动机。基于Lisa的开创性工作,本文提出了一种新的判别式视觉解释生成模型。它从深度残差网络中提取res5c图像特征,并使用多峰紧凑双线性策略进行多峰信息融合。引入选择性注意机制以关注与预测类别信息最相关的视觉部分。拟议的网络都考虑了图像内容的空间分布和融合策略,可以更好地对不同的模态信息进行建模。 CUB Bird数据集上的结果表明,我们的模型可以提高解释性陈述的质量,这表明我们提出的网络是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号