CapVis: Toward Better Understanding of Visual-Verbal Saliency Consistency

Liang Haoran; Jiang Ming; Liang Ronghua; Zhao Qi

首页> 外文期刊>ACM transactions on intelligent systems >CapVis: Toward Better Understanding of Visual-Verbal Saliency Consistency

【24h】

CapVis: Toward Better Understanding of Visual-Verbal Saliency Consistency

机译：CapVis：更好地理解视觉语言显着性一致性

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

When looking at an image, humans shift their attention toward interesting regions, making sequences of eye fixations. When describing an image, they also come up with simple sentences that highlight the key elements in the scene. What is the correlation between where people look and what they describe in an image? To investigate this problem intuitively, we develop a visual analytics system, CapVis, to look into visual attention and image captioning, two types of subjective annotations that are relatively task-free and natural. Using these annotations, we propose a word-weighting scheme to extract visual and verbal saliency ranks to compare against each other. In our approach, a number of low-level and semantic-level features relevant to visual-verbal saliency consistency are proposed and visualized for a better understanding of image content. Our method also shows the different ways that a human and a computational model look at and describe images, which provides reliable information for a captioning model. Experiment also shows that the visualized feature can be integrated into a computational model to effectively predict the consistency between the two modalities on an image dataset with both types of annotations.

机译：当观看图像时，人类将注意力转移到有趣的区域，从而进行眼睛注视。在描述图像时，他们还会想出简单的句子来突出显示场景中的关键元素。人们看起来和他们在图像中所描绘的东西之间有什么关联？为了直观地调查此问题，我们开发了视觉分析系统CapVis，以研究视觉注意力和图像字幕，这两种类型的主观注释相对来说无需任务，而且很自然。使用这些注释，我们提出了一种词加权方案，以提取视觉和言语显着性等级以相互比较。在我们的方法中，提出了许多与视觉语言显着一致性相关的低层和语义层特征，并对其进行了可视化，以更好地理解图像内容。我们的方法还显示了人类和计算模型查看和描述图像的不同方式，这为字幕模型提供了可靠的信息。实验还表明，可视化特征可以集成到计算模型中，以有效预测具有两种类型注释的图像数据集上两种模态之间的一致性。

著录项

来源
《ACM transactions on intelligent systems》 |2019年第1期|10.1-10.23|共23页
作者
Liang Haoran; Jiang Ming; Liang Ronghua; Zhao Qi;
展开▼
作者单位

Zhejiang Univ Technol, Dept Informat Engn, 288 Liuhe Rd, Hangzhou 310013, Zhejiang, Peoples R China;

Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA;

Zhejiang Univ Technol, Dept Informat Engn, 288 Liuhe Rd, Hangzhou 310013, Zhejiang, Peoples R China;

Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Image captioning; visual saliency; visual analytics;

机译：图像字幕;视觉显着性;视觉分析;
入库时间 2022-08-18 04:16:06

相似文献

外文文献
中文文献
专利

1. Beyond saliency: Understanding convolutional neural networks from saliency prediction on layer-wise relevance propagation [J] . Li Heyi, Tian Yunke, Mueller Klaus, Image and Vision Computing . 2019,第MARaaAPRa期

机译：超越显着性：从分层关联传播的显着性预测中了解卷积神经网络
2. Spatiotemporal Saliency Detection Based on Maximum Consistency Superpixels Merging for Video Analysis [J] . Zhang Jianhua, Chen Jingbo, Wang Qichao, IEEE transactions on industrial informatics . 2020,第1期

机译：基于最大一致性超顶限制的空间效力检测
3. Image fusion with structural saliency measure and content adaptive consistency verification [J] . Yang Bin, Sun Yuhan, Li Yuehua Journal of electronic imaging . 2020,第1期

机译：图像融合具有结构显着性度量和内容自适应一致性验证
4. Visual-verbal consistency of image saliency [C] . Haoran Liang, Ming Jiang, Ronghua Liang, International Conference on Systems, Man, and Cybernetics . 2017

机译：图像显着性的视觉-语言一致性
5. Exploring the Overlap, Saliency, and Consistency of Environmental Predictors in Crime Hot Spots: A Remote Systematic Social Observation and Case-Control Examination [D] . Connealy, Nathan T. 2021

机译：探索犯罪热点环境预测因子的重叠，显着性和一致性：远程系统的社会观察和案例控制检查
6. Towards an understanding of salient neighborhood boundaries: adolescent reports of an easy walking distance and convenient driving distance [O] . Natalie Colabianchi, Marsha Dowda, Karin A Pfeiffer, 2007

机译：理解主要邻域边界：青少年报告步行距离和驾车距离都很方便
7. Beyond saliency: Understanding convolutional neural networks from saliency prediction on layer-wise relevance propagation [O] . Heyi Li, Yunke Tian, Klaus Mueller, 2019

机译：超越显着性：了解卷积神经网络从显着性预测到层面相关性传播

CapVis: Toward Better Understanding of Visual-Verbal Saliency Consistency

摘要

著录项

相似文献

相关主题

期刊订阅