DeepVS2.0: A Saliency-Structured Deep Learning Method for Predicting Dynamic Visual Attention

Jiang Lai; Xu Mai; Wang Zulin; Sigal Leonid

Abstract

Deep neural networks (DNNs) have exhibited great success in image saliency prediction. However, few works apply DNNs to predict the saliency of generic videos. In this paper, we propose a novel DNN-based video saliency prediction method, called DeepVS2.0. Specifically, we establish a large-scale eye-tracking database of videos (LEDOV), which provides sufficient data to train DNN models for predicting video saliency. Through statistical analysis of LEDOV, we find that human attention is normally attracted by objects, particularly moving objects or the moving parts of objects. Accordingly, we propose an object-to-motion convolutional neural network (OM-CNN) in DeepVS2.0 to learn spatio-temporal features for predicting intra-frame saliency by exploring both objectness and object motion. We further find from our database that human attention is temporally correlated, with a smooth saliency transition across video frames. Therefore, a saliency-structured convolutional long short-term memory network (SS-ConvLSTM) is developed in DeepVS2.0 to predict inter-frame saliency, using the features extracted by OM-CNN as input. Moreover, center-bias dropout and a sparsity-weighted loss are embedded in SS-ConvLSTM to account for the center bias and sparsity of human attention maps. Finally, experimental results show that DeepVS2.0 advances the state of the art in video saliency prediction.
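
The center bias mentioned in the abstract (viewers tend to fixate near the middle of the frame) can be illustrated with a small sketch. This is not the paper's center-bias dropout, whose definition is not given in this record. It is only a generic Gaussian center prior applied to a saliency map, and the function names `center_bias_map` and `apply_center_bias` and the parameter `sigma_frac` are hypothetical choices made for this example.

```python
import math

def center_bias_map(height, width, sigma_frac=0.25):
    """Build a 2D Gaussian prior peaked at the frame center, normalized to sum to 1.

    sigma_frac is an assumed parameter: the standard deviation as a fraction
    of the frame height (vertical axis) and frame width (horizontal axis).
    """
    cy, cx = (height - 1) / 2.0, (width - 1) / 2.0
    sy, sx = sigma_frac * height, sigma_frac * width
    prior = [
        [math.exp(-0.5 * (((y - cy) / sy) ** 2 + ((x - cx) / sx) ** 2))
         for x in range(width)]
        for y in range(height)
    ]
    total = sum(sum(row) for row in prior)
    return [[v / total for v in row] for row in prior]

def apply_center_bias(saliency, prior):
    """Weight a non-negative saliency map by the prior, then renormalize to sum to 1."""
    biased = [[s * p for s, p in zip(s_row, p_row)]
              for s_row, p_row in zip(saliency, prior)]
    total = sum(sum(row) for row in biased)
    if total <= 0:
        return biased  # all-zero input: nothing to renormalize
    return [[v / total for v in row] for row in biased]
```

Applied to a uniform saliency map, `apply_center_bias` simply returns the prior itself, which is one way to check that the weighting behaves as intended.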


Bibliographic details

Source
International Journal of Computer Vision | 2021, Issue 1 | 22 pages
Authors
Jiang Lai; Xu Mai; Wang Zulin; Sigal Leonid
Author affiliations

Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China;

Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China;

Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China;

Univ British Columbia, Dept Comp Sci, Vancouver, BC, Canada
Indexing information
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
Keywords
Deep neural networks; Saliency prediction; Convolutional LSTM; Eye-tracking database; Video; Video database
