IEEE International Conference on Multimedia and Expo

An iterative representation learning framework to predict the sequence of eye fixations



Abstract

Visual attention is a dynamic search process for acquiring information. However, most previous studies have focused on predicting static attended locations. Because they do not consider the temporal relationship between fixations, these models usually cannot explain dynamic saccadic behavior well. In this paper, an iterative representation learning framework is proposed to predict the saccadic scanpath. Within the proposed framework, a saccade is explained as an iterative process of finding the most uncertain area and updating the representation of the scene. In the implementation, a deep autoencoder is employed for representation learning. The current fixation is predicted to be the most salient pixel, with saliency estimated by the reconstruction residual of the deep network. Image patches around this fixation are then sampled to update the network for the selection of subsequent fixations. Compared with existing models, the proposed model shows state-of-the-art performance on several public data sets.
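
The abstract describes an iterative loop: estimate saliency as the per-location reconstruction residual of a deep autoencoder, take the most salient (most uncertain) location as the next fixation, then update the network with patches sampled around that fixation before predicting the next one. The sketch below is a minimal illustration of that loop, not the authors' implementation; the tiny fully connected autoencoder, the patch size, stride, learning rate, number of update steps, and the local-update neighborhood are all illustrative assumptions.

```python
# Minimal sketch (assumed implementation, not the paper's code) of the iterative
# scanpath idea: saliency = autoencoder reconstruction residual over patches;
# the next fixation is the patch with the largest residual; patches around that
# fixation are then used to update the autoencoder before the next step.
import torch
import torch.nn as nn
import torch.nn.functional as F


def extract_patches(image, patch_size, stride):
    """Slide a window over a grayscale image (H, W) and return flattened
    patches of shape (N, patch_size * patch_size) plus their center coordinates."""
    H, W = image.shape
    patches, centers = [], []
    for y in range(0, H - patch_size + 1, stride):
        for x in range(0, W - patch_size + 1, stride):
            patches.append(image[y:y + patch_size, x:x + patch_size].reshape(-1))
            centers.append((y + patch_size // 2, x + patch_size // 2))
    return torch.stack(patches), centers


class PatchAutoencoder(nn.Module):
    """Tiny fully connected autoencoder standing in for the scene representation."""
    def __init__(self, dim, hidden=64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(dim, hidden), nn.ReLU())
        self.decoder = nn.Linear(hidden, dim)

    def forward(self, x):
        return self.decoder(self.encoder(x))


def predict_scanpath(image, n_fixations=5, patch_size=16, stride=8,
                     update_steps=20, lr=1e-3):
    patches, centers = extract_patches(image, patch_size, stride)
    model = PatchAutoencoder(patches.shape[1])
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)

    # Warm-up on all patches so the first residual map reflects the whole scene.
    for _ in range(update_steps):
        optimizer.zero_grad()
        F.mse_loss(model(patches), patches).backward()
        optimizer.step()

    scanpath = []
    for _ in range(n_fixations):
        # 1. Saliency = per-patch reconstruction residual under the current model.
        with torch.no_grad():
            residual = ((model(patches) - patches) ** 2).mean(dim=1)
        idx = int(residual.argmax())          # most "uncertain" location
        scanpath.append(centers[idx])

        # 2. Update the representation with patches near the chosen fixation so
        #    that region becomes well explained and attention can move on
        #    (this local update acts as a soft inhibition of return).
        cy, cx = centers[idx]
        near = torch.tensor([abs(y - cy) <= patch_size and abs(x - cx) <= patch_size
                             for (y, x) in centers])
        local = patches[near]
        for _ in range(update_steps):
            optimizer.zero_grad()
            F.mse_loss(model(local), local).backward()
            optimizer.step()

    return scanpath


if __name__ == "__main__":
    # Random grayscale "image" just to show the loop runs end to end.
    torch.manual_seed(0)
    img = torch.rand(128, 128)
    print(predict_scanpath(img, n_fixations=3))
```

In the paper the residual of the deep network is computed densely to yield a pixel-level saliency map; the patch-grid version above only approximates that to keep the example short.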
