首页> 外文期刊>Neurocomputing >Deep gaze pooling: Inferring and visually decoding search intents from human gaze fixations
【24h】

Deep gaze pooling: Inferring and visually decoding search intents from human gaze fixations

机译:深度凝视池:从人类凝视注视推断和视觉解码搜索意图

获取原文
获取原文并翻译 | 示例
           

摘要

Predicting the target of visual search from human eye fixations (gaze) is a difficult problem with many applications, e.g. in human-computer interaction. While previous work has focused on predicting specific search target instances, we propose the first approach to predict categories and attributes of search intents from gaze data and to visually reconstruct plausible targets. However, state-of-the-art models for categorical recognition, in general, require large amounts of training data, which is prohibitive for gaze data. To address this challenge, we further propose a novel Gaze Pooling Layer that combines gaze information with visual representations from Deep Learning approaches. Our scheme incorporates both spatial and temporal aspects of human gaze behavior as well as the appearance of the fixated locations. We propose an experimental setup and novel dataset and demonstrate the effectiveness of our method for gaze-based search target prediction and reconstruction. We highlight several practical advantages of our approach, such as compatibility with existing architectures, no need for gaze training data, and robustness to noise from common gaze sources. (C) 2020 Elsevier B.V. All rights reserved.
机译:从人眼注视(注视)预测视觉搜索的目标是许多应用中的难题。在人机交互中。虽然先前的工作集中于预测特定的搜索目标实例,但我们提出了第一种方法,可以根据凝视数据预测搜索意图的类别和属性,并从视觉上重建合理的目标。但是,用于分类识别的最新模型通常需要大量的训练数据,这对于凝视数据是不允许的。为了应对这一挑战,我们进一步提出了一种新颖的凝视池层,该层将凝视信息与深度学习方法的视觉表示相结合。我们的方案结合了人类凝视行为的空间和时间方面以及固定位置的外观。我们提出了一个实验装置和新颖的数据集,并证明了我们的方法对基于凝视的搜索目标预测和重构的有效性。我们着重介绍了我们方法的一些实际优势,例如与现有架构的兼容性,无需注视训练数据以及对普通注视源产生的噪声的鲁棒性。 (C)2020 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2020年第28期|369-382|共14页
  • 作者

  • 作者单位

    Max Planck Inst Informat Saarland Informat Campus Campus E1 4 D-66123 Saarbrucken Germany;

    CISPA Helmholtz Ctr Informat Secur Saarland Informat Campus Stuhlsatzenhaus 5 D-66123 Saarbrucken Germany;

    Univ Stuttgart Inst Visualisat & Interact Syst Pfaffenwaldring 5a D-70569 Stuttgart Germany;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Gaze pooling; Visual search; Deep learning; Mental image; Visual search target prediction; Visual search target reconstruction;

    机译:注视汇集;视觉搜索;深度学习;心理形象;视觉搜索目标预测;视觉搜索目标重建;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号