IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops

Visual Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets

Abstract

Visual Focus of Attention (VFOA) estimation in conversation is challenging because it relies on difficult-to-estimate information (gaze) combined with scene features such as target positions and other contextual cues (speaking status) that help disambiguate situations. Previous VFOA models fusing all of these features are usually trained for a specific setup with a fixed number of interacting people, and must be retrained to be applied to a different one, which limits their usability. To address these limitations, we propose a novel deep learning method that encodes all input features as a fixed number of 2D maps, which makes the input more naturally processed by a convolutional neural network, provides scene normalization, and allows an arbitrary number of targets to be considered. Experiments performed on two publicly available datasets demonstrate that the proposed method can be trained in a cross-dataset fashion without loss in VFOA accuracy compared to intra-dataset training.
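
To make the map-based encoding concrete, below is a minimal sketch of the general idea rather than the authors' implementation: target positions, speaking status, and a gaze cue are rendered as a fixed set of 2D maps, so the channel count stays constant no matter how many targets are in the scene. The map size, Gaussian rendering, and the gaussian_map / encode_scene helpers are hypothetical choices made for illustration only.

import numpy as np

def gaussian_map(height, width, center_xy, sigma=3.0):
    # 2D Gaussian bump centred on a normalized (x, y) scene position.
    ys, xs = np.mgrid[0:height, 0:width]
    cx = center_xy[0] * (width - 1)
    cy = center_xy[1] * (height - 1)
    return np.exp(-((xs - cx) ** 2 + (ys - cy) ** 2) / (2.0 * sigma ** 2))

def encode_scene(targets, gaze_point, map_size=(64, 64)):
    # Encode an arbitrary number of targets plus a gaze cue as a fixed
    # number of 2D maps (target map, speaking map, gaze map).
    # targets: list of dicts with normalized 'position' (x, y) in [0, 1]^2
    #          and a boolean 'speaking' flag.
    # gaze_point: normalized (x, y) where the estimated gaze intersects the
    #             scene plane (hypothetical representation of the gaze cue).
    h, w = map_size
    target_map = np.zeros((h, w))
    speaking_map = np.zeros((h, w))
    for t in targets:
        bump = gaussian_map(h, w, t["position"])
        target_map = np.maximum(target_map, bump)
        if t["speaking"]:
            speaking_map = np.maximum(speaking_map, bump)
    gaze_map = gaussian_map(h, w, gaze_point)
    return np.stack([target_map, speaking_map, gaze_map])

# Example: three targets, one of them speaking; the output shape is the
# same regardless of how many targets are present.
maps = encode_scene(
    targets=[
        {"position": (0.2, 0.5), "speaking": True},
        {"position": (0.5, 0.3), "speaking": False},
        {"position": (0.8, 0.6), "speaking": False},
    ],
    gaze_point=(0.45, 0.35),
)
print(maps.shape)  # (3, 64, 64)

A fixed stack of maps like this can be fed to a standard convolutional network, which is what makes the representation independent of the number of targets and of the particular scene setup.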