...
首页> 外文期刊>ACM transactions on multimedia computing communications and applications >Learning a Deep Agent to Predict Head Movement in 360-Degree Images
【24h】

Learning a Deep Agent to Predict Head Movement in 360-Degree Images

机译:学习深层代理以预测360度图像中的头部运动

获取原文
获取原文并翻译 | 示例
           

摘要

Virtual reality adequately stimulates senses to trick users into accepting the virtual environment. To create a sense of immersion, high-resolution images are required to satisfy human visual system, and low latency is essential for smooth operations, which put great demands on data processing and transmission. Actually, when exploring in the virtual environment, viewers only perceive the content in the current field of view. Therefore, if we can predict the head movements that are important behaviors of viewers, more processing resources can be allocated to the active field of view. In this article, we propose a model to predict the trajectory of head movement. Deep reinforcement learning is employed to mimic the decision making. In our framework, to characterize each state, features for viewport images are extracted by convolutional neural networks. In addition, the spherical coordinate maps and visited maps are generated for each viewport image, which facilitate the multiple dimensions of the state information by considering the impact of historical head movement and position information. To ensure the accurate simulation of visual behaviors during the watching of panoramas, we stipulate that the model imitates the behaviors of human demonstrators. To allow the model to generalize to more conditions, the intrinsic motivation is employed to guide the agent's action toward reducing uncertainty, which can enhance robustness during the exploration. The experimental results demonstrate the effectiveness of the proposed stepwise head movement predictor.
机译:虚拟现实充分刺激感官,以欺骗用户接受虚拟环境。为了创建沉浸感,需要高分辨率图像来满足人类视觉系统,并且低延迟对于平滑操作至关重要,这对数据处理和传输提供了极大的要求。实际上,在虚拟环境中探索时,观众只会在当前视野中感知内容。因此,如果我们可以预测观看者的重要行为的头部运动,则可以将更多的处理资源分配给活动视野。在本文中,我们提出了一种模型来预测头部运动的轨迹。使用深度加强学习来模仿决策。在我们的框架中,为了表征每个状态,通过卷积神经网络提取视口图像的功能。另外,对于每个视口图像,为每个视口图像产生球形坐标映射和访问的映射,其通过考虑历史头移动和位置信息的影响而促进了状态信息的多个维度。为了确保在墙上观看拍摄期间的视觉行为准确模拟,我们规定了模型模仿人类示威者的行为。为了允许模型概括到更多条件,所用内在动机来指导代理人对减少不确定性的行动,这可以在勘探期间提高鲁棒性。实验结果表明了所提出的逐步头移动预测器的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号