IEEE Conference on Virtual Reality and 3D User Interfaces

LiveDeep: Online Viewport Prediction for Live Virtual Reality Streaming Using Lifelong Deep Learning



Abstract

Live virtual reality (VR) streaming has become a popular and fast-growing video application in the consumer market, providing users with 360-degree, immersive viewing experiences. Delivering a premium quality of experience, however, poses unique challenges due to the significantly increased bandwidth consumption. To address the bandwidth challenge, viewport prediction has been proposed as a viable solution, which predicts the user's viewport of interest and streams only that region to the VR device at high quality. However, most existing viewport prediction approaches target only video-on-demand (VOD) use cases, requiring offline processing of historical video and/or user data that are not available in the live streaming scenario. In this work, we develop a novel viewport prediction approach for live VR streaming that requires only the video content and user data of the current viewing session. To address the challenges of insufficient training data and real-time processing, we propose a live-VR-specific deep learning mechanism, namely LiveDeep, to build the viewport prediction model online and conduct real-time inference. LiveDeep employs a hybrid approach to address the unique challenges of live VR streaming, involving (1) an alternating online schedule of data collection, labeling, training, and inference with a controlled feedback loop, which accommodates the sparse training data; and (2) a mixture of neural network models, which compensates for the inaccuracy of any single model. We evaluate LiveDeep using 48 users and 14 VR videos of various types from a public VR head-movement dataset. The results indicate around 90% prediction accuracy, around 40% bandwidth savings, and low processing latency, meeting the bandwidth and real-time requirements of live VR streaming.
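The alternating collect/label/train/infer schedule with a feedback loop described in the abstract can be illustrated with a minimal sketch. Everything below is a hypothetical stand-in, not the authors' implementation: the tile count, the simulated per-chunk features, and the softmax-regression learner (the paper uses hybrid CNN models on actual video frames and head traces) are illustrative assumptions only.

```python
import numpy as np

rng = np.random.default_rng(0)
TILES = 8  # assumption: the 360-degree frame is split into 8 viewport tiles

def make_chunk(t, n=20):
    """Simulate one video chunk: per-frame features plus the user's true tile."""
    true_tile = (t // 2) % TILES       # head moves slowly across tiles
    X = rng.normal(size=(n, TILES))
    X[:, true_tile] += 2.0             # content feature correlated with gaze
    y = np.full(n, true_tile)
    return X, y

class OnlinePredictor:
    """Toy stand-in for the online model: softmax regression trained by SGD."""
    def __init__(self, d=TILES, k=TILES, lr=0.1):
        self.W = np.zeros((d, k))
        self.lr = lr

    def _probs(self, X):
        z = X @ self.W
        z -= z.max(axis=1, keepdims=True)   # numerical stability
        e = np.exp(z)
        return e / e.sum(axis=1, keepdims=True)

    def train(self, X, y, epochs=5):
        # Cross-entropy gradient: softmax probabilities minus one-hot labels.
        for _ in range(epochs):
            p = self._probs(X)
            p[np.arange(len(y)), y] -= 1.0
            self.W -= self.lr * X.T @ p / len(y)

    def predict(self, X):
        # Average frame-level probabilities, pick the most likely tile.
        return self._probs(X).mean(axis=0).argmax()

model = OnlinePredictor()
hits = 0
for t in range(20):              # alternate: collect -> infer -> train
    X, y = make_chunk(t)         # collect and label data in the current session
    pred = model.predict(X)      # infer the viewport before seeing the labels
    hits += int(pred == y[0])
    model.train(X, y)            # feedback loop: update on the fresh labels
print(f"online accuracy: {hits}/20")
```

The sketch captures the key constraint of live streaming: the model never sees historical sessions, only chunks of the session in progress, and the prediction for each chunk is made before that chunk's labels are used for training.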
