
A Deep Ranking Model for Spatio-Temporal Highlight Detection from a 360° Video



Abstract

We address the problem of highlight detection from a 360° video by summarizing it both spatially and temporally. Given a long 360° video, we spatially select pleasant-looking normal field-of-view (NFOV) segments from the unlimited field of view (FOV) of the 360° video, and temporally summarize it into a concise and informative highlight as a selected subset of subshots. We propose a novel deep ranking model named the Composition View Score (CVS) model, which produces a spherical score map of composition per video segment and determines which view is suitable for the highlight via a sliding window kernel at inference. To evaluate the proposed framework, we perform experiments on the Pano2Vid benchmark dataset (Su, Jayaraman, and Grauman 2016) and our newly collected 360° video highlight dataset from YouTube and Vimeo. Through evaluation using both quantitative summarization metrics and user studies via Amazon Mechanical Turk, we demonstrate that our approach outperforms several state-of-the-art highlight detection methods. We also show that our model is 16 times faster at inference than AutoCam (Su, Jayaraman, and Grauman 2016), which is one of the first summarization algorithms for 360° videos.
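
To make the inference step concrete: the abstract describes selecting the best NFOV view by sliding a window kernel over a per-segment spherical score map. The following Python sketch is only an illustration of that idea, not the paper's actual CVS model; it assumes the score map is stored as an equirectangular grid, aggregates each candidate window by its mean score, and wraps around the longitude seam. The function name best_nfov_view, the grid resolution, and the window size are all hypothetical.

    import numpy as np

    def best_nfov_view(score_map, kernel_h, kernel_w):
        # score_map: (H, W) equirectangular grid of composition scores,
        # rows ~ latitude, columns ~ longitude (wraps around 360°).
        H, W = score_map.shape
        # Pad horizontally so candidate windows can cross the longitude seam.
        padded = np.concatenate([score_map, score_map[:, :kernel_w]], axis=1)
        best_score, best_center = -np.inf, None
        for top in range(H - kernel_h + 1):
            for left in range(W):
                window = padded[top:top + kernel_h, left:left + kernel_w]
                score = window.mean()
                if score > best_score:
                    best_score = score
                    best_center = (top + kernel_h // 2, (left + kernel_w // 2) % W)
        return best_center, best_score

    # Example: a coarse 18 x 36 score grid with a roughly 60° x 90° window.
    scores = np.random.rand(18, 36)
    center, score = best_nfov_view(scores, kernel_h=6, kernel_w=9)
    print("best NFOV centre (row, col):", center, "score:", score)

Running this per video segment yields one candidate NFOV view per segment; the paper's temporal summarization then selects a subset of such subshots to form the highlight.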
