Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization

机译：具有成对深度排名的高亮检测，用于第一人称视频摘要

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The emergence of wearable devices such as portable cameras and smart glasses makes it possible to record life logging first-person videos. Browsing such long unstructured videos is time-consuming and tedious. This paper studies the discovery of moments of user's major or special interest (i.e., highlights) in a video, for generating the summarization of first-person videos. Specifically, we propose a novel pairwise deep ranking model that employs deep learning techniques to learn the relationship between high-light and non-highlight video segments. A two-stream network structure by representing video segments from complementary information on appearance of video frames and temporal dynamics across frames is developed for video highlight detection. Given a long personal video, equipped with the highlight detection model, a highlight score is assigned to each segment. The obtained highlight segments are applied for summarization in two ways: video time-lapse and video skimming. The former plays the highlight (non-highlight) segments at low (high) speed rates, while the latter assembles the sequence of segments with the highest scores. On 100 hours of first-person videos for 15 unique sports categories, our highlight detection achieves the improvement over the state-of-the-art RankSVM method by 10.5% in terms of accuracy. Moreover, our approaches produce video summary with better quality by a user study from 35 human subjects.

机译：诸如便携式相机和智能眼镜之类的可穿戴设备的出现使录制生活记录的第一人称视频成为可能。浏览这么长的非结构化视频既费时又乏味。本文研究了视频中用户最感兴趣或最感兴趣的时刻（即精彩瞬间）的发现，以生成第一人称视频的摘要。具体来说，我们提出了一种新颖的成对深度排名模型，该模型采用深度学习技术来学习高光和非高光视频片段之间的关系。通过从视频帧的外观和跨帧的时间动态方面的补充信息表示视频片段，开发出一种两流网络结构，用于视频高亮检测。给定配备了高光检测模型的较长的个人视频，则将高光得分分配给每个片段。所获得的精彩片段以两种方式应用于摘要：视频延时和视频剪辑。前者以低（高）速度播放亮点（非高亮）片段，而后者则组合得分最高的片段序列。在15个独特体育项目的100小时第一人称视频中，我们的重点检测功能使最新的RankSVM方法的准确性提高了10.5％。此外，我们的方法通过对35个人类受试者的用户研究产生了质量更高的视频摘要。

著录项

来源
《IEEE Conference on Computer Vision and Pattern Recognition》|2016年|982-990|共9页
会议地点
作者
Ting Yao; Tao Mei; Yong Rui;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Streaming media; Visualization; Image segmentation; Computer architecture; Three-dimensional displays; Cameras; Neural networks;

机译：流媒体可视化图像分割计算机体系结构三维显示器相机神经网络;

相似文献

外文文献
中文文献
专利

1. Video Summarization Using Highlight Detection and Pairwise Deep Ranking Model [J] . M. Sridevi, Mayuri Kharde Procedia Computer Science . 2020,第5期

机译：使用突出检测和成对深度排名模型的视频摘要
2. Video Highlight Detection via Region-Based Deep Ranking Model [J] . Jiao Yifan, Zhang Tianzhu, Huang Shucheng, International Journal of Pattern Recognition and Artificial Intelligence . 2019,第7期

机译：通过基于区域的深度排名模型进行视频精彩片段检测
3. Video Highlight Detection via Region-Based Deep Ranking Model [J] . Jiao Yifan, Zhang Tianzhu, Huang Shucheng, International Journal of Pattern Recognition and Artificial Intelligence . 2019,第7期

机译：通过基于区域的深度排名模型进行视频突出显示
4. Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization [C] . Ting Yao, Tao Mei, Yong Rui IEEE Conference on Computer Vision and Pattern Recognition . 2016

机译：突出显示检测与第一人称视频摘要的成对深度排名
5. Lecture Video Summarization by Detection and Representation of Content [D] . ?Urala Kota, Bhargava 2020

机译：通过检测和表示内容的讲课视频摘要
6. Ranking of Dystonia Severity by Pairwise Video Comparison [O] . Tobias Ellermeyer, Karen Otte, Felix Heinrich, 2016

机译：成对视频比较对肌张力障碍严重程度的排名
7. Video Highlights Detection and Summarization with Lag-Calibration based on Concept-Emotion Mapping of Crowd-sourced Time-Sync Comments [O] . Ping, Qing, Chen, Chaomei 2017

机译：视频亮点基于滞后校准的检测和摘要群众时间同步评论的概念 - 情感映射研究

Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization

摘要

著录项

相似文献

相关主题

期刊订阅