首页> 外文会议>IEEE Conference on Computer Vision and Pattern Recognition >Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization
【24h】

Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization

机译:具有成对深度排名的高亮检测,用于第一人称视频摘要

获取原文

摘要

The emergence of wearable devices such as portable cameras and smart glasses makes it possible to record life logging first-person videos. Browsing such long unstructured videos is time-consuming and tedious. This paper studies the discovery of moments of user's major or special interest (i.e., highlights) in a video, for generating the summarization of first-person videos. Specifically, we propose a novel pairwise deep ranking model that employs deep learning techniques to learn the relationship between high-light and non-highlight video segments. A two-stream network structure by representing video segments from complementary information on appearance of video frames and temporal dynamics across frames is developed for video highlight detection. Given a long personal video, equipped with the highlight detection model, a highlight score is assigned to each segment. The obtained highlight segments are applied for summarization in two ways: video time-lapse and video skimming. The former plays the highlight (non-highlight) segments at low (high) speed rates, while the latter assembles the sequence of segments with the highest scores. On 100 hours of first-person videos for 15 unique sports categories, our highlight detection achieves the improvement over the state-of-the-art RankSVM method by 10.5% in terms of accuracy. Moreover, our approaches produce video summary with better quality by a user study from 35 human subjects.
机译:诸如便携式相机和智能眼镜之类的可穿戴设备的出现使录制生活记录的第一人称视频成为可能。浏览这么长的非结构化视频既费时又乏味。本文研究了视频中用户最感兴趣或最感兴趣的时刻(即精彩瞬间)的发现,以生成第一人称视频的摘要。具体来说,我们提出了一种新颖的成对深度排名模型,该模型采用深度学习技术来学习高光和非高光视频片段之间的关系。通过从视频帧的外观和跨帧的时间动态方面的补充信息表示视频片段,开发出一种两流网络结构,用于视频高亮检测。给定配备了高光检测模型的较长的个人视频,则将高光得分分配给每个片段。所获得的精彩片段以两种方式应用于摘要:视频延时和视频剪辑。前者以低(高)速度播放亮点(非高亮)片段,而后者则组合得分最高的片段序列。在15个独特体育项目的100小时第一人称视频中,我们的重点检测功能使最新的RankSVM方法的准确性提高了10.5%。此外,我们的方法通过对35个人类受试者的用户研究产生了质量更高的视频摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号