首页> 美国卫生研究院文献>other >Gaze-enabled Egocentric Video Summarization via Constrained Submodular Maximization

【2h】

Gaze-enabled Egocentric Video Summarization via Constrained Submodular Maximization

机译：通过约束子模最大化实现凝视的自我中心视频汇总

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the proliferation of wearable cameras, the number of videos of users documenting their personal lives using such devices is rapidly increasing. Since such videos may span hours, there is an important need for mechanisms that represent the information content in a compact form (i.e., shorter videos which are more easily browsable/sharable). Motivated by these applications, this paper focuses on the problem of egocentric video summarization. Such videos are usually continuous with significant camera shake and other quality issues. Because of these reasons, there is growing consensus that direct application of standard video summarization tools to such data yields unsatisfactory performance. In this paper, we demonstrate that using gaze tracking information (such as fixation and saccade) significantly helps the summarization task. It allows meaningful comparison of different image frames and enables deriving personalized summaries (gaze provides a sense of the camera wearer's intent). We formulate a summarization model which captures common-sense properties of a good summary, and show that it can be solved as a submodular function maximization with partition matroid constraints, opening the door to a rich body of work from combinatorial optimization. We evaluate our approach on a new gaze-enabled egocentric video dataset (over 15 hours), which will be a valuable standalone resource.

机译：随着可穿戴式相机的激增，使用此类设备记录个人生活的用户视频数量正在迅速增加。由于此类视频可能会持续数小时，因此非常需要一种以紧凑形式表示信息内容的机制（即，更容易浏览/共享的较短视频）。受这些应用程序的激励，本文重点关注以自我为中心的视频摘要问题。此类视频通常是连续的，并带有明显的相机抖动和其他质量问题。由于这些原因，越来越多的共识认为，将标准视频摘要工具直接应用于此类数据会产生不令人满意的性能。在本文中，我们证明了使用凝视跟踪信息（例如注视和扫视）可以极大地帮助进行汇总。它可以对不同的图像帧进行有意义的比较，并可以得出个性化的摘要（凝视可让您了解相机佩戴者的意图）。我们制定了一个汇总模型，该模型捕获了良好摘要的常识属性，并表明可以将其解决为具有分区拟阵约束的子模函数最大化，从而为组合优化打开了广阔的大门。我们在新的启用凝视的自我中心视频数据集（超过15个小时）上评估了我们的方法，这将是宝贵的独立资源。

著录项

期刊名称 other
作者
Jia Xut; Lopamudra Mukherjee; Yin Li; Jamieson Warner; James M. Rehg; Vikas Singht;
展开▼
作者单位

展开▼
年(卷),期 -1(2015),-1
年度 -1
页码 2235–2244
总页数 21
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Summarizing egocentric videos using deep features and optimal clustering [J] . Sahu Abhimanyu, Chowdhury Ananda S. Neurocomputing . 2020,第Jul20期

机译：使用深度特征和最佳聚类来概述自我中心视频
2. Multiscale summarization and action ranking in egocentric videos [J] . Sahu Abhimanyu, Chowdhury Ananda S. Pattern recognition letters . 2020,第May期

机译：多尺度摘要和行动排名在EgoCentric视频中
3. Summarization of Egocentric Videos: A Comprehensive Survey [J] . Ana Garcia del Molino, Cheston Tan, Joo-Hwee Lim, Human-Machine Systems, IEEE Transactions on . 2017,第1期

机译：以自我为中心的视频摘要：综合调查
4. Gaze-enabled egocentric video summarization via constrained submodular maximization [C] . Jia Xu, Mukherjee Lopamudra, Yin Li, IEEE Conference on Computer Vision and Pattern Recognition . 2015

机译：通过受约束子模块最大化实现凝视启用的自我视频概述
5. Constrained expectation-maximization (EM), dynamic analysis, linear quadratic tracking, and nonlinear constrained expectation-maximization (EM) for the analysis of genetic regulatory networks and signal transduction networks. [D] . Xiong, Hao. 2008

机译：约束期望最大化（EM），动态分析，线性二次跟踪和非线性约束期望最大化（EM），用于分析遗传调控网络和信号转导网络。
6. Submodular Maximization via Gradient Ascent: The Case of Deep Submodular Functions [O] . Wenruo Bai, William S Noble, Jeff A. Bilmes -1

机译：通过梯度上升的亚模最大化：深亚模函数的情况
7. Gaze-enabled egocentric video summarization via constrained submodular maximization [O] . Jia Xu, Lopamudra Mukherjee, Yin Li, 2015

机译：通过受约束子模块最大化实现凝视启用的自我视频概述

Gaze-enabled Egocentric Video Summarization via Constrained Submodular Maximization

摘要

著录项

相似文献

相关主题

期刊订阅