首页> 外文会议>IEEE Winter Conference on Applications of Computer Vision >A Framework Towards Domain Specific Video Summarization
【24h】

A Framework Towards Domain Specific Video Summarization

机译:面向特定领域的视频摘要的框架

获取原文

摘要

In the light of exponentially increasing video content, video summarization has attracted a lot of attention recently due to its ability to optimize time and storage. Characteristics of a good summary of a video depend on the particular domain under question. We propose a novel framework for domain specific video summarization. Given a video of a particular domain, our system can produce a summary based on what is important for that domain in addition to possessing other desired characteristics like representation, coverage, diversity etc. as suitable to that domain. Past related work has focused either on using supervised approaches for ranking the snippets to produce summary or on using unsupervised approaches of generating the summary as a subset of snippets with the above characteristics. We look at the joint problem of learning domain specific importance of segments as well as the desired summary characteristic for that domain. Our studies show that the more efficient way of incorporating domain specific relevance into a summary is by obtaining ratings of shots as opposed to binary inclusion/exclusion information. We also argue that ratings can be seen as unified representation of all possible ground truth summaries of a video, taking us one step closer in dealing with challenges associated with multiple ground truth summaries of a video. We also propose a novel evaluation measure which is more naturally suited in assessing the quality of video summary for the task at hand than F1 like measures. It leverages the ratings information and is richer in appropriately modeling desirable and undesirable characteristics of a summary. Lastly, we release a gold standard dataset for furthering research in domain specific video summarization, which to our knowledge is the first dataset with long videos across several domains with rating annotations. We conduct extensive experiments to demonstrate the benefits of our proposed solution.
机译:鉴于视频内容呈指数增长,最近,视频摘要由于具有优化时间和存储的能力而备受关注。视频摘要的特征取决于所讨论的特定领域。我们为领域特定的视频摘要提出了一种新颖的框架。给定特定领域的视频,我们的系统除了具有适合该领域的其他所需特征(如表示,覆盖范围,多样性等)外,还可以基于对该领域重要的内容来生成摘要。过去的相关工作主要集中在使用受监督的方法对摘要进行排序以生成摘要,或使用不受监督的方法将摘要生成为具有上述特征的摘要的子集。我们着眼于学习细分的领域特定重要性以及该领域所需的摘要特性的共同问题。我们的研究表明,将领域特定相关性纳入摘要的更有效方法是通过获取镜头评级,而不是二进制包含/排除信息。我们还认为,收视率可以看作是视频的所有可能的地面事实摘要的统一表示,这使我们在处理与视频的多个地面事实摘要相关的挑战时更近了一步。我们还提出了一种新颖的评估方法,它比F1之类的方法更自然地适合评估手头任务的视频摘要的质量。它利用了评级信息,并且在适当地建模摘要的期望和不期望的特征方面更加丰富。最后,我们发布了黄金标准数据集,用于进一步研究特定于域的视频摘要,据我们所知,这是第一个包含具有分级注释的跨多个域的长视频的数据集。我们进行了广泛的实验,以证明我们提出的解决方案的好处。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号