首页> 外文会议>Internet Imaging >Estimation of Web video multiplicity
【24h】

Estimation of Web video multiplicity

机译:网络视频多重性估算

获取原文
获取原文并翻译 | 示例

摘要

Abstract: With ever more popularity of video web-publishing, many popular contents are being mirrored, reformatted, modified and republished, resulting in excessive content duplication. While such redundancy provides fault tolerance for continuous availability of information, it could potentially create problems for multimedia search engines in that the search results for a given query might become repetitious, and cluttered with a large number of duplicates. As such, developing techniques for detecting similarity and duplication is important to multimedia search engines. In addition, content providers might be interested in identifying duplicates of their content for legal, contractual or other business related reasons. In this paper, we propose an efficient algorithm called video signature to detect similar video sequences for large databases such as the web. The idea is to first form a 'signature' for each video sequence by selection a small number of its frames that are most similar to a number of randomly chosen seed images. Then the similarity between any tow video sequences can be reliably estimated by comparing their respective signatures. Using this method, we achieve 85 percent recall and precision ratios on a test database of 377 video sequences. As a proof of concept, we have applied our proposed algorithm to a collection of 1800 hours of video corresponding to around 45000 clips from the web. Our results indicate that, on average, every video in our collection from the web has around five similar copies. !27
机译:摘要:随着视频网络发布的日益普及,许多受欢迎的内容正在被镜像,重新格式化,修改和重新发布,从而导致过多的内容重复。尽管这种冗余为信息的连续可用性提供了容错能力,但它可能会给多媒体搜索引擎带来问题,因为给定查询的搜索结果可能会重复出现,并出现大量重复项。因此,开发用于检测相似性和重复性的技术对多媒体搜索引擎很重要。另外,出于法律,合同或其他与业务相关的原因,内容提供商可能会对识别其内容的副本感兴趣。在本文中,我们提出了一种有效的算法,称为视频签名,可以为大型数据库(例如Web)检测相似的视频序列。想法是首先通过选择与多数随机选择的种子图像最相似的少量帧,为每个视频序列形成一个“签名”。然后,可以通过比较它们各自的签名来可靠地估计任何两个视频序列之间的相似性。使用这种方法,我们在包含377个视频序列的测试数据库上实现了85%的查全率和查准率。作为概念的证明,我们已将我们提出的算法应用于1800小时的视频集合,对应于网络上的大约45000个剪辑。我们的结果表明,平均而言,我们网上收藏的每个视频都有大约五个相似的副本。 !27

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号