首页> 外文OA文献 >Multimedia copy detection using audio and video fingerprints
【2h】

Multimedia copy detection using audio and video fingerprints

机译:使用音频和视频指纹进行多媒体复制检测

摘要

According to a study by the International Data Corporation (IDC), the digital universe is doubling in size every two years to rich 44 trillion gigabytes by 2020. A large part of this big universe consists of audio and videos (e.g. music, TV shows and films), which are distributed over the Internet in an effortless way. This has increased the need for powerful tools to handle this data in terms of identification, filtering and retrieval. In this context, multimedia copy detection, which consists of identifying duplicate (or near duplicate) multimedia content, has become an emerging and active research area due to its broad applications. Multimedia copy detection can be used in a wide variety of applications such as broadcast monitoring, music identification, copyright control, law enforcement investigation and music library organization. Content-Based Copy Detection (CBCD) has been recently introduced as a solution to the problem of multimedia copy detection. This approach extracts fingerprints from a candidate copy and then compares them against fingerprints of the original content. However, audio and video signals are subjected to various kinds of transformations that make robust fingerprint extraction challenging. Thus, fingerprints should be robust to a variety of audio and video transformations and also discriminate against imposter fingerprints. In addition, the search of a candidate copy against a large dataset of fingerprints should be very fast.ududIn this thesis, we propose an efficient multimedia copy detection system that is highly robust to a variety of audio and video transformations. We first describe a new audio feature extraction schema that allows the generation of three kinds of audio fingerprints. We then address the problem of video copy detection and we describe two video fingerprint extraction algorithms. In addition, we propose a fusion technique that combines the results achieved separately from the audio and the video parts to tackle the problem of audio+video copy detection. In the last part of this thesis, we address the problem of fingerprint retrieval, and we propose two solutions to improve the speed of the search algorithm. In the first solution we propose to parallelize the similarity search algorithm by using a Graphics Processing Unit (GPU), whereas the second solution is based on a clustering technique.ududWe evaluate the proposed systems on the TRECVID 2009 and 2010 datasets, and we evaluate our approaches in terms of detection performance, localization accuracy and run time. In addition, we demonstrate the effectiveness of our methods by comparing them to several state-of-the-art audio and video copy detection systems.
机译:根据国际数据公司(IDC)的一项研究,到2020年,数字宇宙的规模每两年翻一番,达到44万亿千兆字节。这一庞大的宇宙很大一部分由音频和视频(例如音乐,电视节目和电影),它们可以轻松地通过Internet分发。这就增加了使用功能强大的工具来识别,过滤和检索数据的需求。在这种情况下,由于其广泛的应用,由识别重复的(或几乎重复的)多媒体内容组成的多媒体复制检测已成为新兴且活跃的研究领域。多媒体复制检测可用于多种应用,例如广播监视,音乐识别,版权控制,执法调查和音乐图书馆组织。最近引入了基于内容的复制检测(CBCD)作为多媒体复制检测问题的解决方案。该方法从候选副本中提取指纹,然后将其与原始内容的指纹进行比较。然而,音频和视频信号经过各种变换,这些变换使得鲁棒的指纹提取具有挑战性。因此,指纹对于各种音频和视频转换应具有鲁棒性,并且还应区分冒名顶替的指纹。此外,针对大型指纹数据集的候选副本的搜索应该非常快。 ud ud在本文中,我们提出了一种高效的多媒体副本检测系统,该系统对各种音频和视频转换具有高度鲁棒性。我们首先描述一种新的音频特征提取方案,该方案可以生成三种音频指纹。然后,我们解决了视频复制检测的问题,并描述了两种视频指纹提取算法。此外,我们提出了一种融合技术,将音频和视频部分分别获得的结果进行组合,以解决音频+视频复制检测的问题。在本文的最后一部分,我们解决了指纹检索的问题,并提出了两种解决方案来提高搜索算法的速度。在第一种解决方案中,我们建议使用图形处理单元(GPU)来并行化相似性搜索算法,而第二种解决方案是基于聚类技术。 ud ud我们在TRECVID 2009和2010数据集上评估提出的系统,并且我们根据检测性能,定位精度和运行时间评估我们的方法。此外,我们通过将它们与几种最新的音频和视频复制检测系统进行比较来证明我们方法的有效性。

著录项

  • 作者

    Ouali Chahid;

  • 作者单位
  • 年度 2016
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号