首页> 外文会议>International Conference on Multimedia Big Data >An Efficient Cascaded Filtering Retrieval Method for Big Audio Data
【24h】

An Efficient Cascaded Filtering Retrieval Method for Big Audio Data

机译:大音频数据的一种有效的级联滤波检索方法

获取原文

摘要

Fast audio retrieval is crucial for many important applications and yet demanding due to the high dimension nature and increasingly larger volume of audios in the internet. Although audio fingerprinting can greatly reduce its dimension while keeping audio identifiable, the dimension of audio fingerprints is still too high to scale up for big audio data. The tradeoff between the accuracy and the efficiency prevents the further reducing of the dimension of fingerprints. This paper proposes a multi-stage filtering strategy for audio retrieval, with the beginning stages focusing on speed up by using a middle fingerprint with much smaller size to quickly filtering the most likely audios, and the ending stages emphasizing on accuracy by applying an accurate and robust fingerprint on the small set of the most likely audios. A notion called middle fingerprint is devised with considerable small dimension for quickly filtering out most irrelevant audios. A matching algorithm is developed to reduce the computational complexity by comparing the samples at fixed interval of two audios with thresholds. By using the middle fingerprint, audio retrieval can get a speed gain of 12 times on average compared with the Fibonacci Hashing retrieval. By combing the Fibonacci hashing algorithm with the middle filtering retrieval and the matching algorithm, we propose an efficient cascaded filtering retrieval methods, which can further improve the retrieval by 250 times on average. After applying MP3 conversion, resampling, and random shearing, the recall rates of the method are all above 99.47%, and the theoretical accuracy is close to 100%.
机译:快速音频检索对于许多重要应用至关重要,但由于互联网的高维度特性和越来越大的音频量,因此要求很高。尽管音频指纹识别可以在保持音频可识别性的同时大大减小其尺寸,但是音频指纹的尺寸仍然太大,无法扩展到较大的音频数据。准确性和效率之间的权衡阻止了指纹尺寸的进一步减小。本文提出了一种用于音频检索的多阶段过滤策略,其开始阶段着重于通过使用尺寸较小的中间指纹来快速过滤最可能的音频来加快速度,而结束阶段则通过应用准确且准确的音频来强调准确性。一小部分最可能的音频上的坚固指纹。设计了一种称为“中间指纹”的概念,该概念具有相当小的尺寸,可以快速过滤掉大多数不相关的音频。通过将两个音频的固定间隔的样本与阈值进行比较,开发了一种匹配算法来降低计算复杂性。通过使用中间指纹,与斐波那契散列检索相比,音频检索平均可以获得12倍的速度增益。通过将Fibonacci哈希算法与中间过滤检索和匹配算法相结合,提出了一种有效的级联过滤检索方法,该方法可以将检索平均平均提高250倍。经过MP3转换,重采样和随机剪切后,该方法的查全率均在99.47%以上,理论精度接近100%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号