首页> 外文OA文献 >Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude
【2h】

Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude

机译:使用2D傅立叶变换幅度的大规模翻唱歌曲识别

摘要

Large-scale cover song recognition involves calculating item-to-item similarities that can accommodate differences in timing and tempo, rendering simple Euclidean measures unsuitable. Expensive solutions such as dynamic time warping do not scale to million of instances, making them inappropriate for commercial-scale applications. In this work, we transform a beat-synchronous chroma matrix with a 2D Fourier transform and show that the resulting representation has properties that fit the cover song recognition task. We can also apply PCA to efficiently scale comparisons. We report the best results to date on the largest available dataset of around 18,000 cover songs amid one million tracks, giving a mean average precision of 3.0%.
机译:大规模的翻唱歌曲识别涉及计算项目之间的相似度,以适应时间和节奏上的差异,从而使简单的欧几里得测度不适合。诸如动​​态时间规整之类的昂贵解决方案无法扩展到数百万个实例,这使其不适用于商业规模的应用程序。在这项工作中,我们使用2D傅立叶变换对心跳同步色度矩阵进行了变换,并证明了所得表示具有适合翻唱歌曲识别任务的特性。我们还可以应用PCA来有效地进行比较。我们报告了迄今为止最大的可用数据集,其中包括一百万首曲目中的约18,000首翻唱歌曲中的最佳结果,平均平均精度为3.0%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号