首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Quantitative Analysis of a Common Audio Similarity Measure
【24h】

Quantitative Analysis of a Common Audio Similarity Measure

机译:常见音频相似性度量的定量分析

获取原文
获取原文并翻译 | 示例
       

摘要

For music information retrieval tasks, a nearest neighbor classifier using the Kullback-Leibler divergence between Gaussian mixture models of songs' melfrequency cepstral coefficients is commonly used to match songs by timbre. In this paper, we analyze this distance measure analytically and experimentally by the use of synthesized MIDI files, and we find that it is highly sensitive to different instrument realizations. Despite the lack of theoretical foundation, it handles the multipitch case quite well when all pitches originate from the same instrument, but it has some weaknesses when different instruments play simultaneously. As a proof of concept, we demonstrate that a source separation frontend can improve performance. Furthermore, we have evaluated the robustness to changes in key, sample rate, and bitrate.
机译:对于音乐信息检索任务,通常使用在歌曲的中频倒谱系数的高斯混合模型之间使用Kullback-Leibler散度的最近邻分类器来按音色匹配歌曲。在本文中,我们通过使用合成的MIDI文件来分析和实验分析此距离量度,我们发现它对不同的乐器实现高度敏感。尽管缺乏理论基础,但当所有音高均来自同一乐器时,它可以很好地处理多音高情况,但是当不同乐器同时演奏时,它具有一些缺点。作为概念证明,我们证明了源分离前端可以提高性能。此外,我们评估了密钥,采样率和比特率变化的鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号