首页> 外文期刊>Information retrieval >Learning music similarity from relative user ratings
【24h】

Learning music similarity from relative user ratings

机译:从相对用户评价中学习音乐相似性

获取原文
获取原文并翻译 | 示例
           

摘要

Computational modelling of music similarity is an increasingly important part of personalisation and optimisation in music information retrieval and research in music perception and cognition. The use of relative similarity ratings is a new and promising approach to modelling similarity that avoids well known problems with absolute ratings. In this article, we use relative ratings from the MagnaTagATune dataset with new and existing variants of state-of-the-art algorithms and provide the first comprehensive and rigorous evaluation of this approach. We compare metric learning based on support vector machines (SVMs) and metric-learning-to-rank (MLR), including a diagonal and a novel weighted variant, and relative distance learning with neural networks (RDNN). We further evaluate the effectiveness of different high and low level audio features and genre data, as well as dimensionality reduction methods, weighting of similarity ratings, and different sampling methods. Our results show that music similarity measures learnt on relative ratings can be significantly better than a standard Euclidian metric, depending on the choice of learning algorithm, feature sets and application scenario. MLR and SVM outperform DMLR and RDNN, while MLR with weighted ratings leads to no further performance gain. Timbral and music-structural features are most effective, and all features jointly are significantly better than any other combination of feature sets. Sharing audio clips (but not the similarity ratings) between test and training sets improves performance, in particular for the SVM-based methods, which is useful for some applications scenarios.
机译:音乐相似性的计算模型已成为音乐信息检索和音乐感知与认知研究中个性化和优化的重要组成部分。相对相似性评级的使用是一种新的有前途的建模相似性的方法,可以避免众所周知的绝对评级问题。在本文中,我们将MagnaTagATune数据集的相对等级与最新算法和现有算法的变体一起使用,并对这种方法进行了首次全面而严格的评估。我们比较了基于支持向量机(SVM)和度量学习等级(MLR)(包括对角线和新颖的加权变量)以及基于神经网络的相对距离学习(RDNN)的度量学习。我们进一步评估了不同的高低级音频特征和体裁数据的有效性,以及降维方法,相似性等级的加权和不同的采样方法。我们的结果表明,根据学习算法,功能集和应用场景的选择,从相对评级中学到的音乐相似性度量可以明显优于标准的欧几里得度量。 MLR和SVM的性能优于DMLR和RDNN,而具有加权评级的MLR不会进一步提高性能。音色和音乐结构特征最有效,并且所有特征共同比特征集的任何其他组合明显更好。在测试集和训练集之间共享音频片段(而不是相似性评级)可以提高性能,尤其是对于基于SVM的方法而言,这对于某些应用场景很有用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号