...
首页> 外文期刊>IEEE transactions on multimedia >Audio thumbnailing of popular music using chroma-based representations
【24h】

Audio thumbnailing of popular music using chroma-based representations

机译:使用基于色度的表示法对流行音乐进行音频缩略图

获取原文
获取原文并翻译 | 示例
           

摘要

With the growing prevalence of large databases of multimedia content, methods for facilitating rapid browsing of such databases or the results of a database search are becoming increasingly important. However, these methods are necessarily media dependent. We present a system for producing short, representative samples (or "audio thumbnails") of selections of popular music. The system searches for structural redundancy within a given song with the aim of identifying something like a chorus or refrain. To isolate a useful class of features for performing such structure-based pattern recognition, we present a development of the chromagram, a variation on traditional time-frequency distributions that seeks to represent the cyclic attribute of pitch perception, known as chroma. The pattern recognition system itself employs a quantized chromagram that represents the spectral energy at each of the 12 pitch classes. We evaluate the system on a database of popular music and score its performance against a set of "ideal" thumbnail locations. Overall performance is found to be quite good, with the majority of errors resulting from songs that do not meet our structural assumptions.
机译:随着大型多媒体内容数据库的普及,促进这种数据库的快速浏览或数据库搜索结果的方法变得越来越重要。但是,这些方法必须依赖于媒体。我们提出了一种用于产生流行音乐选择的简短,有代表性的样本(或“音频缩略图”)的系统。该系统搜索给定歌曲中的结构冗余,以识别诸如合唱或副歌之类的东西。为了隔离用于执行此类基于结构的模式识别的有用类特征,我们介绍了色谱图的发展,它是传统时频分布的一种变体,旨在表示音调感知的循环属性,即色度。模式识别系统本身采用量化的色谱图,该色谱图表示12个音调类别中每一个的频谱能量。我们在流行音乐数据库中评估该系统,并针对一组“理想”缩略图位置对其性能进行评分。我们发现整体表现相当不错,其中大多数错误是由于歌曲不符合我们的结构假设而导致的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号