首页> 外文会议>IEEE Workshop on Multimedia Signal Processing >An attribute-based approach to audio description applied to segmenting vocal sections in popular music songs
【24h】

An attribute-based approach to audio description applied to segmenting vocal sections in popular music songs

机译:基于属性的音频描述方法应用于在流行的音乐歌曲中分段声音部分

获取原文

摘要

We present a descriptive approach for analyzing audio scenes that can comprise a mixture of audio sources. We apply this method to segment popular music songs into vocal and non-vocal sections. Unlike existing methods that directly rely on within-class feature similarities of acoustic sources, the proposed data-driven system is based on a training set where the acoustic sources are grouped by their perceptual or semantic attributes. Our audio analysis approach is based on a quantitative time-varying metric to measure the interaction between acoustic sources present in a scene developed using pattern recognition methods. Using the proposed system that is trained on a general sound effects library, we achieve less than ten percent vocal-section segmentation error and less than five percent false alarm rates when evaluated on a database of popular music recordings that spans four different genres (rock, hiphop, pop, and easy listening).
机译:我们介绍了用于分析可以包括音频源的混合的音频场景的描述性方法。我们将这种方法应用于将流行音乐歌曲分段为声乐和非声音部分。与直接依赖于类别的类别相似性的现有方法不同,所提出的数据驱动系统基于训练集,其中声源由其感知或语义属性分组。我们的音频分析方法基于定量时变度量,以测量使用模式识别方法开发的场景中存在的声源之间的相互作用。使用在一般声音效果库上培训的建议系统,我们达到了低于10%的声带分割误差,并且在跨越四种不同类型的流行音乐录制数据库时少于5%的误报率,如横跨不同类型(摇滚, Hiphop,Pop,易听)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号