首页> 外文会议>International Conference on Control, Decision and Information Technologies >Audio Tempo Estimation Method Improved by Rhythm Pattern and Data Augmentation
【24h】

Audio Tempo Estimation Method Improved by Rhythm Pattern and Data Augmentation

机译:节奏模式和数据增强改进了音频节奏估计方法

获取原文

摘要

Tempo is the intuitive attribute of audio music, since people could feel fast or slow expressively and detect salient pulses to form perceived tempo value naturally. Nonetheless, for some audio, the tempo value could be ambiguous due to complex metrical level, different composing habit and creating style. Even though most of audio have the predominant tempo with consensus between the listeners, the others could have two dominant tempi. The challenge and goal of tempo estimation is to discriminate the salient tempi, mostly one or two tempos, related to the metric level by analyzing the audio signal directly. In this study, we propose the rhythm patterns of long-term periodicity curve derived from tempogram to improve the saliency detection. Besides, the data augmentation method is also invented to conquer the deficiency and representative of the three training datasets. The performance is evaluated on three public datasets in which the accuracy of “GiantSteps” dataset even outperforms the state-of-the-art tempo estimator of convolutional neural network implementation.
机译:Tempo是音频音乐的直观属性,因为人们可以快速或慢慢地感到快速或缓慢,并且检测突出脉冲,以便自然形成感知的节奏值。尽管如此,对于某些音频,由于复杂的度量级别,不同的构图习惯和创造风格,节奏值可能是模糊的。尽管大多数音频在听众之间共有共识的主要节奏,但其他音频可以有两个优势速度。 Tempo估计的挑战和目标是通过直接分析音频信号来区分与度量水平相关的突出的Tempi,大多数一个或两个节奏。在这项研究中,我们提出了衍生自刺激图的长期周期性曲线的节奏模式,以改善显着性检测。此外,还发明了数据增强方法以征服三个训练数据集的缺陷和代表。在三个公共数据集中评估性能,其中“GiantSteps”数据集的准确性甚至优于卷积神经网络实现的最先进的节奏估计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号