首页> 外文期刊>IEEE transactions on multimedia >An effective music information retrieval method using three-dimensional continuous DP
【24h】

An effective music information retrieval method using three-dimensional continuous DP

机译:一种利用三维连续DP的有效音乐信息检索方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper describes a music information retrieval system that uses humming as the key for retrieval. Humming is an easy way for a user to input a melody. However, there are several problems with humming that degrade the retrieval of information. One problem is the human factor. Sometimes, people do not sing accurately, especially if they are inexperienced or unaccompanied. Another problem arises from signal processing. Therefore, a music information retrieval method should be sufficiently robust to surmount various humming errors and signal processing problems. A retrieval system has to extract the pitch from the user's humming. However, pitch extraction is not perfect. It often captures half or double pitches, which are harmonic frequencies of the true pitch, even if the extraction algorithms take the continuity of the pitch into account. Considering these problems, we propose a system that takes multiple pitch candidates into account. In addition to the frequencies of the pitch candidates, the confidence measures obtained from their powers are taken into consideration as well. We also propose the use of an algorithm with three dimensions that is an extension of the conventional Dynamic Programming (DP)algorithm, so that multiple pitch candidates can be treated. Moreover, in the proposed algorithm, DP paths are changed dynamically to take deltaPitches and IOIratios (inter-onset-interval) of input and reference notes into account in order to treat notes being split or unified. We carried out an evaluation experiment to compare the proposed system with a conventional system . When using three-pitch candidates with conference measure and IOI features, the top-ten retrieval accuracy was 94.1%. Thus, the proposed method gave a better retrieval performance than the conventional system.
机译:本文介绍了一种以嗡嗡声作为检索关键的音乐信息检索系统。嗡嗡声是用户输入旋律的简便方法。但是,嗡嗡声存在几个问题,这些问题会使信息的检索变差。一个问题是人为因素。有时,人们唱歌不准确,尤其是在没有经验或无人陪伴的情况下。另一个问题来自信号处理。因此,音乐信息检索方法应足够健壮,以克服各种嗡嗡声和信号处理问题。检索系统必须从用户的嗡嗡声中提取音高。但是,音高提取并不完美。即使提取算法考虑了音高的连续性,它也经常捕获一半或两倍的音高,这是真实音高的谐波频率。考虑到这些问题,我们提出了一种考虑多个音高候选者的系统。除了音高候选者的频率外,还应考虑从其功率获得的置信度。我们还建议使用具有三个维度的算法,这是常规动态规划(DP)算法的扩展,因此可以处理多个音高候选者。此外,在提出的算法中,DP路径被动态更改以考虑输入和参考音符的deltaPitchs和IOIratios(间隔时间间隔),以便处理被分割或统一的音符。我们进行了一个评估实验,将提出的系统与常规系统进行比较。当使用具有会议度量和IOI功能的三音候选者时,前十名的检索准确性为94.1%。因此,与常规系统相比,该方法具有更好的检索性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号