首页> 外文学位 >Algorithms for error-tolerant information retrieval from music databases using vocal input.
【24h】

Algorithms for error-tolerant information retrieval from music databases using vocal input.

机译:使用语音输入从音乐数据库中检索容错信息的算法。

获取原文
获取原文并翻译 | 示例

摘要

We present a system for searching a database of music through input queries provided through vocal input, i.e., humming a few bars of a desired song.; In order to ensure that the system performs well for the average person, a study of human humming skills was conducted to augment and extend the results of previous studies in music perception, recognition, and reproduction. We quantified the nature and frequency of errors typically introduced into vocal renditions of familiar and unfamiliar tunes, as well as the differences in performance between those with musical training and those without. The results of this study formed the basis of a series of algorithms designed to match an input query to its intended song stored in a database of music.; Algorithms developed for existing music information retrieval systems were evaluated against our collection of 172 hummed input query phrases and found to be inadequate in recognition accuracy. We created and tested more than 30 additional algorithms based in part on results obtained from our experimental study. New representations of music data such as duration contours and duration intervals were devised. An algorithm to extract tempo information from sparse and imprecise user data was developed.; Aspects of these individual efforts were eventually combined into an effective matching algorithm named RePReD. In 172 experimental trials, the algorithm correctly identified the intended song from a hummed input query in 68% of the trials for those with average vocal skills, and the correct song appears in the top ten reported results in 79% of the queries tested. Results for test subjects with no musical training were lower, at 46% and 58%, respectively. Based on our test data, the RePReD algorithm provides in real time higher matching accuracy than any other published system.
机译:我们提出了一种系统,该系统用于通过通过语音输入提供的输入查询来搜索音乐数据库,即,哼唱所需歌曲的几个小节。为了确保该系统对普通人而言性能良好,对人类哼唱技巧进行了研究,以扩大和扩展先前在音乐感知,识别和再现方面的研究成果。我们量化了通常会在熟悉和不熟悉的音调的声音演绎中引入的错误的性质和频率,以及接受过音乐训练的人和没有接受过音乐训练的人在演奏上的差异。这项研究的结果构成了一系列算法的基础,这些算法旨在使输入查询与存储在音乐数据库中的预期歌曲相匹配。针对现有音乐信息检索系统开发的算法已针对我们收集的172个嗡嗡作响的输入查询短语进行了评估,发现识别精度不足。我们部分基于从实验研究中获得的结果,创建并测试了30多种其他算法。设计了音乐数据的新表示形式,例如持续时间轮廓和持续时间间隔。开发了一种从稀疏和不精确的用户数据中提取速度信息的算法。这些个人努力的各个方面最终被合并为一个名为RePReD的有效匹配算法。在172个试验中,该算法在68%的试验中从声音输入查询中正确地识别了具有平均声音技能的目标歌曲,在79%的查询中,正确的歌曲出现在前十名报告的结果中。未经音乐训练的测试对象的结果较低,分别为46%和58%。根据我们的测试数据,RePReD算法实时提供比任何其他已发布系统更高的匹配精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号