首页> 外文学位 >Algorithms for error-tolerant information retrieval from music databases using vocal input.

【24h】

Algorithms for error-tolerant information retrieval from music databases using vocal input.

机译：使用语音输入从音乐数据库中检索容错信息的算法。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a system for searching a database of music through input queries provided through vocal input, i.e., humming a few bars of a desired song.; In order to ensure that the system performs well for the average person, a study of human humming skills was conducted to augment and extend the results of previous studies in music perception, recognition, and reproduction. We quantified the nature and frequency of errors typically introduced into vocal renditions of familiar and unfamiliar tunes, as well as the differences in performance between those with musical training and those without. The results of this study formed the basis of a series of algorithms designed to match an input query to its intended song stored in a database of music.; Algorithms developed for existing music information retrieval systems were evaluated against our collection of 172 hummed input query phrases and found to be inadequate in recognition accuracy. We created and tested more than 30 additional algorithms based in part on results obtained from our experimental study. New representations of music data such as duration contours and duration intervals were devised. An algorithm to extract tempo information from sparse and imprecise user data was developed.; Aspects of these individual efforts were eventually combined into an effective matching algorithm named RePReD. In 172 experimental trials, the algorithm correctly identified the intended song from a hummed input query in 68% of the trials for those with average vocal skills, and the correct song appears in the top ten reported results in 79% of the queries tested. Results for test subjects with no musical training were lower, at 46% and 58%, respectively. Based on our test data, the RePReD algorithm provides in real time higher matching accuracy than any other published system.

机译：我们提出了一种系统，该系统用于通过通过语音输入提供的输入查询来搜索音乐数据库，即，哼唱所需歌曲的几个小节。为了确保该系统对普通人而言性能良好，对人类哼唱技巧进行了研究，以扩大和扩展先前在音乐感知，识别和再现方面的研究成果。我们量化了通常会在熟悉和不熟悉的音调的声音演绎中引入的错误的性质和频率，以及接受过音乐训练的人和没有接受过音乐训练的人在演奏上的差异。这项研究的结果构成了一系列算法的基础，这些算法旨在使输入查询与存储在音乐数据库中的预期歌曲相匹配。针对现有音乐信息检索系统开发的算法已针对我们收集的172个嗡嗡作响的输入查询短语进行了评估，发现识别精度不足。我们部分基于从实验研究中获得的结果，创建并测试了30多种其他算法。设计了音乐数据的新表示形式，例如持续时间轮廓和持续时间间隔。开发了一种从稀疏和不精确的用户数据中提取速度信息的算法。这些个人努力的各个方面最终被合并为一个名为RePReD的有效匹配算法。在172个试验中，该算法在68％的试验中从声音输入查询中正确地识别了具有平均声音技能的目标歌曲，在79％的查询中，正确的歌曲出现在前十名报告的结果中。未经音乐训练的测试对象的结果较低，分别为46％和58％。根据我们的测试数据，RePReD算法实时提供比任何其他已发布系统更高的匹配精度。

著录项

作者
Kline, Richard Lewis.;
展开▼
作者单位

Rensselaer Polytechnic Institute.;

展开▼
授予单位 Rensselaer Polytechnic Institute.;
学科 Computer Science.; Engineering Electronics and Electrical.
学位 Ph.D.
年度 2002
页码 120 p.
总页数 120
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. SATIN: a persistent musical database for music information retrieval and a supporting deep learning experiment on song instrumental classification [J] . Bayle Yann, Robine Matthias, Hanna Pierre Multimedia Tools and Applications . 2019,第3期

机译：SATIN：用于音乐信息检索的持久音乐数据库，以及有关歌曲乐器分类的辅助深度学习实验
2. A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval [J] . Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第3期

机译：伴奏声音的鲁棒性歌唱建模及其在歌手识别和基于音色相似性的音乐信息检索中的应用
3. A revised cross-section database for gas retrieval in the UV-visible-near IR range, applied to the GOMOS retrieval algorithm AerGOM [J] . Christine Bingen, Charles E. Robert, Christian Hermans, Frontiers in Environmental Science . 2019,第9期

机译：修改后的横截面数据库，用于在UV-可见-近红外范围内进行气体检索，将其应用于GOMOS检索算法AerGOM
4. Approximate matching algorithms for music information retrieval using vocal input [C] . Richard L. Kline, Ephraim P. Glinert ACM international conference on Multimedia . 2003

机译：使用声音输入检索音乐信息的近似匹配算法
5. The Amazing Composobot: Music Information Retrieval and Algorithmic Composition [D] . Walker, Marcus. 2018

机译：Amazing Composobot：音乐信息检索和算法组成
6. Investigating country-specific music preferences and music recommendation algorithms with the LFM-1b dataset [O] . Markus Schedl -1

机译：使用LFM-1b数据集调查特定国家/地区的音乐偏好和音乐推荐算法
7. Towards machine musicians who have listened to more music than us : audio database-led algorithmic criticism for automatic composition and live concert systems. [O] . Collins Nick 2016

机译：对于听音乐比我们更多的机器音乐家：音频数据库主导的算法批评，涉及自动作曲和现场音乐会系统。

Algorithms for error-tolerant information retrieval from music databases using vocal input.

摘要

著录项

相似文献

相关主题

期刊订阅