Published in: IEEE Transactions on Audio, Speech, and Language Processing

Melody Transcription From Music Audio: Approaches and Evaluation

Abstract

Although the process of analyzing an audio recording of a music performance is complex and difficult even for a human listener, there are limited forms of information that may be tractably extracted and yet still enable interesting applications. We discuss melody (roughly, the part a listener might whistle or hum) as one such reduced descriptor of music audio, and consider how to define it and what use it might be. We go on to describe the results of full-scale evaluations of melody transcription systems conducted in 2004 and 2005, including an overview of the systems submitted, details of how the evaluations were conducted, and a discussion of the results. For our definition of melody, current systems can achieve around 70% correct transcription at the frame level, including distinguishing between the presence or absence of the melody. Melodies transcribed at this level are readily recognizable, and show promise for practical applications.
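
To make the frame-level figure concrete, the following is a minimal sketch of how a frame-level accuracy that also accounts for melody presence or absence might be computed. The abstract does not give the scoring details, so everything here is an assumption for illustration: the function name `overall_accuracy`, the convention that 0.0 Hz marks a frame with no melody, and the 50-cent pitch tolerance.

```python
import math


def overall_accuracy(ref_hz, est_hz, tol_cents=50.0):
    """Return the fraction of frames labelled correctly.

    ref_hz, est_hz: per-frame pitch in Hz; 0.0 marks a frame with no melody
    (an assumed convention, not taken from the abstract).
    tol_cents: assumed pitch tolerance; 50 cents is a quarter tone.
    """
    if len(ref_hz) != len(est_hz):
        raise ValueError("sequences must have the same number of frames")
    if not ref_hz:
        raise ValueError("need at least one frame")

    correct = 0
    for ref, est in zip(ref_hz, est_hz):
        if ref <= 0 and est <= 0:
            # Both agree that no melody is present in this frame.
            correct += 1
        elif ref > 0 and est > 0:
            # Both report a pitch: compare on a logarithmic (cents) scale.
            cents = abs(1200.0 * math.log2(est / ref))
            if cents <= tol_cents:
                correct += 1
        # Otherwise one side reports melody and the other does not: an error.
    return correct / len(ref_hz)


# Hypothetical five-frame example: two silent frames, one close pitch match,
# one octave error, and one missed melody frame.
reference = [0.0, 220.0, 220.0, 246.94, 0.0]
estimate = [0.0, 221.0, 440.0, 0.0, 0.0]
accuracy = overall_accuracy(reference, estimate)
```

In this toy input three of the five frames count as correct. Under this scoring an octave error counts as wrong, as does a frame where the melody is missed. A system's score therefore depends on both pitch tracking and voicing detection.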