首页> 外文期刊>International journal of computational intelligence systems >A Cognitive Method for Musicology Based Melody Transcription
【24h】

A Cognitive Method for Musicology Based Melody Transcription

机译:基于音乐学的旋律转录的认知方法

获取原文
获取原文并翻译 | 示例
       

摘要

This paper describes a method for transcribing the main structure of polyphonic music audio automatically by analyzing musical tonality related musicological information. Music transcription is a difficult topic in Music Information Retrieval (MIR) which contains many tasks to recognize all the elements of music. Among all these musical factors, tonality is a basic but elusive one, and plays as the core in organizing music structure. So by making an investigation into cognition and musicology, we propose an approach to transcribe major component of polyphonic music from the point of view of tonality. Unlike previous studies, our method focuses on music cognition theory instead of signal processing and pattern classification. Basing on our former work on key finding and chord recognition, we introduce a new cognitive feature called auditory saliency (AS), which contains both statistically modeled information about human's acoustic attention and psychology measured data on human's musical perception, to recognize the main part of the musicmelody stream. Constant Q transform which is a more musical STFT (Short-Time Fourier Transform), and a temporal ANN (Artificial Neuro Network) are also used in our framework. There are also a few musicology and signal processing based techniques designed to improve our method such as melody structural vertical regulating and onset detection. Our proposed method has been tested objectively (F-measure of pitch detection is 0.76 with a perfect musicological AS information) and subjectively (above 90% of transcribed segments are accepted by professional reviewers), and shows some inspiring results.
机译:本文介绍了一种通过分析与音调有关的音乐学信息自动转录和弦音乐音频主要结构的方法。音乐转录是音乐信息检索(MIR)中的一个困难主题,其中包含许多任务,以识别音乐的所有元素。在所有这些音乐因素中,调性是一个基本但难以捉摸的因素,并且是组织音乐结构的核心。因此,通过对认知和音乐学的研究,我们提出了一种从音调的角度抄录复音音乐主要成分的方法。与以前的研究不同,我们的方法侧重于音乐认知理论,而不是信号处理和模式分类。在我们以前的主要发现和和弦识别工作的基础上,我们引入了一种称为听觉显着性(AS)的新认知功能,该功能既包含有关人类声学注意力的统计建模信息,也包含有关人类音乐感知的心理学测量数据,以识别音乐旋律流。在我们的框架中还使用了常数Q变换,这是一种更具音乐性的STFT(短时傅立叶变换)和时态ANN(人工神经网络)。还有一些基于音乐学和信号处理的技术,旨在改善我们的方法,例如旋律结构垂直调节和开始检测。我们提出的方法经过客观测试(音调检测的F测度为0.76,具有完善的音乐学AS信息)和主观测试(超过90%的转录片段被专业评论家接受),并显示出一些令人鼓舞的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号