首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation
【24h】

A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation

机译:用于音高估计和音乐音频源分离的具有音乐动机的中级表示

获取原文
获取原文并翻译 | 示例
           

摘要

When designing an audio processing system, the target tasks often influence the choice of a data representation or transformation. Low-level time–frequency representations such as the short-time Fourier transform (STFT) are popular, because they offer a meaningful insight on sound properties for a low computational cost. Conversely, when higher level semantics, such as pitch, timbre or phoneme, are sought after, representations usually tend to enhance their discriminative characteristics, at the expense of their invertibility. They become so-called mid-level representations. In this paper, a source/filter signal model which provides a mid-level representation is proposed. This representation makes the pitch content of the signal as well as some timbre information available, hence keeping as much information from the raw data as possible. This model is successfully used within a main melody extraction system and a lead instrument/accompaniment separation system. Both frameworks obtained top results at several international evaluation campaigns.
机译:在设计音频处理系统时,目标任务通常会影响数据表示或转换的选择。诸如短时傅立叶变换(STFT)之类的低级时频表示很受欢迎,因为它们以低的计算成本提供了对声音属性的有意义的洞察。相反,当寻求诸如音调,音色或音素之类的高级语义时,表示通常倾向于以其可逆性为代价来增强其区分特性。它们成为所谓的中级表示形式。在本文中,提出了一种提供中级表示的源/滤波器信号模型。这种表示使信号的音高内容以及一些音色信息可用,因此从原始数据中保留了尽可能多的信息。该模型已成功用于主旋律提取系统和主奏乐器/伴奏分离系统中。这两个框架在几次国际评估活动中均取得了最高成果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号