A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation

Durrieu J.-L.; David B.; Richard G.

Abstract

When designing an audio processing system, the target tasks often influence the choice of a data representation or transformation. Low-level time–frequency representations such as the short-time Fourier transform (STFT) are popular because they offer meaningful insight into sound properties at a low computational cost. Conversely, when higher-level semantics, such as pitch, timbre or phoneme, are sought after, representations usually tend to enhance their discriminative characteristics, at the expense of their invertibility. They become so-called mid-level representations. In this paper, a source/filter signal model which provides a mid-level representation is proposed. This representation makes the pitch content of the signal as well as some timbre information available, hence keeping as much information from the raw data as possible. This model is successfully used within a main melody extraction system and a lead instrument/accompaniment separation system. Both frameworks obtained top results at several international evaluation campaigns.
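As a rough illustration of the contrast the abstract draws, the sketch below computes a low-level STFT power spectrogram and then derives a simple pitch salience map from it. This is not the source/filter model proposed in the paper: it swaps in plain harmonic summation, and the function names, frame sizes, candidate pitch grid and synthetic test tone are all assumptions chosen for the example.

```python
import numpy as np


def power_spectrogram(x, frame_len=2048, hop=512):
    """Squared-magnitude STFT, shape (frame_len // 2 + 1, n_frames)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop : i * hop + frame_len] * window for i in range(n_frames)],
        axis=1,
    )
    return np.abs(np.fft.rfft(frames, axis=0)) ** 2


def harmonic_salience(power, sr, frame_len, f0_grid, n_harmonics=5):
    """Sum spectral power at the first harmonics of each candidate f0.

    Plain harmonic summation, used here only as a stand-in for a
    pitch-oriented mid-level representation (hypothetical helper).
    """
    n_bins = power.shape[0]
    salience = np.zeros((len(f0_grid), power.shape[1]))
    for k, f0 in enumerate(f0_grid):
        harmonics = np.arange(1, n_harmonics + 1) * f0
        bins = np.rint(harmonics * frame_len / sr).astype(int)
        bins = bins[bins < n_bins]  # drop harmonics above Nyquist
        salience[k] = power[bins].sum(axis=0)
    return salience


# Synthetic input (assumed for the example): 220 Hz tone with five harmonics.
sr = 16000
t = np.arange(sr) / sr
tone = sum(np.sin(2 * np.pi * 220.0 * h * t) / h for h in range(1, 6))

f0_grid = np.arange(100.0, 500.0, 1.0)  # candidate pitches in Hz
power = power_spectrogram(tone)
salience = harmonic_salience(power, sr, 2048, f0_grid)

# Time-averaged salience peaks near the true fundamental.
f0_est = f0_grid[salience.mean(axis=1).argmax()]
```

The salience map makes pitch explicit along one axis but, unlike the spectrogram it is computed from, cannot be inverted back to a waveform, which is the trade-off the abstract describes.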

Bibliographic details

Source: IEEE Journal of Selected Topics in Signal Processing, 2011, no. 6, pp. 1180-1191 (12 pages)
Authors: Durrieu J.-L.; David B.; Richard G.
Author affiliation: Signal Processing Laboratories (LTS5), Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
Original format: PDF
Language: English
Keywords: Audio melody extraction; audio signal representation; musical audio source separation; non-negative matrix factorization (NMF); pitch estimation

Similar documents

1. Score-Informed Source Separation for Musical Audio Recordings: An Overview [J]. IEEE Signal Processing Magazine. 2014, no. 3.
2. A Classifier-Based Approach to Score-Guided Source Separation of Musical Audio [J]. Christopher Raphael. Computer Music Journal. 2008, no. 1.
3. A Mid-Level Representation for Melody-Based Retrieval in Audio Collections [J]. Marolt M. IEEE Transactions on Multimedia. 2008, no. 8.
4. Under-Determined Reverberant Audio Source Separation Using Local Observed Covariance and Auditory-Motivated Time-Frequency Representation [C]. Ngoc Q.K. Duong, Emmanuel Vincent, Rémi Gribonval. Latent Variable Analysis and Signal Separation. 2010.
5. A musically motivated approach to spatial audio for large venues [D]. Etlinger, David. 2009.
6. Decoding the dynamic representation of musical pitch from human brain activity [O]. N. Sankaran, W. F. Thompson, S. Carlile.
7. Under-Determined Reverberant Audio Source Separation Using Local Observed Covariance and Auditory-Motivated Time-Frequency Representation [O]. Duong, Ngoc; Vincent, Emmanuel; Gribonval, Rémi. 2010.
