Non-negative Hidden Markov Modeling of Audio with Application to Source Separation

Abstract

In recent years, there has been a great deal of work on modeling audio using non-negative matrix factorization and its probabilistic counterparts, as they yield rich models that are very useful for source separation and automatic music transcription. Given a sound source, these algorithms learn a dictionary of spectral vectors to best explain it. This dictionary is, however, learned in a manner that disregards a very important aspect of sound: its temporal structure. We propose a novel algorithm, the non-negative hidden Markov model (N-HMM), that extends the aforementioned models by jointly learning several small spectral dictionaries as well as a Markov chain that describes the structure of changes between these dictionaries. We also extend this algorithm to the non-negative factorial hidden Markov model (N-FHMM) to model sound mixtures, and demonstrate that it yields superior performance in single-channel source separation tasks.
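
As a rough illustration of the baseline the abstract refers to, the following is a minimal sketch of non-negative matrix factorization with multiplicative updates for the generalized KL divergence. It is not the N-HMM or N-FHMM proposed in the paper. The function names `nmf_kl` and `kl_divergence`, the parameter defaults, and the random test matrix are assumptions made for this example; in practice `V` would be a magnitude spectrogram with frequency bins as rows and time frames as columns.

```python
import numpy as np


def kl_divergence(V, V_hat, eps=1e-12):
    """Generalized KL divergence D(V || V_hat), summed over all entries."""
    return float(np.sum(V * np.log((V + eps) / (V_hat + eps)) - V + V_hat))


def nmf_kl(V, n_components, n_iter=200, seed=0, eps=1e-12):
    """Factor a non-negative matrix V (freq x frames) as W @ H.

    W (freq x n_components) plays the role of the spectral dictionary;
    H (n_components x frames) holds the per-frame activations.
    Hypothetical helper written for illustration only.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, n_components)) + eps
    H = rng.random((n_components, n_frames)) + eps

    for _ in range(n_iter):
        # Multiplicative update for the activations.
        WH = W @ H + eps
        H *= (W.T @ (V / WH)) / (W.sum(axis=0)[:, None] + eps)
        # Multiplicative update for the dictionary.
        WH = W @ H + eps
        W *= ((V / WH) @ H.T) / (H.sum(axis=1)[None, :] + eps)

    # Rescale so each dictionary column sums to 1; W @ H is unchanged.
    scale = W.sum(axis=0, keepdims=True)
    W /= scale
    H *= scale.T
    return W, H


# Synthetic stand-in for a spectrogram: 16 frequency bins, 40 frames.
rng = np.random.default_rng(1)
V = rng.random((16, 3)) @ rng.random((3, 40))
W, H = nmf_kl(V, n_components=3)
print("KL divergence after fitting:", kl_divergence(V, W @ H))
```

Note that the updates treat the columns of `V` as an unordered collection of frames: nothing in the objective depends on which frame follows which. That is the limitation the abstract points to, and the one the N-HMM is said to address by learning several small dictionaries together with a Markov chain over them.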


Bibliographic details

Source
Latent Variable Analysis and Signal Separation | 2010 | pp. 140-148 | 9 pages
Conference location: St. Malo (FR)
Authors
Gautham J. Mysore; Paris Smaragdis; Bhiksha Raj
Author affiliations

Center for Computer Research in Music and Acoustics, Stanford University;

Advanced Technology Labs, Adobe Systems Inc;

School of Computer Science, Carnegie Mellon University;

Original format: PDF
Language: English
Chinese Library Classification: Information processing

Similar literature

1. Monophonic constrained non-negative sparse coding using instrument models for audio separation and transcription of monophonic source-based polyphonic mixtures [J]. Francisco Jose Rodriguez-Serrano, Julio Jose Carabias-Orti, Pedro Vera-Candeas. Multimedia Tools and Applications. 2014, no. 1
2. Blind separation of non-stationary sources using continuous density hidden Markov models [J]. Gu F., Zhang H., Zhu D. Digital Signal Processing. 2013, no. 5
3. Modelling covariance matrices by the trigonometric separation strategy with application to hidden Markov models [J]. Spezia Luigi. Test: An Official Journal of the Spanish Society of Statistics and Operations Research. 2019, no. 2
4. Non-negative Hidden Markov Modeling of Audio with Application to Source Separation [C]. Gautham J. Mysore, Paris Smaragdis, Bhiksha Raj. International Conference on Latent Variable Analysis and Signal Separation. 2010
5. A system for acoustic chord transcription and key extraction from audio using hidden Markov models trained on synthesized audio [D]. Lee, Kyogu. 2008
6. Nonparametric model validations for hidden Markov models with applications in financial econometrics [O]. Zhibiao Zhao
7. Audio imputation using the non-negative hidden Markov model [O]. Jinyu Han, Gautham J. Mysore, Bryan Pardo. 2012
