首页> 外文会议>Latent variable analysis and signal separation >Non-negative Hidden Markov Modeling of Audio with Application to Source Separation
【24h】

Non-negative Hidden Markov Modeling of Audio with Application to Source Separation

机译:音频的非负隐马尔可夫建模及其在信号源分离中的应用

获取原文
获取原文并翻译 | 示例

摘要

In recent years, there has been a great deal of work in modeling audio using non-negative matrix factorization and its probabilistic counterparts as they yield rich models that are very useful for source separation and automatic music transcription. Given a sound source, these algorithms learn a dictionary of spectral vectors to best explain it. This dictionary is however learned in a manner that disregards a very important aspect of sound, its temporal structure. We propose a novel algorithm, the non-negative hidden Markov model (N-HMM), that extends the aforementioned models by jointly learning several small spectral dictionaries as well as a Markov chain that describes the structure of changes between these dictionaries. We also extend this algorithm to the non-negative factorial hidden Markov model (N-FHMM) to model sound mixtures, and demonstrate that it yields superior performance in single channel source separation tasks.
机译:近年来,在使用非负矩阵分解及其概率模型对音频进行建模方面已经进行了大量工作,因为它们产生了丰富的模型,这些模型对于源分离和自动音乐转录非常有用。给定一个声源,这些算法将学习频谱向量字典以最好地解释它。然而,以忽略声音的非常重要的方面,其时间结构的方式来学习该字典。我们提出了一种新颖的算法,即非负隐马尔可夫模型(N-HMM),该算法通过共同学习几个小的光谱字典以及描述这些字典之间变化结构的马尔可夫链来扩展上述模型。我们还将这种算法扩展到非负因式隐马尔可夫模型(N-FHMM),以对声音混合进行建模,并证明该算法在单通道源分离任务中表现出优异的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号