A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models

Wohlmayr M.Stark M.Pernkopf F.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models

【24h】

A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models

机译：基于因子隐马尔可夫模型的多音调跟踪概率交互模型

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We present a simple and efficient feature modeling approach for tracking the pitch of two simultaneously active speakers. We model the spectrogram features of single speakers using Gaussian mixture models in combination with the minimum description length model selection criterion. To obtain a probabilistic representation for the speech mixture spectrogram features of both speakers, we employ the mixture maximization model (MIXMAX) and, as an alternative, a linear interaction model. A factorial hidden Markov model is applied for tracking pitch over time. This statistical model can be used for applications beyond speech, whenever the interaction between individual sources can be represented as MIXMAX or linear model. For tracking, we use the loopy max-sum algorithm, and provide empirical comparisons to exact methods. Furthermore, we discuss a scheduling mechanism of loopy belief propagation for online tracking. We demonstrate experimental results using Mocha-TIMIT as well as data from the speech separation challenge provided by Cooke We show the excellent performance of the proposed method in comparison to a well known multipitch tracking algorithm based on correlogram features. Using speaker-dependent models, the proposed method improves the accuracy of correct speaker assignment, which is important for single-channel speech separation. In particular, we are able to reduce the overall tracking error by 51% relative for the speaker-dependent case. Moreover, we use the estimated pitch trajectories to perform single-channel source separation, and demonstrate the beneficial effect of correct speaker assignment on speech separation performance.

机译：我们提出了一种简单有效的特征建模方法，用于跟踪两个同时活动的扬声器的音调。我们使用高斯混合模型结合最小描述长度模型选择标准对单个扬声器的频谱图特征进行建模。为了获得两个讲话者的语音混合声谱图特征的概率表示，我们采用了混合最大化模型（MIXMAX）和线性交互模型。应用阶乘隐式马尔可夫模型来跟踪随时间变化的音调。每当单个来源之间的交互可以表示为MIXMAX或线性模型时，该统计模型就可以用于语音以外的应用。对于跟踪，我们使用循环最大和算法，并提供对精确方法的经验比较。此外，我们讨论了用于在线跟踪的循环信念传播的调度机制。我们演示了使用Mocha-TIMIT以及来自Cooke提供的语音分离挑战的数据的实验结果。与基于相关图特征的众所周知的多音高跟踪算法相比，我们展示了所提出方法的出色性能。使用与说话者相关的模型，该方法提高了正确分配说话者的准确性，这对于单通道语音分离非常重要。特别是，对于说话者相关的情况，我们能够将整体跟踪误差降低51％。此外，我们使用估计的音调轨迹执行单通道源分离，并证明正确的扬声器分配对语音分离性能的有益影响。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2011年第4期|p.799-810|共12页
作者
Wohlmayr M.Stark M.Pernkopf F.;
展开▼
作者单位

Signal Processing & Speech Communication Laboratory, Graz University of Technology, Graz, Austria;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Factorial hidden Markov model (FHMM); Gaussian mixture model (GMM); mixture maximization; multipitch tracking; speech analysis;

机译：因子隐马尔可夫模型（FHMM）;高斯混合模型（GMM）;混合最大化;多音高跟踪;语音分析;

相似文献

外文文献
中文文献
专利

1. Source/Filter Factorial Hidden Markov Model, With Application to Pitch and Formant Tracking [J] . Durrieu J.-L., Thiran J.-P. Audio, Speech, and Language Processing, IEEE Transactions on . 2013,第12期

机译：源/滤波器阶乘隐马尔可夫模型及其在音高和共振峰跟踪中的应用
2. Object Tracking and Tracing:Hidden Semi-Markov Model Based Probabilistic Location Determination [J] . WU Jie, WANG Dong, SHENG Huan-ye 上海交通大学学报（英文版） . 2011,第004期
3. A Hidden Markov Model for Single Particle Tracks Quantifies Dynamic Interactions between LFA-1 and the Actin Cytoskeleton [J] . Raibatak Das, Christopher W. Cairo, Daniel Coombs PLoS Computational Biology . 2009,第11期

机译：单粒子轨道的隐马尔可夫模型量化了LFA-1和肌动蛋白细胞骨架之间的动态相互作用。
4. Finite Mixture Spectrogram Modeling for Multipitch Tracking Using A Factorial Hidden Markov Model [C] . Michael Wohlmayr, Franz Pernkopf International Speech Communication Association . 2009

机译：使用因子隐马尔可夫模型进行多点跟踪的有限混合谱图模型
5. Entity Relation Detection with Factorial Hidden Markov Models and Maximum Entropy Discriminant Latent Dirichlet Allocations . [D] . Li, Dingcheng. 2011

机译：因子隐马尔可夫模型与最大熵判别潜在Dirichlet分配的实体关系检测。
6. A Hidden Markov Model for Single Particle Tracks Quantifies Dynamic Interactions between LFA-1 and the Actin Cytoskeleton [O] . Raibatak Das, Christopher W. Cairo, Daniel Coombs 2009

机译：单粒子轨道的隐马尔可夫模型量化了LFA-1和肌动蛋白细胞骨架之间的动态相互作用。
7. Joint tracking and video registration by factorial Hidden Markov models”, ICASSP [O] . Xue Mei, Fatih Porikli 2008

机译：通过阶乘隐马尔可夫模型进行联合跟踪和视频注册”，ICASSP

A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅