首页> 外文OA文献 >Towards Single-Channel Unsupervised Source Separation of Speech Mixtures: The Layered Harmonics/Formants Separation-Tracking Model
【2h】

Towards Single-Channel Unsupervised Source Separation of Speech Mixtures: The Layered Harmonics/Formants Separation-Tracking Model

机译:迈向混合语音的单通道无监督源分离:分层谐波/共振峰分离跟踪模型

摘要

Speaker models for blind source separation are typically based on HMMs consisting of vast numbers of states to capture source spectral variation, and trained on large amounts of isolated speech. Since observations can be similar between sources, inference relies on sequential constraints from the state transition matrix which are, however, quite weak. To avoid these problems, we propose a strategy of capturing local deformations of the time-frequency energy distribution. Since consecutive spectral frames are highly correlated, each frame can be accurately described as a nonuniform deformation of its predecessor. A smooth pattern of deformations is indicative of a single speaker, and the cliffs in the deformation fields may indicate a speaker switch. Further, the log-spectrum of speech can be decomposed into two additive layers, separately describing the harmonics and formant structure. We model smooth deformations as hidden transformation variables in both layers, using MRFs with overlapping subwindows as observations, assumed to be a noisy sum of the two layers. Loopy belief propagation provides for efficient inference. Without any pre-trained speech or speaker models, this approach can be used to fill in missing time-frequency observations, and the local entropy of the deformation fields indicate source boundaries for separation.
机译:用于盲源分离的扬声器模型通常基于HMM,该HMM由大量状态组成,以捕获源频谱变化,并针对大量孤立语音进行训练。由于源之间的观察结果可能相似,因此推论依赖于状态转换矩阵中的顺序约束,但条件约束相当弱。为了避免这些问题,我们提出了一种捕获时频能量分布局部变形的策略。由于连续的光谱帧高度相关,因此每个帧可以准确地描述为其前身的不均匀变形。变形的平滑模式指示单个扬声器,并且变形字段中的悬崖可能指示扬声器开关。此外,语音的对数谱可以分解为两个加法层,分别描述谐波和共振峰结构。我们使用两个子窗口重叠的MRF作为观察值,将平滑变形建模为两层中的隐藏变换变量,并假设这是两层的噪声总和。循环的信念传播提供了有效的推断。在没有任何预先训练的语音或说话者模型的情况下,该方法可用于填充缺少的时频观测值,并且变形场的局部熵指示分离的源边界。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号