【24h】

Noise-robust dynamic time warping using PLCA features

机译:使用PLCA功能的抗噪动态时间扭曲

获取原文

摘要

Conventional speech features, such as mel-frequency cepstral coefficients, tend to perform well in template matching systems, such as dynamic time warping, in low noise conditions. However, they tend to degrade in noisy environments. We propose a method of calculating features using the probabilistic latent component analysis (PLCA) framework. This framework models the speech and noise separately, leading to higher performance in noisy conditions than conventional methods. In this work, we compare our PLCA-based features with conventional features on the task of aligning a high-fidelity speech recording to a noisy speech recording, a scenario common in automatic dialogue replacement.
机译:常规的语音特征(例如mel频率倒谱系数)在低噪声条件下的模板匹配系统(例如动态时间扭曲)中往往表现良好。但是,它们倾向于在嘈杂的环境中退化。我们提出了一种使用概率潜在成分分析(PLCA)框架计算特征的方法。该框架分别对语音和噪声建模,从而在嘈杂条件下比常规方法具有更高的性能。在这项工作中,我们将基于PLCA的功能与传统功能进行了比较,以将高保真语音记录与嘈杂的语音记录对齐,这是自动对话替换中常见的情况。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号