首页> 外文会议>2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays >Content-adaptive speech enhancement by a sparsely-activated dictionary plus low rank decomposition
【24h】

Content-adaptive speech enhancement by a sparsely-activated dictionary plus low rank decomposition

机译:稀疏激活字典加低秩分解的内容自适应语音增强

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

One powerful approach to speech enhancement employs strong models for both speech and noise, decomposing a mixture into the most likely combination. But if the noise encountered differs significantly from the system's assumptions, performance will suffer. In previous work, we proposed a speech enhancement model that decomposes the spectrogram into sparse activation of a dictionary of target speech templates, and a low-rank background model. This makes few assumptions about the noise, and gave appealing results on small excerpts of noisy speech. However, when processing whole conversations, the foreground speech may vary in its complexity and may be unevenly distributed throughout the recording, resulting in inaccurate decompositions for some segments. In this paper, we explore an adaptive formulation of our previous model that incorporates separate side information to guide the decomposition, making it able to better process entire conversations that may exhibit large variations in the speech content.
机译:一种强大的语音增强方法采用强大的语音和噪声模型,将混合分解为最可能的组合。但是,如果遇到的噪声与系统的假设有很大不同,则会降低性能。在以前的工作中,我们提出了一种语音增强模型和一个低秩背景模型,该模型将频谱图分解为目标语音模板字典的稀疏激活。这几乎没有关于噪声的假设,并且在嘈杂的语音的小片段中给出了有吸引力的结果。但是,在处理整个对话时,前景语音的复杂度可能会有所不同,并且可能会在整个录制过程中分布不均,从而导致某些片段的分解不准确。在本文中,我们探索了先前模型的自适应公式,该公式结合了单独的辅助信息以指导分解,从而使其能够更好地处理可能显示语音内容差异很大的整个对话。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号