首页> 外文会议>Asia-Pacific Signal and Information Processing Association Annual Summit and Conference >Boosting DNN-based speech enhancement via explicit transformations
【24h】

Boosting DNN-based speech enhancement via explicit transformations

机译:通过显式转换促进基于DNN的语音增强

获取原文

摘要

In this study, we investigate on the learning behaviors of DNN by explicit feature transformations. As a demonstration, linear and logarithm transformations, corresponding to the amplitude spectra and log-power spectra, are compared with the same minimum mean squared error (MMSE) objective function for optimizing DNN parameters. Based on the experimental analysis of the DNN learning behaviors, we make an interesting observation that the learning with the amplitude spectra tends to improve the speech intelligibility while the learning with the log-power spectra yields better speech quality. By leveraging on this strong complementarity, the feature concatenation with two transformations for the input layer and post-processing with two learned targets are proposed to boost DNN-based speech enhancement.
机译:在这项研究中,我们通过显式特征转换来研究DNN的学习行为。作为演示,将与振幅谱和对数功率谱相对应的线性和对数转换与相同的最小均方误差(MMSE)目标函数进行比较,以优化DNN参数。通过对DNN学习行为的实验分析,我们得出了一个有趣的观察结果:使用幅度谱学习会提高语音清晰度,而使用对数功率谱学习会产生更好的语音质量。通过利用这种强大的互补性,提出了针对输入层进行两次转换的特征级联,以及针对两个学习到的目标进行后处理的功能,以增强基于DNN的语音增强。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号