首页> 外文会议>International Conference on Quality of Multimedia Experience >Towards the development of a non-intrusive objective quality measure for DNN-enhanced speech
【24h】

Towards the development of a non-intrusive objective quality measure for DNN-enhanced speech

机译:面向DNN增强语音的非侵入性客观质量度量的发展

获取原文

摘要

Recently, several works have focused on leveraging the advances of deep neural networks (DNN) to a variety of domains, including speech enhancement. While advances in instrumental quality metrics have been made, particularly for enhanced speech, there is still relatively little research assessing how useful such metrics are for DNN-enhanced speech. This work aims to fill this gap. We performed online listening tests using the outputs of three different DNN-based speech enhancement models for both denoising and dereverberation. When assessing the predictive power of several objective metrics, we found that existing non-intrusive methods fail at monitoring signal quality. To overcome this limitation, we propose a new metric based on a combination of a handful of relevant acoustic features. Results inline with those obtained with intrusive measures are then attained. In a leave-one-model-out test, the proposed non-intrusive metric is also shown to outperform two non-intrusive benchmarks for all three DNN enhancement methods, showing the proposed method is capable of generalizing to unseen models.
机译:最近,有几项工作集中在利用深度神经网络(DNN)的先进技术来扩展各个领域,包括语音增强。尽管在仪器质量指标方面取得了进步,尤其是在增强语音方面,但评估这种指标对DNN增强语音的有用性的研究仍相对较少。这项工作旨在填补这一空白。我们使用三种不同的基于DNN的语音增强模型的输出进行了在线听力测试,以进行去噪和去混响。在评估几个客观指标的预测能力时,我们发现现有的非侵入性方法无法监测信号质量。为克服此限制,我们提出了一种基于少数相关声学特征的组合的新度量标准。然后获得与通过侵入性措施获得的结果一致的结果。在“留一模型退出”测试中,对于所有三种DNN增强方法,拟议的非侵入式指标也表现出优于两个非侵入式基准,表明拟议的方法能够推广到看不见的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号