首页> 外文会议>European Signal Processing Conference >Enhanced output-based perceptual measure for predicting subjective quality of speech
【24h】

Enhanced output-based perceptual measure for predicting subjective quality of speech

机译:增强的基于输出的感知度量,用于预测语音的主观质量

获取原文

摘要

This paper presents an enhanced version of a non-intrusive measure for assessment of speech quality of voice communication systems and evaluates its performance. The new measure, which uses only the output of the system, is based on measuring perception-based objective auditory distances between voiced parts of the output (processed) speech whose quality is to be evaluated to appropriately matching references extracted from one of four pre-formulated codebooks, depending on their estimated pitch values. The codebooks are formed by optimally clustering large number of parametric speech vectors extracted from a database of clean speech records. The measured auditory distances are then mapped into equivalent subjective Mean Opinion Scores (MOS). The required clustering and matching process was effected by using an efficient data-mining tool known as the Self-Organizing Map (SOM). The short-time Bark Spectrum analysis is used in order to achieve perception-based, speaker-independent parametric representation of the speech. Reported evaluation results show that the proposed enhanced speech quality assessment method provides quality scores that are highly correlated with MOS obtained by formal subjective listening tests.
机译:本文提出了一种非侵入性措施的增强版本,用于评估语音通信系统的语音质量并评估其性能。这项新措施仅使用系统的输出,是基于对输出(处理后)语音的有声部分之间基于感知的客观听觉距离进行测量的,这些语音的质量要进行评估,以适当匹配从四个前置词之一中提取的参考。制定的代码书,取决于它们的估算音高值。通过最佳地聚类从干净语音记录数据库中提取的大量参量语音向量来形成码本。然后,将测得的听觉距离映射到等效的主观平均意见分数(MOS)中。所需的聚类和匹配过程是通过使用一种称为自组织图(SOM)的有效数据挖掘工具来实现的。短时树皮频谱分析用于获得语音的基于感知的,与说话者无关的参数表示。报告的评估结果表明,所提出的增强语音质量评估方法所提供的质量得分与通过正式的主观听力测试获得的MOS高度相关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号