首页> 外文会议>European Signal Processing Conference >ENHANCED OUTPUT-BASED PERCEPTUAL MEASURE FOR PREDICTING SUBJECTIVE QUALITY OF SPEECH
【24h】

ENHANCED OUTPUT-BASED PERCEPTUAL MEASURE FOR PREDICTING SUBJECTIVE QUALITY OF SPEECH

机译:基于增强的输出的感知措施,以预测主观语音质量

获取原文

摘要

This paper presents an enhanced version of a non-intrusive measure for assessment of speech quality of voice communication systems and evaluates its performance. The new measure, which uses only the output of the system, is based on measuring perception-based objective auditory distances between voiced parts of the output (processed) speech whose quality is to be evaluated to appropriately matching references extracted from one of four pre-formulated codebooks, depending on their estimated pitch values. The codebooks are formed by optimally clustering large number of parametric speech vectors extracted from a database of clean speech records. The measured auditory distances are then mapped into equivalent subjective Mean Opinion Scores (MOS). The required clustering and matching process was effected by using an efficient data-mining tool known as the Self-Organizing Map (SOM). The short-time Bark Spectrum analysis is used in order to achieve perception-based, speaker-independent parametric representation of the speech. Reported evaluation results show that the proposed enhanced speech quality assessment method provides quality scores that are highly correlated with MOS obtained by formal subjective listening tests.
机译:本文介绍了语音通信系统的语音质量的非侵入性措施的增强版本,并评估其性能。仅使用系统输出的新措施是基于测量输出(处理)语音的浊音部分之间的基于感知的客观听觉距离,其质量被评估为适当匹配从四个预先提取的引用提取的参考配制码本,取决于其估计的音高值。通过最佳聚类从清洁语音记录数据库中提取的大量参数语音向量来形成码本。然后将测量的听觉距离映射到等同的主观平均意见分数(MOS)。通过使用称为自组织地图(SOM)的有效数据挖掘工具来实现所需的聚类和匹配过程。使用短时间BARK谱分析来实现基于感知的扬声器的扬声器的参数表示的语音。报告的评估结果表明,建议的增强型演讲质量评估方法提供了与正式主观听力测试获得的MOS高度相关的质量分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号