Journal: IEEE Transactions on Audio, Speech and Language Processing

PEMO-Q—A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception



Abstract

A new method for the objective assessment and prediction of perceived audio quality is introduced. It represents an expansion of the speech quality measure $q_C$, introduced by Hansen and Kollmeier, and is based on a psychoacoustically validated, quantitative model of the “effective” peripheral auditory processing by Dau et al. To evaluate the audio quality of a given distorted signal relative to a corresponding high-quality reference signal, the auditory model is employed to compute “internal representations” of the signals, which are partly assimilated in order to account for assumed cognitive aspects. The linear cross correlation coefficient of the assimilated internal representations represents the perceptual similarity measure (PSM). PSM shows good correlations with subjective quality ratings if different types of audio signals are considered separately, whereas a better accuracy of signal-independent quality prediction is achieved by a second quality measure, $PSM_t$, represented by the fifth percentile of the sequence of instantaneous audio quality $PSM(t)$. The new measures were evaluated using a large database of subjective listening tests that were originally carried out on behalf of the International Telecommunication Union (ITU) and Moving Picture Experts Group (MPEG) for the evaluation of various low bit-rate audio codecs. Additional tests with data unknown in the development phase of the model were carried out. Except for linear distortions, the new method shows a higher prediction accuracy than the ITU-R recommendation BS.1387 (“PEAQ”) for the tested data.
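
The two measures named in the abstract can be illustrated with a minimal sketch. It is not the PEMO-Q implementation: it assumes the internal representations have already been computed by some auditory model and flattened into plain lists of numbers, it omits the assimilation step, and it obtains the sequence $PSM(t)$ by correlating non-overlapping frames. The function names and the `frame_len` parameter are hypothetical choices made for this illustration only.

```python
import math
from typing import List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Linear (Pearson) cross-correlation coefficient of two sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences of at least 2 samples")
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0.0 or syy == 0.0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


def percentile(values: Sequence[float], q: float) -> float:
    """q-th percentile (0..100), linearly interpolated between sorted samples."""
    if not values:
        raise ValueError("need at least one value")
    s = sorted(values)
    pos = (len(s) - 1) * q / 100.0
    lo, hi = math.floor(pos), math.ceil(pos)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)


def psm(ref_repr: Sequence[float], test_repr: Sequence[float]) -> float:
    """Overall similarity: correlation of the two internal representations."""
    return pearson(ref_repr, test_repr)


def psm_t(ref_repr: Sequence[float], test_repr: Sequence[float],
          frame_len: int) -> float:
    """Fifth percentile of a frame-wise similarity sequence.

    Assumption: PSM(t) is approximated by correlating non-overlapping frames
    of `frame_len` samples; the framing used in the paper may differ.
    """
    n = min(len(ref_repr), len(test_repr))
    frames: List[float] = [
        pearson(ref_repr[i:i + frame_len], test_repr[i:i + frame_len])
        for i in range(0, n - frame_len + 1, frame_len)
    ]
    return percentile(frames, 5.0)
```

Taking a low percentile rather than the mean lets the worst-sounding moments dominate the score, which fits the abstract's statement that $PSM_t$ predicts quality more accurately across signal types. Frames in which either representation is constant raise an error in this sketch and would need separate handling in practice.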
