首页> 外文期刊>IEEE transactions on audio, speech and language processing >An Objective Metric of Human Subjective Audio Quality Optimized for a Wide Range of Audio Fidelities
【24h】

An Objective Metric of Human Subjective Audio Quality Optimized for a Wide Range of Audio Fidelities

机译:针对各种音频保真度而优化的人类主观音频质量的客观指标

获取原文
获取原文并翻译 | 示例

摘要

The goal of this paper is to develop an audio quality metric that can accurately quantify subjective quality over audio fidelities ranging from highly impaired to perceptually lossless. As one example of its utility, such a metric would allow scalable audio coding algorithms to be easily optimized over their entire operating ranges. We have found that the ITU-recommended objective quality metric, ITU-R BS.1387, does not accurately predict subjective audio quality over the wide range of fidelity levels of interest to us. In developing the desired universal metric, we use as a starting point the model output variables (MOVs) that make up BS.1387 as well as the energy equalization truncation threshold which has been found to be particularly useful for highly impaired audio. To combine these MOVs into a single quality measure that is both accurate and robust, we have developed a hybrid least-squares/minimax optimization procedure. Our test results show that the minimax-optimized metric is up to 36% lower in maximum absolute error compared to a similar metric designed using the conventional least-squares procedure.
机译:本文的目的是开发一种音频质量度量标准,该度量标准可以准确地量化从严重受损到感知无损的音频保真度上的主观质量。作为其实用性的一个示例,这种度量将允许可伸缩音频编码算法在其整个工作范围内轻松优化。我们发现,ITU推荐的客观质量度量标准ITU-R BS.1387不能准确地预测我们感兴趣的各种保真度水平上的主观音频质量。在制定所需的通用度量标准时,我们将构成BS.1387的模型输出变量(MOV)以及能量均衡截断阈值用作起点,这对于严重受损的音频特别有用。为了将这些MOV组合成既准确又可靠的单个质量度量,我们开发了一种混合最小二乘/最小极大值优化程序。我们的测试结果表明,与使用常规最小二乘法设计的类似度量标准相比,最小最大优化度量标准的最大绝对误差降低了36%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号