首页> 外文期刊>IEEE transactions on audio, speech and language processing >Feature Extraction for the Prediction of Multichannel Spatial Audio Fidelity
【24h】

Feature Extraction for the Prediction of Multichannel Spatial Audio Fidelity

机译:用于多通道空间音频保真度预测的特征提取

获取原文
获取原文并翻译 | 示例
           

摘要

This paper seeks to present an algorithm for the prediction of frontal spatial fidelity and surround spatial fidelity of multichannel audio, which are two attributes of the subjective parameter called basic audio quality. A number of features chosen to represent spectral and spatial changes were extracted from a set of recordings and used in a regression model as independent variables for the prediction of spatial fidelities. The calibration of the model was done by ridge regression using a database of scores obtained from a series of formal listening tests. The statistically significant features based on interaural cross correlation and spectral features found from an initial model were employed to build a simplified model and these selected features were validated. The results obtained from the validation experiment were highly correlated with the listening test scores and had a low standard error comparable to that encountered in typical listening tests. The applicability of the developed algorithm is limited to predicting the basic audio quality of low-pass filtered and down-mixed recordings (as obtained in listening tests based on a multistimulus test paradigm with reference and two anchors: a 3.5-kHz low-pass filtered signal and a mono signal).
机译:本文试图提出一种预测多声道音频的正面空间保真度和周围空间保真度的算法,这是主观参数(称为基本音频质量)的两个属性。从一组记录中提取了一些代表光谱和空间变化的特征,并在回归模型中用作预测空间逼真度的自变量。使用从一系列正式听力测试中获得的分数数据库,通过山脊回归对模型进行校准。基于耳间互相关和从初始模型中发现的光谱特征的统计学显着特征被用于构建简化模型,并且对这些选定特征进行了验证。从验证实验获得的结果与听力测试成绩高度相关,并且具有与典型听力测试相当的低标准误差。所开发算法的适用范围仅限于预测低通滤波和降混录音的基本音频质量(如在听音测试中获得的,该测试基于具有参考和两个锚点的多激励测试范例:3.5 kHz低通滤波)信号和单声道信号)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号