How Deep is Your Encoder: An Analysis of Features Descriptors for an Autoencoder-Based Audio-Visual Quality Metric

机译：您的编码器有多深：基于自动编码器的视听质量指标的功能描述符分析

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The development of audio-visual quality assessment models poses a number of challenges in order to obtain accurate predictions. One of these challenges is the modelling of the complex interaction that audio and visual stimuli have and how this interaction is interpreted by human users. The No-Reference Audio-Visual Quality Metric Based on a Deep Autoencoder (NAViDAd) deals with this problem from a machine learning perspective. The metric receives two sets of audio and video features descriptors and produces a low-dimensional set of features used to predict the audio-visual quality. A basic implementation of NAViDAd was able to produce accurate predictions tested with a range of different audio-visual databases. The current work performs an ablation study on the base architecture of the metric. Several modules are removed or re-trained using different configurations to have a better understanding of the metric functionality. The results presented in this study provided important feedback that allows us to understand the real capacity of the metric's architecture and eventually develop a much better audio-visual quality metric.

机译：为了获得准确的预测，视听质量评估模型的发展提出了许多挑战。这些挑战之一是对音频和视觉刺激所具有的复杂交互以及人类用户如何解释这种交互进行建模。基于深度自动编码器（NAViDAd）的无引用视听质量度量标准从机器学习的角度解决了此问题。度量标准接收两组音频和视频特征描述符，并生成用于预测视听质量的低维特征集。 NAViDAd的基本实现能够产生经过一系列不同视听数据库测试的准确预测。当前的工作是对度量的基础架构进行消融研究。使用不同的配置删除或重新训练了几个模块，以更好地了解度量功能。这项研究中提供的结果提供了重要的反馈，使我们能够了解度量标准体系结构的实际容量，并最终开发出更好的视听质量度量标准。

著录项

来源
《International Conference on Quality of Multimedia Experience》|2020年|1-6|共6页
会议地点
作者
Helard Martinez; Andrew Hines; Mylène C. Q. Farias;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
audio-visual; quality metrics; autoencoder; qoe; machine learning;

机译：视听;质量指标;自动编码器;问题;机器学习;

相似文献

外文文献
中文文献
专利

1. Analysis of feature detector and descriptor combinations with a localization experiment for various performance metrics [J] . ERTU?RUL BAYRAKTAR, PINAR BOYRAZ Turkish Journal of Electrical Engineering and Computer Sciences . 2017,第3期

机译：使用各种性能指标的定位实验分析特征检测器和描述符组合
2. Video-Based Depression Level Analysis by Encoding Deep Spatiotemporal Features [J] . Al Jazaery Mohamad, Guo Guodong Affective Computing, IEEE Transactions on . 2021,第1期

机译：基于视频的抑郁级分析通过编码深蓝色的特征
3. Deep Air Learning: Interpolation, Prediction, and Feature Analysis of Fine-Grained Air Quality [J] . Zhongang Qi, Tianchun Wang, Guojie Song, IEEE Transactions on Knowledge and Data Engineering . 2018,第12期

机译：深度空气学习：细粒度空气质量的插值，预测和特征分析
4. Analysis of Gradient Degradation and Feature Map Quality in Deep All-Convolutional Neural Networks Compared to Deep Residual Networks [C] . Wei Gao, Mark D. McDonnell International conference on neural information processing . 2017

机译：深度全卷积神经网络与深度残差网络相比的梯度退化和特征图质量分析
5. Pattern encoding algorithms and information modeling metrics for network quality of service [D] . Vespa, Lucas 2011

机译：网络服务质量的模式编码算法和信息建模指标
6. Automatic summarization of soccer highlights using audio-visual descriptors [O] . A Raventós, R Quijada, Luis Torres, -1

机译：使用视听描述符自动汇总足球精彩片段
7. Analysis of feature detector and descriptor combinations with a localization experiment for various performance metrics [O] . Bayraktar, Ertugrul, Boyraz, Pinar 2017

机译：特征检测器和描述符组合的分析各种性能指标的本地化实验

How Deep is Your Encoder: An Analysis of Features Descriptors for an Autoencoder-Based Audio-Visual Quality Metric

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅