首页> 外文会议>IEEE International Conference on Multimedia Expo Workshops >Deep Learning Based Full-Reference and No-Reference Quality Assessment Models for Compressed UGC Videos
【24h】

Deep Learning Based Full-Reference and No-Reference Quality Assessment Models for Compressed UGC Videos

机译:基于深度学习的压缩UGC视频的全引用和无参考质量评估模型

获取原文

摘要

In this paper, we propose a deep learning based video quality assessment (VQA) framework to evaluate the quality of the compressed user’s generated content (UGC) videos. The proposed VQA framework consists of three modules, the feature extraction module, the quality regression module, and the quality pooling module. For the feature extraction module, we fuse the features from intermediate layers of the convolutional neural network (CNN) network into final quality-aware feature representation, which enables the model to make full use of visual information from low-level to high-level. Specifically, the structure and texture similarities of feature maps extracted from all intermediate layers are calculated as the feature representation for the full reference (FR) VQA model, and the global mean and standard deviation of the final feature maps fused by intermediate feature maps are calculated as the feature representation for the no reference (NR) VQA model. For the quality regression module, we use the fully connected (FC) layer to regress the quality-aware features into frame-level scores. Finally, a subjectively-inspired temporal pooling strategy is adopted to pool frame-level scores into the video-level score. The proposed model achieves the best performance among the state-of-the-art FR and NR VQA models on the Compressed UGC VQA database and also achieves pretty good performance on the in-the-wild UGC VQA databases.
机译:在本文中,我们提出了基于深度学习的视频质量评估(VQA)框架,以评估压缩用户生成的内容(UGC)视频的质量。所提出的VQA框架包括三个模块,特征提取模块,质量回归模块和质量汇集模块。对于特征提取模块,我们将卷积神经网络(CNN)网络的中间层的特征融入最终质量感知功能表示,这使得模型能够充分利用从低级到高级的视觉信息。具体地,从所有中间层提取的特征映射的结构和纹理相似度被计算为完整参考(FR)VQA模型的特征表示,并计算由中间特征映射融合的最终特征映射的全局均值和标准偏差作为No Reference(NR)VQA模型的特征表示。对于质量回归模块,我们使用完全连接的(FC)层将质量感知功能重新播放到帧级别分数。最后,采用主主主动池汇总策略来汇集帧级别分数进入视频级分数。该建议的模型在压缩的UGC VQA数据库上实现了最先进的FR和NR VQA模型中的最佳性能,并且还可以在野外的UGC VQA数据库上实现了很好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号