首页> 外文会议>International Conference on Quality of Multimedia Experience >Prenc — Predict Number of Video Encoding Passes with Machine Learning
【24h】

Prenc — Predict Number of Video Encoding Passes with Machine Learning

机译:Prenc —通过机器学习预测视频编码通过的次数

获取原文

摘要

Video streaming providers spend huge amounts of processing time to get a quality-optimized encoding. While the quality-related impact may be known to the service provider, the impact on video quality is hard to assess, when no reference is available. Here, bitstream-based video quality models may be applicable, delivering estimates that include encoding-specific settings. Such models typically use several input parameters, e.g. bitrate, framerate, resolution, video codec, QP values and more. However, for a given bitstream, to determine which encoding parameters were selected, e.g., the number of encoding passes, is not a trivial task. This leads to our following research question: Given an unknown video bitstream, which encoding settings have been used? To tackle this reverse engineering problem, we introduce a system called prenc. Besides the use in video-quality estimation, such algorithms may also be used in other applications such as video forensics. We prove our concept by applying prenc to distinguish between one- and two-pass encoding. Starting from modeling the problem as a classification task, estimating bitstream-based features, we further describe a machine learning approach with feature selection to automatically predict the number of encoding passes for a given video bitstream. Our large-scale evaluation consists of 16 short movie type 4K videos that were segmented and encoded with different settings (resolutions, codecs, bitrates), so that we in total analyzed 131.976 DASH video segments. We further show that our system is robust, based on a 50% train and 50% validation approach without source video overlapping, where we get a classification performance of 65% F1 score. Moreover, we also describe the used bitstream-based features in detail, the feature pooling strategy and include other machine learning algorithms in our evaluation.
机译:视频流提供商会花费大量的处理时间来获得质量优化的编码。尽管服务提供商可能知道与质量相关的影响,但是在没有参考可用的情况下,很难评估对视频质量的影响。在此,基于比特流的视频质量模型可能适用,可提供包含特定于编码的设置的估计。这样的模型通常使用几个输入参数,例如。比特率,帧率,分辨率,视频编解码器,QP值等。但是,对于给定的比特流,确定选择哪些编码参数,例如,编码通过的次数,并不是一件容易的事。这就引出了我们下面的研究问题:给定未知的视频比特流,使用了哪种编码设置?为了解决这个逆向工程问题,我们引入了一个称为prenc的系统。除了用于视频质量估计之外,此类算法还可以用于其他应用程序中,例如视频取证。我们通过使用prenc来区分一遍和两遍编码来证明我们的概念。从将问题建模为分类任务开始,估计基于比特流的特征,我们进一步描述一种具有特征选择的机器学习方法,以自动预测给定视频比特流的编码次数。我们的大规模评估包括16个短电影类型的4K视频,这些视频通过不同的设置(分辨率,编解码器,比特率)进行分段和编码,因此我们总共分析了131.976个DASH视频片段。我们进一步表明,基于50%的训练和50%的验证方法(没有源视频重叠),我们的系统是可靠的,在F1评分中,我们的分类性能为65%。此外,我们还将详细描述所使用的基于比特流的功能,功能池策略,并在我们的评估中包括其他机器学习算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号