Prenc — Predict Number of Video Encoding Passes with Machine Learning

机译：Prenc —通过机器学习预测视频编码通过的次数

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Video streaming providers spend huge amounts of processing time to get a quality-optimized encoding. While the quality-related impact may be known to the service provider, the impact on video quality is hard to assess, when no reference is available. Here, bitstream-based video quality models may be applicable, delivering estimates that include encoding-specific settings. Such models typically use several input parameters, e.g. bitrate, framerate, resolution, video codec, QP values and more. However, for a given bitstream, to determine which encoding parameters were selected, e.g., the number of encoding passes, is not a trivial task. This leads to our following research question: Given an unknown video bitstream, which encoding settings have been used? To tackle this reverse engineering problem, we introduce a system called prenc. Besides the use in video-quality estimation, such algorithms may also be used in other applications such as video forensics. We prove our concept by applying prenc to distinguish between one- and two-pass encoding. Starting from modeling the problem as a classification task, estimating bitstream-based features, we further describe a machine learning approach with feature selection to automatically predict the number of encoding passes for a given video bitstream. Our large-scale evaluation consists of 16 short movie type 4K videos that were segmented and encoded with different settings (resolutions, codecs, bitrates), so that we in total analyzed 131.976 DASH video segments. We further show that our system is robust, based on a 50% train and 50% validation approach without source video overlapping, where we get a classification performance of 65% F1 score. Moreover, we also describe the used bitstream-based features in detail, the feature pooling strategy and include other machine learning algorithms in our evaluation.

机译：视频流提供商会花费大量的处理时间来获得质量优化的编码。尽管服务提供商可能知道与质量相关的影响，但是在没有参考可用的情况下，很难评估对视频质量的影响。在此，基于比特流的视频质量模型可能适用，可提供包含特定于编码的设置的估计。这样的模型通常使用几个输入参数，例如。比特率，帧率，分辨率，视频编解码器，QP值等。但是，对于给定的比特流，确定选择哪些编码参数，例如，编码通过的次数，并不是一件容易的事。这就引出了我们下面的研究问题：给定未知的视频比特流，使用了哪种编码设置？为了解决这个逆向工程问题，我们引入了一个称为prenc的系统。除了用于视频质量估计之外，此类算法还可以用于其他应用程序中，例如视频取证。我们通过使用prenc来区分一遍和两遍编码来证明我们的概念。从将问题建模为分类任务开始，估计基于比特流的特征，我们进一步描述一种具有特征选择的机器学习方法，以自动预测给定视频比特流的编码次数。我们的大规模评估包括16个短电影类型的4K视频，这些视频通过不同的设置（分辨率，编解码器，比特率）进行分段和编码，因此我们总共分析了131.976个DASH视频片段。我们进一步表明，基于50％的训练和50％的验证方法（没有源视频重叠），我们的系统是可靠的，在F1评分中，我们的分类性能为65％。此外，我们还将详细描述所使用的基于比特流的功能，功能池策略，并在我们的评估中包括其他机器学习算法。

著录项

来源
《International Conference on Quality of Multimedia Experience》|2020年|1-6|共6页
会议地点
作者
Steve Göring; Rakesh Rao Ramachandra Rao; Alexander Raake;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
video encoding; video features; video quality; machine learning;

机译：视频编码;视频功能;视频质量;机器学习;

相似文献

外文文献
中文文献
专利

1. Performance analysis of machine learning for arbitrary downsizing of pre-encoded HEVC video [J] . Van Luong Pham, De Praeter Johan, Van Wallendael Glenn, Consumer Electronics, IEEE Transactions on . 2015,第4期

机译：机器学习的性能分析，可任意缩小预编码的HEVC视频
2. Comparison of logistic regression, support vector machines, and deep learning classifiers for predicting memory encoding success using human intracranial EEG recordings [J] . Akshay Arora, Jui-Jui Lin, Alec Gasperian, Journal of neural engineering . 2018,第6期

机译：使用人类颅内脑电图记录预测记忆编码成功的逻辑回归，支持向量机和深度学习分类器的比较
3. Predicting split decisions of coding units in HEVC video compression using machine learning techniques [J] . Hassan Mahitab, Shanableh Tamer Multimedia Tools and Applications . 2019,第23期

机译：使用机器学习技术预测HEVC视频压缩中编码单元的分割决策
4. A Computationally Efficient Model for Predicting Successful Memory Encoding Using Machine-Learning-based EEG Channel Selection [C] . Krishnakant V. Saboo, Yogatheesan Varatharajah, Brent M. Berry, International IEEE/EMBS Conference on Neural Engineering . 2019

机译：使用基于机器学习的EEG通道选择预测成功的内存编码的计算有效模型
5. Low complexity H.264 video encoder design using machine learning techniques. [D] . Carrillo, Paula. 2008

机译：使用机器学习技术的低复杂度H.264视频编码器设计。
6. QUATgo: Protein quaternary structural attributes predicted by two-stage machine learning approaches with heterogeneous feature encoding [O] . Chi-Hua Tung, Ching-Hsuan Chien, Chi-Wei Chen, 2020

机译：Quatgo：由异构特征编码的两级机器学习方法预测的蛋白质四季结构属性
7. Hype versus hope: Deep learning encodes more predictive and robust brain imaging representations than standard machine learning [O] . Anees Abrol, Zening Fu, Mustafa Salman, 2020

机译：炒作与希望：深度学习编码比标准机器学习更具预测性和强大的脑成像表示

Prenc — Predict Number of Video Encoding Passes with Machine Learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅