Complexity reduction of versatile video coding standard: a deep learning approach

Zouidi Naima; Belghith Fatma; Kessentini Amina; Masmoudi Nouri

Abstract

The new video coding standard, known as versatile video coding (VVC), is projected to be finalized by the end of 2020. This standard is developed mainly to address 8K videos and emerging applications such as 360° and high dynamic range video. Intraprediction is the part of the prediction step in video coding that exploits spatial redundancy. This module has been improved compared to high-efficiency video coding (HEVC) by increasing the set of angular intraprediction modes (IPM) from 33 to 65 to model directional textures more accurately. Moreover, a quadtree plus binary tree (QTBT) structure replaces the quadtree (QT) of HEVC. These improvements, aimed at enhancing coding efficiency, result in significant coding complexity, especially in terms of encoding time. This paper fits into this context. It addresses the optimization of the intramode and coding unit size decisions using statistical fast-decision methods and deep learning. A fast intramode decision algorithm is proposed for the different binary depths of the QTBT structure. In addition, a deep learning optimization for square blocks is included. Results show that the combination of these two approaches can significantly reduce the complexity of the VVC encoder. Under the all intra (AI) configuration, a reduction of about 61.04% of the intraencoding time is achieved while maintaining an acceptable rate distortion performance.

(c) 2021 SPIE and IS&T [DOI: 10.1117/1.JEI.30.2.023002]

Video traffic continues to grow at a huge rate. According to a Cisco study [1], video consumption will surpass 80% of global IP traffic by 2022. Unsurprisingly, emerging applications such as 360° and high dynamic range video have rapidly gained attention from video consumers and further advanced the shareability of video content. The foreseeable future will also attest to the dominance of beyond ultrahigh definition qualities and high frame rate videos.
Due to this rapid evolution, the need for higher coding efficiency than that of the current standard
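
As a rough illustration of the QTBT structure named in the abstract, the sketch below enumerates the block shapes reachable when quadtree splits are followed by binary splits. It is not the authors' algorithm or the reference encoder's logic. The function names and the limits `min_qt`, `max_bt_depth`, and `min_bt` are hypothetical values chosen only for the example. It shows why binary splits introduce non-square blocks, while square blocks (the ones the abstract pairs with deep learning) remain a small subset.

```python
from typing import List, Set, Tuple

Block = Tuple[int, int]  # (width, height) in luma samples


def quad_split(block: Block) -> List[Block]:
    """Quadtree split: four square children with half the width and height."""
    w, h = block
    return [(w // 2, h // 2)] * 4


def binary_split(block: Block, vertical: bool) -> List[Block]:
    """Binary split: two children, halving the width (vertical) or the height."""
    w, h = block
    return [(w // 2, h)] * 2 if vertical else [(w, h // 2)] * 2


def reachable_shapes(root: Block, min_qt: int = 16,
                     max_bt_depth: int = 2, min_bt: int = 4) -> Set[Block]:
    """Collect every block shape reachable from `root`.

    Assumed rules for this sketch: quadtree splits come first, and once a
    binary split is used no further quadtree split is allowed.
    The limits are illustrative assumptions, not encoder defaults.
    """
    shapes: Set[Block] = set()

    def walk_binary(block: Block, depth: int) -> None:
        shapes.add(block)
        if depth == max_bt_depth:
            return
        w, h = block
        if w // 2 >= min_bt:
            walk_binary(binary_split(block, vertical=True)[0], depth + 1)
        if h // 2 >= min_bt:
            walk_binary(binary_split(block, vertical=False)[0], depth + 1)

    def walk_quad(block: Block) -> None:
        walk_binary(block, 0)  # this quadtree node may be a binary-tree root
        if block[0] // 2 >= min_qt:
            walk_quad(quad_split(block)[0])

    walk_quad(root)
    return shapes


if __name__ == "__main__":
    shapes = reachable_shapes((64, 64))
    square = sorted(s for s in shapes if s[0] == s[1])
    print("total shapes:", len(shapes))
    print("square shapes:", square)
```

With these assumed limits, most reachable shapes are rectangular, which is consistent with the abstract treating the binary depths and the square blocks as separate cases.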


Bibliographic information

Source
Journal of Electronic Imaging | 2021, Issue 2 | 023002.1-023002.22 | 22 pages
Authors
Zouidi Naima; Belghith Fatma; Kessentini Amina; Masmoudi Nouri
Author affiliations

Univ Sfax Natl Sch Engn Lab Elect & Informat Technol Sfax Tunisia;

Univ Sfax Natl Sch Engn Lab Elect & Informat Technol Sfax Tunisia;

Univ Sfax Natl Sch Engn Lab Elect & Informat Technol Sfax Tunisia|Univ Gabes Higher Inst Comp & Multimedia Gabes Gabes Tunisia;

Univ Sfax Natl Sch Engn Lab Elect & Informat Technol Sfax Tunisia;

Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language of text: eng
Keywords
versatile video coding; intraprediction; quadtree plus binary tree; deep learning; complexity reduction;

Date added to database: 2022-08-19 01:58:49

Similar literature

1. CNN-LSTM Learning Approach-Based Complexity Reduction for High-Efficiency Video Coding Standard [J]. Soulef Bouaafia, Randa Khemiri, Amna Maraoui. Scientific Programming, 2021, Issue a.
2. Low-Complexity Error Resilient HEVC Video Coding: A Deep Learning Approach [J]. Taiyu Wang, Fan Li, Xiaoya Qiao. IEEE Transactions on Image Processing, 2021, Issue 1.
3. Video Coding Standards Progress Report: Joint Video Experts Team Launches the Versatile Video Coding Project [J]. Gary J. Sullivan. SMPTE Motion Imaging Journal, 2018, Issue 8.
4. Complexity and Coding Efficiency Assessment of the Versatile Video Coding Standard [C]. Icaro Siqueira, Guilherme Correa, Mateus Grellert. IEEE International Symposium on Circuits and Systems, 2021.
5. Computational Complexity Management of H.264/AVC Video Coding Standard [D]. Solak, Serdar Burak. 2010.
6. SpikeSegNet: a deep learning approach utilizing encoder-decoder network with hourglass for spike segmentation and counting in wheat plant from visual imaging [O]. Tanuj Misra, Alka Arora, Sudeep Marwaha. 2020.
7. Maximum-Entropy-Model-Enabled Complexity Reduction Algorithm in Modern Video Coding Standards [O]. Xiantao Jiang, Tian Song, Takafumi Katayama. 2020.
