Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression

Darwish Saad M.; Almajtomi Ahmed A. J.

摘要

Video compression has great significance in the communication of motion pictures. Video compression techniques try to remove the different types of redundancy within or between video sequences. In the temporal domain, the video compression techniques remove the redundancies between the highly correlated consequence frames of the video. In the spatial domain, the video compression techniques remove the redundancies between the highly correlated consequence pixels (samples) in the same frame. Evolving neural-networks based video coding research efforts are focused on improving existing video codecs by performing better predictions that are incorporated within the same codec framework or holistic methods of end-to-end video compression schemes. Current neural network-based video compression adapts static codebook to achieve compression that leads to learning inability from new samples. This paper proposes a modified video compression model that adapts the genetic algorithm to build an optimal codebook for adaptive vector quantization that is used as an activation function inside the neural network's hidden layer. Background subtraction algorithm is employed to extract motion objects within frames to generate the context-based initial codebook. Furthermore, Differential Pulse Code Modulation (DPCM) is utilized for lossless compression of significant wavelet coefficients; whereas low energy coefficients are lossy compressed using Learning Vector Quantization (LVQ) neural networks. Finally, Run Length Encoding (RLE) is engaged to encode the quantized coefficients to achieve a higher compression ratio. Experiments have proven the system's ability to achieve higher compression ratio with acceptable efficiency measured by PSNR.

机译：视频压缩在运动图片的通信方面具有重要意义。视频压缩技术尝试在视频序列内或在视频序列之间删除不同类型的冗余。在时间域中，视频压缩技术消除了视频的高度相关后果帧之间的冗余。在空间域中，视频压缩技术将在同一帧中的高度相关后果像素（样本）之间取出冗余。基于神经网络的视频编码研究工作集中在通过执行在端到端视频压缩方案的相同编解码器框架或整体方法内的更好的预测来改善现有的视频编解码器。目前基于神经网络的视频压缩适应静态码本以实现压缩，从而导致从新样本中学习无法实现。本文提出了一种修改的视频压缩模型，它适应遗传算法来构建用于自适应矢量量化的最佳码本，其用作神经网络隐藏层内的激活函数。背景下减法算法用于提取帧内的运动对象以生成基于上下文的初始码本。此外，差分脉冲码调制（DPCM）用于显着小波系数的无损压缩;而低能量系数是使用学习矢量量化（LVQ）神经网络压缩的有损压缩。最后，运行长度编码（RLE）与编码量化系数进行编码以实现更高的压缩比。实验证明了系统能够以PSNR测量的可接受效率实现更高的压缩比。

Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression

摘要

著录项

相关主题

期刊订阅