Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding

Wenrui Dai; Yangmei Shen; Xin Tang; Junni Zou; Hongkai Xiong; Chang Wen Chen

首页> 外文期刊>IEEE Transactions on Image Processing >Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding

【24h】

Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding

机译：时空在线字典学习的稀疏表示法用于有希望的视频编码

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Classical dictionary learning methods for video coding suffer from high computational complexity and interfered coding efficiency by disregarding its underlying distribution. This paper proposes a spatio-temporal online dictionary learning (STOL) algorithm to speed up the convergence rate of dictionary learning with a guarantee of approximation error. The proposed algorithm incorporates stochastic gradient descents to form a dictionary of pairs of 3D low-frequency and high-frequency spatio-temporal volumes. In each iteration of the learning process, it randomly selects one sample volume and updates the atoms of dictionary by minimizing the expected cost, rather than optimizes empirical cost over the complete training data, such as batch learning methods, e.g., K-SVD. Since the selected volumes are supposed to be independent identically distributed samples from the underlying distribution, decomposition coefficients attained from the trained dictionary are desirable for sparse representation. Theoretically, it is proved that the proposed STOL could achieve better approximation for sparse representation than K-SVD and maintain both structured sparsity and hierarchical sparsity. It is shown to outperform batch gradient descent methods (K-SVD) in the sense of convergence speed and computational complexity, and its upper bound for prediction error is asymptotically equal to the training error. With lower computational complexity, extensive experiments validate that the STOL-based coding scheme achieves performance improvements than H.264/AVC or High Efficiency Video Coding as well as existing super-resolution-based methods in rate-distortion performance and visual quality.

机译：用于视频编码的经典词典学习方法由于忽略了其基础分布而遭受了高计算复杂度和编码效率的困扰。提出了一种时空在线词典学习算法，在保证近似误差的前提下，加快了词典学习的收敛速度。提出的算法结合了随机梯度下降来形成3D低频和高频时空体积对的字典。在学习过程的每一次迭代中，它都会随机选择一个样本量并通过使预期成本最小化来更新字典的原子，而不是在诸如批量学习方法（例如K-SVD）之类的完整训练数据上优化经验成本。由于假定所选择的体积是来自基础分布的独立的相同分布的样本，所以对于稀疏表示，希望从训练后的字典获得分解系数。从理论上证明，所提出的STOL可以比K-SVD获得更好的稀疏表示近似，并同时保持结构稀疏性和层次稀疏性。从收敛速度和计算复杂度的角度来看，它表现出优于批次梯度下降法（K-SVD），并且其预测误差的上限渐近等于训练误差。以较低的计算复杂度，大量实验证明，基于STOL的编码方案比H.264 / AVC或高效视频编码以及现有的基于超分辨率的方法在码率失真性能和视觉质量上均实现了性能改进。

著录项

来源
《IEEE Transactions on Image Processing》 |2016年第10期|4580-4595|共16页
作者
Wenrui Dai; Yangmei Shen; Xin Tang; Junni Zou; Hongkai Xiong; Chang Wen Chen;
展开▼
作者单位

Department of Biomedical Informatics, University of California at San Diego, La Jolla, CA, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
K-SVD; Online dictionary learning; sparse representation; stochastic gradient descent; video coding;

机译：K-SVD在线字典学习稀疏表示随机梯度下降视频编码;

相似文献

外文文献
中文文献
专利

1. Sparse Spatio-Temporal Representation With Adaptive Regularized Dictionary Learning for Low Bit-Rate Video Coding [J] . Xiong H., Pan Z., Ye X., Circuits and Systems for Video Technology, IEEE Transactions on . 2013,第4期

机译：低比特率视频编码的自适应正则字典学习的稀疏时空表示
2. Low bit-rate SNR scalable video coding based on overcomplete dictionary learning and sparse representation [J] . Maziar Irannejad, Homayoun Mahdavi-Nasab Multidimensional systems and signal processing . 2020,第2期

机译：低比特率SNR可伸缩视频编码，基于过度顺序字典学习和稀疏表示
3. Sparse Codes Auto-Extractor for Classification: A Joint Embedding and Dictionary Learning Framework for Representation [J] . Zhao Zhang, Fanzhang Li, Tommy W. S. Chow, IEEE Transactions on Signal Processing . 2016,第14期

机译：分类的稀疏代码自动提取器：用于表示的联合嵌入和字典学习框架
4. Online dictionary learning based intra-frame video coding via sparse representation [C] . Sun Yipeng, Xu Mai, Tao Xiaoming, 2012 15th International Symposium on Wireless Personal Multimedia Communications. . 2012

机译：基于稀疏表示的基于在线字典学习的帧内视频编码
5. Semi-Blind Source Separation via Sparse Representations and Online Dictionary Learning. [D] . Rambhatla, Sirisha. 2012

机译：通过稀疏表示和在线词典学习进行半盲源分离。
6. Sparse coding and dictionary learning for spike trains to find spatio-temporal patterns [O] . Taro Tezuka 2015

机译：穗序列的稀疏编码和字典学习以找到时空模式
7. Semi-blind Source Separation via Sparse Representations and Online Dictionary Learning [O] . Rambhatla, Sirisha, Haupt, Jarvis D. 2015

机译：通过稀疏表示和在线的半盲源分离字典学习
8. Online Dictionary Learning for Sparse Coding [R] . Mairal, J., Bach, F., Ponce, J., 2009

机译：稀疏编码的在线词典学习

Sparse Representation With Spatio-Temporal Online Dictionary Learning for Promising Video Coding

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅