ACM Transactions on Multimedia Computing, Communications, and Applications

Video Decolorization Based on the CNN and LSTM Neural Network

Abstract

Video decolorization is the process of converting three-channel color videos into single-channel grayscale videos, which is essentially the decolorization of individual video frames. Most existing video decolorization algorithms directly apply image decolorization methods to the frames. However, considering only the single-frame decolorization result inevitably causes temporal inconsistency and a flicker phenomenon, meaning that the same local content in consecutive video frames may be mapped to different gray values. In addition, consecutive video frames often share similar local content features, which indicates redundant information. To solve these problems, this article proposes a novel video decolorization algorithm based on the convolutional neural network and the long short-term memory (LSTM) neural network. First, we design a local semantic content encoder to learn and extract the shared local content of consecutive video frames, which better preserves the contrast of video frames. Second, a temporal feature controller based on bi-directional recurrent neural networks with long short-term memory units refines the local semantic features, which helps maintain the temporal consistency of the video sequence and eliminate the flicker phenomenon. Finally, we use deconvolution to decode the features and produce the grayscale video sequence. Experiments indicate that our method preserves the local contrast of video frames and the temporal consistency better than the state of the art.
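The abstract describes a three-stage architecture: a CNN encoder that extracts local semantic features from each frame, a bidirectional LSTM that refines those features across time, and a deconvolution decoder that produces the grayscale frames. The PyTorch sketch below illustrates one plausible reading of that pipeline; it is not the authors' published implementation, and the layer sizes, the per-location treatment of the temporal LSTM, and all identifiers (VideoDecolorizationNet, feat_ch, lstm_hidden) are assumptions made for illustration.

```python
# Minimal sketch of an encoder / temporal-controller / decoder pipeline for
# video decolorization, assembled only from the high-level description in the
# abstract. All design choices here are assumptions, not the paper's method.
import torch
import torch.nn as nn


class VideoDecolorizationNet(nn.Module):
    def __init__(self, feat_ch: int = 64, lstm_hidden: int = 64):
        super().__init__()
        # Local semantic content encoder: a shared CNN applied to every frame.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, feat_ch, kernel_size=3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
        # Temporal feature controller: a bidirectional LSTM over the time axis,
        # applied independently at each spatial location of the feature maps.
        self.temporal = nn.LSTM(
            input_size=feat_ch, hidden_size=lstm_hidden,
            batch_first=True, bidirectional=True,
        )
        self.fuse = nn.Linear(2 * lstm_hidden, feat_ch)
        # Decoder: deconvolution (transposed convolution) back to one gray channel.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(feat_ch, 32, kernel_size=4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(32, 1, kernel_size=4, stride=2, padding=1), nn.Sigmoid(),
        )

    def forward(self, video: torch.Tensor) -> torch.Tensor:
        # video: (B, T, 3, H, W) color frames with values in [0, 1]
        b, t, c, h, w = video.shape
        feats = self.encoder(video.reshape(b * t, c, h, w))          # (B*T, F, h', w')
        f, hh, ww = feats.shape[1], feats.shape[2], feats.shape[3]
        # Group the same spatial position across consecutive frames into one sequence.
        seq = feats.reshape(b, t, f, hh, ww).permute(0, 3, 4, 1, 2)  # (B, h', w', T, F)
        seq = seq.reshape(b * hh * ww, t, f)
        refined, _ = self.temporal(seq)                              # (B*h'*w', T, 2*hidden)
        refined = self.fuse(refined)                                 # back to F channels
        refined = refined.reshape(b, hh, ww, t, f).permute(0, 3, 4, 1, 2)
        gray = self.decoder(refined.reshape(b * t, f, hh, ww))       # (B*T, 1, H, W)
        return gray.reshape(b, t, 1, h, w)


if __name__ == "__main__":
    model = VideoDecolorizationNet()
    clip = torch.rand(1, 4, 3, 64, 64)   # one 4-frame color clip
    print(model(clip).shape)             # torch.Size([1, 4, 1, 64, 64])
```

Running the LSTM per spatial location is one simple way to share temporal information without a ConvLSTM; the paper may organize the recurrent refinement differently.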
