Neurocomputing

Taylor saves for later: Disentanglement for video prediction using Taylor representation



Abstract

Video prediction is a challenging task with wide application prospects in meteorology and robot systems. Existing works fail to balance short-term and long-term prediction performance and to extract robust latent dynamic laws from video frames. We propose a two-branch sequence-to-sequence deep model that disentangles a Taylor feature and a residual feature in video frames through a novel recurrent prediction module (TaylorCell) and a residual module, based on a novel principle for feature separation. TaylorCell expands the video frames' high-dimensional features into a finite Taylor series to describe the latent laws. Within TaylorCell, we propose the Taylor prediction unit (TPU) and the memory correction unit (MCU). TPU uses the first input frame's derivative information to predict future frames, avoiding error accumulation. MCU distills the information of all past frames to correct the Taylor feature predicted by TPU. Correspondingly, the residual module extracts the residual feature complementary to the Taylor feature. Owing to the characteristics of the Taylor series, our model works better on datasets with short-range spatial dependencies and stable dynamics. On three widely used datasets (Moving MNIST, TaxiBJ, Human3.6M), our model matches the state-of-the-art model on short-term forecasting and outperforms it on long-term forecasting. Ablation experiments demonstrate the contribution of each module in our model. (c) 2021 Elsevier B.V. All rights reserved.
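To make the Taylor-expansion idea concrete, below is a minimal PyTorch-style sketch, not the authors' code: the class name, feature dimension, expansion order, and the linear "derivative heads" are all assumptions for illustration. It predicts every future latent feature as f(t) ≈ f(0) + Σ_{k=1}^{K} f^(k)(0) · t^k / k!, directly from the first frame's feature, which mirrors how TPU avoids error accumulation.

import torch
import torch.nn as nn


class TaylorPredictionSketch(nn.Module):
    """Minimal sketch of Taylor-series prediction over latent features.
    Hypothetical layout; not the paper's TaylorCell implementation."""

    def __init__(self, feat_dim: int, order: int = 3):
        super().__init__()
        # One learned head per derivative order k = 1..K; each head
        # estimates the k-th "derivative" of the latent feature at t0 = 0.
        self.deriv_heads = nn.ModuleList(
            [nn.Linear(feat_dim, feat_dim) for _ in range(order)]
        )

    def forward(self, f0: torch.Tensor, steps: int) -> torch.Tensor:
        # f0: (batch, feat_dim), feature of the first input frame.
        derivs = [head(f0) for head in self.deriv_heads]
        preds = []
        for t in range(1, steps + 1):
            feat = f0
            factorial = 1.0
            for k, d_k in enumerate(derivs, start=1):
                factorial *= k  # running k!
                feat = feat + d_k * (t ** k) / factorial
            preds.append(feat)
        # Every step expands around the first frame (t0 = 0), so
        # prediction errors do not accumulate across time steps.
        return torch.stack(preds)  # (steps, batch, feat_dim)


# Usage: predict 10 future feature vectors from one initial feature.
model = TaylorPredictionSketch(feat_dim=64, order=3)
future = model(torch.randn(8, 64), steps=10)
print(future.shape)  # torch.Size([10, 8, 64])

The MCU-style correction of the predicted Taylor feature using a memory over past frames, and the complementary residual branch, are omitted from this sketch.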
