Home > Foreign Journals > IEEE Transactions on Image Processing > Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency

Spatiotemporal Knowledge Distillation for Efficient Estimation of Aerial Video Saliency



Abstract

The performance of video saliency estimation techniques has achieved significant advances along with the rapid development of Convolutional Neural Networks (CNNs). However, devices like cameras and drones may have limited computational capability and storage space so that the direct deployment of complex deep saliency models becomes infeasible. To address this problem, this paper proposes a dynamic saliency estimation approach for aerial videos via spatiotemporal knowledge distillation. In this approach, five components are involved, including two teachers, two students and the desired spatiotemporal model. The knowledge of spatial and temporal saliency is first separately transferred from the two complex and redundant teachers to their simple and compact students, while the input scenes are also degraded from high-resolution to low-resolution to remove the probable data redundancy so as to greatly speed up the feature extraction process. After that, the desired spatiotemporal model is further trained by distilling and encoding the spatial and temporal saliency knowledge of two students into a unified network. In this manner, the inter-model redundancy can be removed for the effective estimation of dynamic saliency on aerial videos. Experimental results show that the proposed approach is comparable to 11 state-of-the-art models in estimating visual saliency on aerial videos, while its speed reaches up to 28,738 FPS and 1,490.5 FPS on the GPU and CPU platforms, respectively.
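The core of the approach described above is transferring knowledge from complex teacher networks to compact students. A common way to do this is to minimize the divergence between temperature-softened teacher and student output distributions; the sketch below illustrates this idea for the two-teacher (spatial and temporal) setup. It is a minimal illustration, not the paper's implementation: the function names, the KL-divergence form of the loss, and the `alpha` weighting are assumptions for demonstration.

```python
import numpy as np

def softmax(x, temperature=1.0):
    """Convert raw saliency logits into a probability map (temperature-softened)."""
    z = (x - x.max()) / temperature
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL divergence from the softened teacher map to the student map."""
    p_t = softmax(teacher_logits, temperature)
    p_s = softmax(student_logits, temperature)
    return float(np.sum(p_t * (np.log(p_t + 1e-12) - np.log(p_s + 1e-12))))

def spatiotemporal_loss(student_spatial, student_temporal,
                        teacher_spatial, teacher_temporal, alpha=0.5):
    """Hypothetical combined objective: weight the spatial and temporal
    distillation terms, mirroring the two-teacher/two-student setup."""
    return (alpha * distillation_loss(student_spatial, teacher_spatial)
            + (1 - alpha) * distillation_loss(student_temporal, teacher_temporal))
```

When the student reproduces the teacher's map exactly, the KL term is zero; any mismatch yields a positive loss, so gradient descent on this objective pulls the compact student toward the teacher's saliency predictions.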
