...
首页> 外文期刊>IEEE Transactions on Circuits and Systems for Video Technology >T-CNN: Tubelets With Convolutional Neural Networks for Object Detection From Videos
【24h】

T-CNN: Tubelets With Convolutional Neural Networks for Object Detection From Videos

机译:T-CNN:具有卷积神经网络的小管,用于从视频中检测对象

获取原文
获取原文并翻译 | 示例

摘要

The state-of-the-art performance for object detection has been significantly improved over the past two years. Besides the introduction of powerful deep neural networks, such as GoogleNet and VGG, novel object detection frameworks, such as R-CNN and its successors, Fast R-CNN, and Faster R-CNN, play an essential role in improving the state of the art. Despite their effectiveness on still images, those frameworks are not specifically designed for object detection from videos. Temporal and contextual information of videos are not fully investigated and utilized. In this paper, we propose a deep learning framework that incorporates temporal and contextual information from tubelets obtained in videos, which dramatically improves the baseline performance of existing still-image detection frameworks when they are applied to videos. It is called T-CNN, i.e., tubelets with convolutional neural networks. The proposed framework won newly introduced an object-detection-from-video task with provided data in the ImageNet Large-Scale Visual Recognition Challenge 2015. Code is publicly available at https://github.com/myfavouritekk/T-CNN.
机译:在过去两年中,对象检测的最新性能得到了显着改善。除了引入强大的深度神经网络(例如GoogleNet和VGG)之外,新颖的对象检测框架(例如R-CNN及其后继者Fast R-CNN和Faster R-CNN)在改善状态方面也起着至关重要的作用。艺术。尽管它们在静止图像上有效,但是这些框架并不是专门为视频中的对象检测而设计的。视频的时间和上下文信息未得到充分调查和利用。在本文中,我们提出了一种深度学习框架,该框架将视频中获得的细小管的时间和上下文信息融合在一起,当将其应用于视频时,可以显着提高现有静止图像检测框架的基线性能。它被称为T-CNN,即具有卷积神经网络的小管。拟议的框架在ImageNet大规模视觉识别挑战赛2015中赢得了新引入的从视频进行对象检测的任务,并提供了数据。代码可在https://github.com/myfavouritekk/T​​-CNN上公开获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号