IEEE Winter Conference on Applications of Computer Vision

Toward Interactive Self-Annotation For Video Object Bounding Box: Recurrent Self-Learning And Hierarchical Annotation Based Framework



Abstract

The amount and variety of training data drastically affect the performance of CNNs, so annotation methods are becoming increasingly critical for collecting data efficiently. In this paper, we propose a simple yet efficient Interactive Self-Annotation framework that cuts down both the time and the human labor cost of video object bounding-box annotation. Our method is based on recurrent self-supervised learning and consists of two processes: an automatic process and an interactive process, where the automatic process builds a supporting detector to speed up the interactive process. In Automatic Recurrent Annotation, we let an off-the-shelf detector watch unlabeled videos repeatedly to reinforce itself automatically. At each iteration, the model trained in the previous iteration generates better pseudo ground-truth bounding boxes than those of the iteration before, recurrently improving the self-supervised training of the detector. In Interactive Recurrent Annotation, we tackle the human-in-the-loop annotation scenario, in which the detector receives feedback from a human annotator. To this end, we propose a novel Hierarchical Correction module, in which the distance between annotated frames is halved at each time step, exploiting the CNN's strength on neighboring frames. Experimental results on various video datasets demonstrate that the proposed framework generates high-quality annotations while reducing annotation time and human labor cost.
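The Automatic Recurrent Annotation process described above can be sketched as a pseudo-label self-training loop. The sketch below is only illustrative: `ToyDetector`, its score dynamics, and the confidence threshold are invented stand-ins for exposition, not the paper's actual detector or training procedure.

```python
from dataclasses import dataclass

@dataclass
class Box:
    frame_id: int
    score: float  # detection confidence in [0, 1]

class ToyDetector:
    """Hypothetical stand-in detector whose confidence grows as it is
    retrained on accepted pseudo labels (mimicking self-reinforcement)."""
    def __init__(self, base_score=0.5):
        self.base_score = base_score

    def predict(self, frame):
        # Emit one candidate box per frame at the current confidence level.
        return [Box(frame, min(1.0, self.base_score))]

    def retrain(self, pseudo_labels):
        # Toy update: more accepted pseudo boxes -> a stronger model.
        n = sum(len(boxes) for boxes in pseudo_labels.values())
        return ToyDetector(min(1.0, self.base_score + 0.05 * n))

def recurrent_self_annotation(detector, unlabeled_frames,
                              iterations=3, conf_thresh=0.4):
    """Repeatedly pseudo-label the unlabeled video and retrain the detector,
    so each iteration yields better pseudo ground truth than the last."""
    for _ in range(iterations):
        # 1. Run the current detector; keep only confident boxes as
        #    pseudo ground truth.
        pseudo_labels = {
            frame_id: [b for b in detector.predict(frame)
                       if b.score >= conf_thresh]
            for frame_id, frame in unlabeled_frames.items()
        }
        # 2. Retrain on the pseudo labels; the improved model is used
        #    in the next pass over the same unlabeled video.
        detector = detector.retrain(pseudo_labels)
    return detector
```

With three frames and the toy dynamics above, each pass accepts all three boxes and raises the detector's confidence, illustrating how the loop bootstraps itself without human labels.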
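The Hierarchical Correction schedule can likewise be sketched: the gap between frames shown to the annotator is halved at each step, so early steps correct sparse keyframes and later steps densify. The function below is an assumed reading of that schedule, not the paper's implementation.

```python
def hierarchical_schedule(num_frames, initial_gap=16):
    """Return (gap, newly_annotated_frames) pairs for each correction step.

    The annotated frame-distance halves each step, so the human first
    corrects coarse keyframes and then progressively finer in-between
    frames, letting the detector interpolate over small neighborhoods.
    """
    steps, gap, seen = [], initial_gap, set()
    while gap >= 1:
        # Frames at the current stride that have not been corrected yet.
        new = [i for i in range(0, num_frames, gap) if i not in seen]
        steps.append((gap, new))
        seen.update(new)
        gap //= 2  # halve the annotated frame-distance
    return steps
```

For a 17-frame clip with an initial gap of 8, the annotator corrects frames 0, 8, 16 first, then 4 and 12, and so on until every frame is covered.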
