首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Blazingly Fast Video Object Segmentation with Pixel-Wise Metric Learning
【24h】

Blazingly Fast Video Object Segmentation with Pixel-Wise Metric Learning

机译:像素明智的度量学习,实现了惊人的快速视频对象分割

获取原文

摘要

This paper tackles the problem of video object segmentation, given some user annotation which indicates the object of interest. The problem is formulated as pixel-wise retrieval in a learned embedding space: we embed pixels of the same object instance into the vicinity of each other, using a fully convolutional network trained by a modified triplet loss as the embedding model. Then the annotated pixels are set as reference and the rest of the pixels are classified using a nearest-neighbor approach. The proposed method supports different kinds of user input such as segmentation mask in the first frame (semi-supervised scenario), or a sparse set of clicked points (interactive scenario). In the semi-supervised scenario, we achieve results competitive with the state of the art but at a fraction of computation cost (275 milliseconds per frame). In the interactive scenario where the user is able to refine their input iteratively, the proposed method provides instant response to each input, and reaches comparable quality to competing methods with much less interaction.
机译:给定一些指示感兴趣对象的用户注释,本文解决了视频对象分割问题。问题被表述为在学习的嵌入空间中逐像素检索:我们使用经过修改的三重态损失训练的完全卷积网络作为嵌入模型,将同一对象实例的像素嵌入彼此附近。然后将带注释的像素设置为参考,并使用最近邻居方法对其余像素进行分类。所提出的方法支持不同类型的用户输入,例如第一帧中的分割蒙版(半监督方案)或单击点的稀疏集(交互式方案)。在半监督的情况下,我们获得了与最新技术水平相当的结果,但计算成本却很小(每帧275毫秒)。在用户能够迭代地优化其输入的交互式场景中,所提出的方法可为每个输入提供即时响应,并且与具有更少交互的竞争方法相比,具有可比的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号