首页> 外文会议>European conference on computer vision >Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features
【24h】

Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features

机译:先随机后组装:学习与对象无关的视觉关系特征

获取原文

摘要

Due to the fact that it is prohibitively expensive to completely annotate visual relationships, i.e., the (obj1, rel, obj2) triplets, relationship models are inevitably biased to object classes of limited pairwise patterns, leading to poor generalization to rare or unseen object combinations. Therefore, we are interested in learning object-agnostic visual features for more generalizable relationship models. By "agnostic", we mean that the feature is less likely biased to the classes of paired objects. To alleviate the bias, we propose a novel Shuffle-Then-Assemble pre-training strategy. First, we discard all the triplet relationship annotations in an image, leaving two unpaired object domains without objl-obj'2 alignment. Then, our feature learning is to recover possible objl-obj2 pairs. In particular, we design a cycle of residual transformations between the two domains, to capture shared but not object-specific visual patterns. Extensive experiments on two visual relationship benchmarks show that by using our pre-trained features, naive relationship models can be consistently improved and even outperform other state-of-the-art relationship models. Code has been made available.
机译:由于完全注释视觉关系(即(obj1,rel,obj2)三元组)非常昂贵,因此关系模型不可避免地偏向于有限的成对模式的对象类,从而导致对稀有或看不见的对象组合的概括性很差。因此,我们有兴趣学习与对象无关的视觉特征,以获取更通用的关系模型。所谓“不可知论”,是指该特征不太可能偏向配对对象的类别。为了减轻这种偏见,我们提出了一种新颖的Shuffle-Then-Assemble预训练策略。首先,我们丢弃图像中的所有三元组关系注释,留下两个没有objl-obj'2对齐的不成对的对象域。然后,我们的特征学习是恢复可能的objl-obj2对。特别是,我们设计了两个域之间的残差变换循环,以捕获共享的但不是特定于对象的视觉模式。在两个视觉关系基准上进行的大量实验表明,通过使用我们预先训练的功能,天真的关系模型可以得到持续改进,甚至优于其他最新的关系模型。代码已可用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号