首页> 外文会议>IEEE Conference on Applications of Computer Vision >Learn How to Choose: Independent Detectors Versus Composite Visual Phrases
【24h】

Learn How to Choose: Independent Detectors Versus Composite Visual Phrases

机译:了解如何选择:独立探测器与复合视觉短语

获取原文

摘要

Most approaches for scene parsing, recognition or retrieval use detectors that are either (i) independently trained or (ii) jointly trained for conjunctions of object-object or object-attribute phrases. We posit that neither of these two extremes is uniformly optimal, in terms of performance, across all categories and conjunctions. The choice of whether one should train an independent or composite detector should be made for each possible conjunction separately, and depends on the statistics of the dataset as well. For example, person holding phone may be more accurately modeled using a single composite detector, while tall person may be more accurately modeled as combination of two detectors. We extensively study this issue in the context of multiple problems and datasets. Further, for e ciency, we propose a predictor that is based on a number of category speci c features (e.g., sample size, entropy, etc.) for whether independent or joint composite detector may be more accurate for a given conjunction. We show that our prediction and selection mechanism generalizes and leads to improved performance on a number of large-scale datasets and vision tasks.
机译:用于场景解析,识别或检索的大多数方法都使用检测器,这些检测器要么是(i)独立训练的,要么是(ii)联合训练的,用于对象-对象或对象-属性短语的结合。我们认为,就表现而言,在所有类别和联结中,这两个极端都不是统一最优的。应该为每个可能的结合点分别选择是训练一个独立的检测器,还是应该训练一个复合检测器,并且还取决于数据集的统计数据。例如,可以使用单个复合检测器对手持电话的人进行更准确的建模,而将两个检测器的组合对高个子的人进行更准确的建模。我们在多个问题和数据集的背景下对此问题进行了广泛的研究。此外,为了提高效率,我们提出了一个预测器,该预测器基于多个类别特定特征(例如样本大小,熵等)来确定独立或联合复合检测器对于给定的结合点可能更准确。我们表明,我们的预测和选择机制可以概括并导致许多大型数据集和视觉任务的性能提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号