【24h】

Learning Class-Based Graph Representation for Object Detection

机译:基于类的基于类的图形表示,用于对象检测

获取原文

摘要

Object detection has achieved a tremendous advancement based on feature-based learning in the vision space, while little work has focused on reasoning in the perception space like humans. One of the greatest challenges lies in that it is difficult to build a connectivity model in the topological space for relational reasoning, since the current network is better at modeling the distribution of structured data. To settle this issue, we introduce a novel graph modeling mechanism with class-based graph representation, which contributes to modeling the high-order topology structure that maps the data distribution to make the detection models have better relational reasoning ability. In this mechanism, we propose three learning subtasks, i.e., vision-to-perception embedding, perception reasoning graph representation, and perception-to-vision modeling. The mechanism based on such subtasks effectively maintains the independence of the original detection network and the proposed mechanism-based model, thus it can be well integrated with existing detection models without additional modification. The experimental results demonstrate the feasibility and effectiveness of our proposed mechanism, and the new state-of-the-art performance can be achieved on the public challenging datasets for object detection.
机译:对象检测基于视觉空间中基于特征的学习的基于特征学习实现了巨大进步,而小的工作已经专注于人类的感知空间的推理。其中一个最大的挑战在于,难以在拓扑空间中建立一个连接模型以进行关系推理,因为当前网络更好地建模结构化数据的分布。为了解决这个问题,我们介绍了一种新的图形建模机制,具有基于类的图形表示,有助于建模映射数据分布的高阶拓扑结构,使检测模型具有更好的关系推理能力。在这种机制中,我们提出了三个学习子任务,即视觉依赖于感知嵌入,感知推理图表示和感知到视觉建模。基于这些子特设的机制有效地保持了原始检测网络的独立性和所提出的基于机构的模型,因此可以很好地与现有的检测模型结合而无需额外修改。实验结果表明了我们所提出的机制的可行性和有效性,并且可以在公众具有挑战性的数据集中实现新的最先进的性能进行物体检测。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号