Learning Class-Based Graph Representation for Object Detection

机译：基于类的基于类的图形表示，用于对象检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Object detection has achieved a tremendous advancement based on feature-based learning in the vision space, while little work has focused on reasoning in the perception space like humans. One of the greatest challenges lies in that it is difficult to build a connectivity model in the topological space for relational reasoning, since the current network is better at modeling the distribution of structured data. To settle this issue, we introduce a novel graph modeling mechanism with class-based graph representation, which contributes to modeling the high-order topology structure that maps the data distribution to make the detection models have better relational reasoning ability. In this mechanism, we propose three learning subtasks, i.e., vision-to-perception embedding, perception reasoning graph representation, and perception-to-vision modeling. The mechanism based on such subtasks effectively maintains the independence of the original detection network and the proposed mechanism-based model, thus it can be well integrated with existing detection models without additional modification. The experimental results demonstrate the feasibility and effectiveness of our proposed mechanism, and the new state-of-the-art performance can be achieved on the public challenging datasets for object detection.

机译：对象检测基于视觉空间中基于特征的学习的基于特征学习实现了巨大进步，而小的工作已经专注于人类的感知空间的推理。其中一个最大的挑战在于，难以在拓扑空间中建立一个连接模型以进行关系推理，因为当前网络更好地建模结构化数据的分布。为了解决这个问题，我们介绍了一种新的图形建模机制，具有基于类的图形表示，有助于建模映射数据分布的高阶拓扑结构，使检测模型具有更好的关系推理能力。在这种机制中，我们提出了三个学习子任务，即视觉依赖于感知嵌入，感知推理图表示和感知到视觉建模。基于这些子特设的机制有效地保持了原始检测网络的独立性和所提出的基于机构的模型，因此可以很好地与现有的检测模型结合而无需额外修改。实验结果表明了我们所提出的机制的可行性和有效性，并且可以在公众具有挑战性的数据集中实现新的最先进的性能进行物体检测。

著录项

来源
《European Conference on Artificial Intelligence;Conference on Prestigious Applications of Intelligent Systems》|2020年|2275-3053p|共8页
会议地点
作者
Shuyu Miao; Rui Feng; Yuejie Zhang; Weiguo Fan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词

相似文献

外文文献
中文文献
专利

1. A Model for Learning Topographically Organized Parts-Based Representations of Objects in Visual Cortex: Topographic Nonnegative Matrix Factorization [J] . Kenji Hosoda, Masataka Watanabe, Heiko Wersing, Neural computation . 2009,第9期

机译：用于在视觉皮质中学习基于地形组织部分的对象表示的模型：地形非负矩阵分解
2. Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection [J] . Xu Hongyu, Lv Xutao, Wang Xiaoyu, IEEE Transactions on Pattern Analysis and Machine Intelligence . 2021,第6期

机译：深度区域：通用物体检测的混合表示和深度学习
3. Learning channel-wise spatio-temporal representations for video salient object detection [J] . Huang Kan, Li Ge, Liu Shan Neurocomputing . 2020,第Auga25期

机译：学习视频突出对象检测的渠道方面的时空表示
4. Learning Class-Based Graph Representation for Object Detection [C] . Shuyu Miao, Rui Feng, Yuejie Zhang, European Conference on Artificial Intelligence;Conference on Prestigious Applications of Intelligent Systems . 2020

机译：基于类的基于类的图形表示，用于对象检测
5. SigSpace: Class-based feature representation for scalable and distributed machine learning. [D] . Doddala, Seetha Rama Pradyumna. 2016

机译：SigSpace：用于扩展和分布式机器学习的基于类的功能表示。
6. Reinforced AdaBoost Learning for Object Detection with Local Pattern Representations [O] . Younghyun Lee, David K. Han, Hanseok Ko 2013

机译：增强的AdaBoost学习用于使用局部模式表示进行对象检测
7. A Model for Learning Topographically Organized Parts-Based Representations of Objects in Visual Cortex: Topographic Nonnegative Matrix Factorization [O] . Hosoda, K., Watanabe, M., Wersing H, Koerner E, Tsujino H, Tamura, H., 2009

机译：一种用于学习地形上基于零件的视觉皮层中物体表示的模型：地形非负矩阵分解

Learning Class-Based Graph Representation for Object Detection

摘要

著录项

相似文献

相关主题

期刊订阅