IEEE Applied Imagery Pattern Recognition Workshop

Multi-modal Data Analysis and Fusion for Robust Object Detection in 2D/3D Sensing



Abstract

Multi-modal data is useful for complex imaging scenarios because each modality carries information the others lack, yet meaningful comparisons of different modalities for object detection are scarce. In our work, we make three contributions: (1) the release of a multi-modal, ground-based small object detection dataset, (2) a performance comparison of 2D and 3D imaging modalities using state-of-the-art algorithms, and (3) a multi-modal fusion framework for 2D/3D sensing. The new dataset encompasses various small objects for detection in EO, IR, and LiDAR modalities. The labeled data has comparable resolutions across each modality to support fair performance analysis. The modality comparison conducted in this work uses advanced deep learning algorithms, Mask R-CNN for 2D imaging and PointNet++ for 3D imaging. The comparisons are conducted with similar parameter sizes, and the results are analyzed for specific instances where each modality performed best. To exploit the complementary strengths of the different data modalities, we developed a fusion strategy that combines detection networks operating on different modalities into a single detection output for accurate object detection and region segmentation. Our fusion strategy uses the state-of-the-art networks listed above as backbone networks to obtain a confidence score from each modality. The network then bases each object detection on the modality with the higher confidence. We evaluate the proposed fusion method on the multi-modal dataset for object detection and segmentation and observe superior performance compared to single-modality algorithms.
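The abstract does not give pseudocode for the confidence-based fusion step. A minimal sketch of that idea, selecting, per candidate object, the detection from whichever backbone (e.g. Mask R-CNN for 2D, PointNet++ for 3D) reports the higher confidence, might look like the following. The `Detection` class and index-based pairing of candidates are illustrative assumptions, standing in for whatever 2D/3D association the authors actually use:

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Detection:
    label: str         # predicted object class
    confidence: float  # backbone confidence score in [0, 1]
    modality: str      # "2d" (e.g. Mask R-CNN) or "3d" (e.g. PointNet++)


def fuse_by_confidence(dets_2d: List[Detection],
                       dets_3d: List[Detection]) -> List[Detection]:
    """For each candidate object, keep the detection from whichever
    modality reported the higher confidence.

    Candidates are paired by list index here, a simplification of
    associating 2D boxes with 3D regions.
    """
    fused = []
    for d2, d3 in zip(dets_2d, dets_3d):
        fused.append(d2 if d2.confidence >= d3.confidence else d3)
    return fused


if __name__ == "__main__":
    dets_2d = [Detection("car", 0.9, "2d"), Detection("bike", 0.4, "2d")]
    dets_3d = [Detection("car", 0.7, "3d"), Detection("bike", 0.8, "3d")]
    for d in fuse_by_confidence(dets_2d, dets_3d):
        print(d.label, d.modality, d.confidence)
```

In a real pipeline the per-modality scores would come from the backbone networks' own outputs, and the winning modality's mask or point segment would also be carried into the fused region-segmentation result.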
