IEEE International Conference on Computer Vision

Cutting Edge: Soft Correspondences in Multimodal Scene Parsing



Abstract

Exploiting multiple modalities for semantic scene parsing has been shown to improve accuracy over the single-modality scenario. Existing methods, however, assume that corresponding regions in the two modalities share the same label. In this paper, we address the problem of data misalignment and label inconsistencies in semantic labeling, e.g., due to moving objects, which violate the assumptions of existing techniques. To this end, we formulate multimodal semantic labeling as inference in a CRF and introduce latent nodes that explicitly model inconsistencies between the two domains. These latent nodes allow us not only to leverage information from both domains to improve their labeling, but also to cut the edges between inconsistent regions. To eliminate the need for hand-tuning the parameters of our model, we propose to learn the intra-domain and inter-domain potential functions from training data. We demonstrate the benefits of our approach on two publicly available datasets containing 2D imagery and 3D point clouds. Thanks to our latent nodes and our learning strategy, our method outperforms the state of the art in both cases.
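The CRF described in the abstract can be sketched as an energy function over a joint 2D/3D labeling, where a binary latent variable on each cross-modal edge either enforces label agreement or "cuts" the edge at a fixed cost. This is a minimal illustrative sketch only: the Potts-style potentials, weights, and cut penalty below are placeholders, not the learned potentials of the paper.

```python
# Hypothetical sketch of a multimodal CRF energy with latent consistency
# variables, loosely following the abstract. All potentials and weights
# are illustrative placeholders, not the authors' learned functions.

def crf_energy(labels_2d, labels_3d, z, unary_2d, unary_3d,
               edges_2d, edges_3d, cross_edges,
               w_intra=1.0, w_inter=1.0, cut_penalty=0.5):
    """Energy of a joint 2D/3D labeling.

    labels_2d, labels_3d : dicts mapping node id -> label
    z                    : dict mapping cross-edge (i, j) -> 0/1
                           (1 = regions consistent, 0 = edge 'cut')
    unary_2d, unary_3d   : dicts mapping node id -> {label: cost}
    edges_2d, edges_3d   : intra-domain neighbor pairs
    cross_edges          : inter-domain correspondence pairs (i_2d, j_3d)
    """
    e = 0.0
    # Unary terms from per-modality classifiers.
    for i, costs in unary_2d.items():
        e += costs[labels_2d[i]]
    for j, costs in unary_3d.items():
        e += costs[labels_3d[j]]
    # Intra-domain smoothness (Potts model: pay w_intra if neighbors disagree).
    for i, k in edges_2d:
        e += w_intra * (labels_2d[i] != labels_2d[k])
    for j, k in edges_3d:
        e += w_intra * (labels_3d[j] != labels_3d[k])
    # Inter-domain terms: the latent variable z can cut an inconsistent
    # edge, paying a fixed penalty instead of the disagreement cost.
    for (i, j) in cross_edges:
        if z[(i, j)]:
            e += w_inter * (labels_2d[i] != labels_3d[j])
        else:
            e += cut_penalty
    return e
```

With one image region and one corresponding 3D segment that genuinely disagree (say, a moving car visible in only one modality), minimizing this energy prefers setting z = 0 whenever the cut penalty is cheaper than forcing the two labels to agree, which is the behavior the latent nodes are introduced for.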

