Few-labeled visual recognition for self-driving using multi-view visual-semantic representation

Wang Da-Han; Li Jianmin; Zhu Shunzhi

首页> 外文期刊>Neurocomputing >Few-labeled visual recognition for self-driving using multi-view visual-semantic representation

【24h】

Few-labeled visual recognition for self-driving using multi-view visual-semantic representation

机译：使用多视图视觉语义表示的自动驾驶的少量视觉识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

For a car self-driving system, it is vital to accurately recognize the objects on the road. This is often achieved by analyzing the images captured by various cameras. However, in real applications, we often have limited number of labeled images. Besides, it is very expensive to manually annotate objects in images. It is hard to learn reliable classifiers when only few-labeled images are available. To alleviate these problems, multi-view information is used. However, only using visual information is not enough for reliable classification. Besides, we often have limited images with labels. To cope with these problems, in this paper, we propose a novel multi-view visual-semantic representation method for few-labeled visual recognition ((MVS)-S-2). We make use of the state-of-the-art deep convolutional neural networks by viewing them as different views to extract semantic representations of images. This is achieved by using the learned deep convolutional neural networks to make predictions of the semantics of images. Both the visual and semantic representations of images are then used to predict the categories of images by combining the predictions of multi-views with visual and semantic consistency constraints. Experiments on four public available datasets prove the effectiveness of the proposed MV2S method. (C) 2020 Elsevier B.V. All rights reserved.

机译：对于汽车自动驾驶系统，准确识别道路上的物体至关重要。这通常是通过分析由各种相机捕获的图像来实现的。但是，在实际应用中，我们经常具有有限数量的标记图像。此外，手动注释图像中的对象非常昂贵。只有少量标记的图像可用时，很难学习可靠的分类器。为了减轻这些问题，使用多视图信息。但是，只有使用视觉信息不足以可用于可靠的分类。此外，我们经常有有限的图像标签。为了应对这些问题，在本文中，我们提出了一种用于少量标记的视觉识别（（MVS）-2）的新型多视觉视觉语义表示方法。我们通过将它们视为不同的视图来利用最先进的深度卷积神经网络以提取图像的语义表示。这是通过使用学习的深度卷积神经网络来实现的，以便预测图像的语义。然后，通过将多视图的预测与视觉和语义一致性约束组合来预测图像的视觉和语义表示来预测图像类别。四个公共可用数据集的实验证明了所提出的MV2S方法的有效性。（c）2020 Elsevier B.v.保留所有权利。

著录项

来源
《Neurocomputing》 |2021年第7期|361-367|共7页
作者
Wang Da-Han; Li Jianmin; Zhu Shunzhi;
展开▼
作者单位

Xiamen Univ Technol Sch Comp & Informat Engn Fujian Key Lab Pattern Recognit & Image Understan 600 Ligong Rd Xiamen 361024 Fujian Peoples R China;

Xiamen Univ Technol Sch Comp & Informat Engn Fujian Key Lab Pattern Recognit & Image Understan 600 Ligong Rd Xiamen 361024 Fujian Peoples R China;

Xiamen Univ Technol Sch Comp & Informat Engn Fujian Key Lab Pattern Recognit & Image Understan 600 Ligong Rd Xiamen 361024 Fujian Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Few-labeled view recognition; Self-driving object categorization; Multi-view learning; Semantic representation;

机译：少数标记的视图识别;自动驾驶对象分类;多视图学习;语义表示;

Few-labeled visual recognition for self-driving using multi-view visual-semantic representation

摘要

著录项

相关主题

期刊订阅