首页> 外文期刊>Expert systems with applications >Hierarchical visual localization for visually impaired people using multimodal images
【24h】

Hierarchical visual localization for visually impaired people using multimodal images

机译:使用多式联运图像的视觉障碍者的分层视觉本地化

获取原文
获取原文并翻译 | 示例
           

摘要

Localization is one of the crucial issues in assistive technology for visually impaired people. In this paper, we propose a novel hierarchical visual localization pipeline based on the wearable assistive navigation device for visually impaired people. The proposed pipeline involves the deep descriptor network, 2D-3D geometric verification and online sequence matching. Images in different modalities (RGB, Infrared and Depth) are fed into Dual Desc network to generate robust attentive global descriptors and local features. The global descriptors are leveraged to retrieve the coarse candidates of query images. The 2D local features, as well as 3D sparse point cloud, are used in geometric verification to select the optimal results from the retrieved candidates. Finally, sequence matching robustifies the localization results by synthesizing the verified results of successive frames. The proposed unified descriptor network Dual Desc surpasses the state-of-the-art NetVLAD and its variant on the task of image description. Validated on the real-world dataset captured by the wearable assistive device, the proposed visual localization utilizes multimodal images to overcome the disadvantages of RGB images and robustifies the localization performance by deep descriptor network and hierarchical pipeline. In the challenging scenarios of the Yuquan dataset, the proposed method achieves the F-1 score of 0.77 and the mean localization error of 2.75, which is satisfactory in practical use. (C) 2020 Published by Elsevier Ltd.
机译:本地化是视障人士辅助技术的关键问题之一。在本文中,我们提出了一种基于可穿戴辅助导航装置的新型等级视觉定位管道,用于视力受损人员。所提出的管道涉及深度描述符网络,2D-3D几何验证和在线序列匹配。将不同模式(RGB,红外和深度)的图像送入双DESC网络,以生成强大的周度全局描述符和本地特征。可以利用全局描述符来检索查询图像的粗候候选。 2D本地功能以及3D稀疏点云用于几何验证,以选择来自检索到的候选物的最佳结果。最后,序列匹配通过合成连续帧的验证结果来强制定位结果。所提出的统一描述符网络双DESC超出了最先进的NetVlad及其对图像描述任务的变体。在可穿戴辅助设备捕获的真实数据集上验证,所提出的视觉本地化利用多模式图像来克服RGB图像的缺点,并通过深描述符网络和层级管道强制定位性能。在Yuquan数据集的具有挑战性的情况下,所提出的方法实现了0.77的F-1得分和2.75的平均定位误差,在实际使用中是令人满意的。 (c)2020由elestvier有限公司发布

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号