Hierarchical visual localization for visually impaired people using multimodal images

Cheng Ruiqi; Hu Weijian; Chen Hao; Fang Yicheng; Wang Kaiwei; Xu Zhijie; Bai Jian

首页> 外文期刊>Expert systems with applications >Hierarchical visual localization for visually impaired people using multimodal images

【24h】

Hierarchical visual localization for visually impaired people using multimodal images

机译：使用多式联运图像的视觉障碍者的分层视觉本地化

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Localization is one of the crucial issues in assistive technology for visually impaired people. In this paper, we propose a novel hierarchical visual localization pipeline based on the wearable assistive navigation device for visually impaired people. The proposed pipeline involves the deep descriptor network, 2D-3D geometric verification and online sequence matching. Images in different modalities (RGB, Infrared and Depth) are fed into Dual Desc network to generate robust attentive global descriptors and local features. The global descriptors are leveraged to retrieve the coarse candidates of query images. The 2D local features, as well as 3D sparse point cloud, are used in geometric verification to select the optimal results from the retrieved candidates. Finally, sequence matching robustifies the localization results by synthesizing the verified results of successive frames. The proposed unified descriptor network Dual Desc surpasses the state-of-the-art NetVLAD and its variant on the task of image description. Validated on the real-world dataset captured by the wearable assistive device, the proposed visual localization utilizes multimodal images to overcome the disadvantages of RGB images and robustifies the localization performance by deep descriptor network and hierarchical pipeline. In the challenging scenarios of the Yuquan dataset, the proposed method achieves the F-1 score of 0.77 and the mean localization error of 2.75, which is satisfactory in practical use. (C) 2020 Published by Elsevier Ltd.

机译：本地化是视障人士辅助技术的关键问题之一。在本文中，我们提出了一种基于可穿戴辅助导航装置的新型等级视觉定位管道，用于视力受损人员。所提出的管道涉及深度描述符网络，2D-3D几何验证和在线序列匹配。将不同模式（RGB，红外和深度）的图像送入双DESC网络，以生成强大的周度全局描述符和本地特征。可以利用全局描述符来检索查询图像的粗候候选。 2D本地功能以及3D稀疏点云用于几何验证，以选择来自检索到的候选物的最佳结果。最后，序列匹配通过合成连续帧的验证结果来强制定位结果。所提出的统一描述符网络双DESC超出了最先进的NetVlad及其对图像描述任务的变体。在可穿戴辅助设备捕获的真实数据集上验证，所提出的视觉本地化利用多模式图像来克服RGB图像的缺点，并通过深描述符网络和层级管道强制定位性能。在Yuquan数据集的具有挑战性的情况下，所提出的方法实现了0.77的F-1得分和2.75的平均定位误差，在实际使用中是令人满意的。（c）2020由elestvier有限公司发布

著录项

来源
《Expert systems with applications》 |2021年第3期|113743.1-113743.12|共12页
作者
Cheng Ruiqi; Hu Weijian; Chen Hao; Fang Yicheng; Wang Kaiwei; Xu Zhijie; Bai Jian;
展开▼
作者单位

Zhejiang Univ Natl Engn Res Ctr Opt Instrumentat Hangzhou Peoples R China;

Zhejiang Univ Natl Engn Res Ctr Opt Instrumentat Hangzhou Peoples R China;

Zhejiang Univ Natl Engn Res Ctr Opt Instrumentat Hangzhou Peoples R China;

Zhejiang Univ Natl Engn Res Ctr Opt Instrumentat Hangzhou Peoples R China;

Zhejiang Univ Natl Engn Res Ctr Opt Instrumentat Hangzhou Peoples R China;

Univ Huddersfield Sch Comp & Engn Huddersfield W Yorkshire England;

Zhejiang Univ Natl Engn Res Ctr Opt Instrumentat Hangzhou Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Deep image descriptor; Place recognition; Geometric validation; Sequence matching; RGB-D-IR image;

机译：深图像描述符;放置识别;几何验证;序列匹配;RGB-D-IR图像;

相似文献

外文文献
中文文献
专利

1. Multimodal Perception of Histological Images for Persons Who Are Blind or Visually Impaired [J] . TING ZHANG, BRADLEY S. DUERSTOCK, JUAN P. WACHS ACM transactions on accessible computing . 2017,第3期

机译：盲人或视力障碍者对组织学图像的多模态感知
2. A novel multimodal interface for improving visually impaired people's web accessibility [J] . Wai Yu, Ravi Kuber, Emma Murphy, Virtual reality . 2006,第2a3期

机译：一种新颖的多模式界面，可改善视障人士的网络访问能力
3. Comprehensive assessment for motor and visually impaired people using a hierarchical model [J] . Khan Awais Ahmad, Kbar Ghassan Ali, Ahmad Naveed Universal access in the information society . 2018,第1期

机译：使用分层模型的电机和视力障碍者综合评估
4. Image Processing Based Indoor Localization System for Assisting Visually Impaired People [C] . Tasnia Ashrafi Heya, Sayed Erfan Arefin, Amitabha Chakrabarty, Ubiquitous Positioning, Indoor Navigation and Location-Based Services . 2018

机译：基于图像处理的视觉辅助室内定位系统
5. Multimodal perception of histological images for persons blind or visually impaired. [D] . Zhang, Ting. 2014

机译：盲人或视障人士对组织学图像的多模式感知。
6. Fuzzy Logic Type-2 Based Wireless Indoor Localization System for Navigation of Visually Impaired People in Buildings [O] . Basem AL-Madani, Farid Orujov, Rytis Maskeliūnas, 2019

机译：基于模糊逻辑2型的室内视觉障碍者无线室内定位系统
7. Multimodal Collaborative Handwriting Training for Visually-Impaired People [O] . Beryl Plimmer, Andrew Crossan, Stephen A. Brewster, 2009

机译：视障人士的多模式协作手写训练

Hierarchical visual localization for visually impaired people using multimodal images

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅