...
首页> 外文期刊>International Journal of Sensors, Wireless Communication and Control >An Empirical Study of Global Descriptors for Image-based Local- ization in Dense Urban Scenes
【24h】

An Empirical Study of Global Descriptors for Image-based Local- ization in Dense Urban Scenes

机译:密集城市场景中基于图像的本地化全局描述符的实证研究

获取原文
获取原文并翻译 | 示例

摘要

Background: Visual place recognition is an interesting technology that can be used in many domains such as localizing historical photos, autonomous navigation and augmented reality. The main stream of research in that domain was based on the use of local invariant features like SIFT. Little attention was given to region descriptors which can encompass local and global visual appearances. In this paper, we provide an empirical study on two main visual descriptors: (i) Local Binary Patterns, and (ii) Covariance matrices. Methods: In order to enhance the discriminative power of the final descriptor of each type, multi-block based descriptors are designed and compared. The descriptor corresponding to each input is formed by the concatenation of the features extracted from each building block. We show experimental results on matching test images with reference images acquired in dense urban scenes in the streets of the city of Paris. The problems of scale changes and occlusions are both treated by simulation. Different combinations of processing steps are treated (covariance and LBP descriptors, mono and multi-blocks, L1 and Chi-Squared distances). Results: The obtained results for the tested scenarios show an improvement in the classification rate for at least three scenarios when using multi-block based features rather than mono-block based ones. Conclusion: The use of multi-block based features can thus enhance the discrimination of the obtained final descriptor. The corresponding matching algorithms can lead to both high accuracy and scalability.
机译:背景:视觉位置识别是一项有趣的技术,可用于许多领域,例如对历史照片进行本地化,自主导航和增强现实。该领域的研究主要基于使用SIFT等局部不变特征。几乎没有关注可包含局部和全局视觉外观的区域描述符。在本文中,我们对两个主要的视觉描述符进行了实证研究:(i)局部二元模式和(ii)协方差矩阵。方法:为了增强每种类型的最终描述符的区分能力,设计并比较了基于多块的描述符。对应于每个输入的描述符是由从每个构造块提取的特征的串联构成的。我们将匹配的测试图像与在巴黎市街道上密集的城市场景中获取的参考图像进行匹配,以显示实验结果。规模变化和遮挡的问题均通过仿真处理。处理步骤的不同组合(协方差和LBP描述子,单块和多块,L1和Chi-Squared距离)。结果:在使用基于多块的功能而不是基于单个块的功能时,所测试场景的结果显示出至少三种场景的分类率有所提高。结论:基于多块的特征的使用可以因此增强对所获得的最终描述符的辨别力。相应的匹配算法可以导致高精度和可伸缩性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号