首页> 外文会议>SIBGRAPI Conference on Graphics, Patterns and Images >Patch PlaNet: Landmark Recognition with Patch Classification Using Convolutional Neural Networks
【24h】

Patch PlaNet: Landmark Recognition with Patch Classification Using Convolutional Neural Networks

机译:补丁PlaNet:使用卷积神经网络进行补丁分类的地标识别

获取原文

摘要

In this work we address the problem of landmark recognition. We extend PlaNet, a model based on deep neural networks that approaches the problem of landmark recognition as a classification problem and performs the recognition of places around the world. We propose an extension of the PlaNet technique in which we use a voting scheme to perform the classification, dividing the image into previously defined regions and inferring the landmark based on these regions. The prediction of the model depends not only on the information of the features learned by the deep convolutional neural network architecture during training, but also uses local information from each region in the image for which the classification is made. To validate our proposal, we performed the training of the original PlaNet model and our variation using a database built with images from Flickr, and evaluated the models in the Paris and Oxford Buildings datasets. It was possible to notice that the addition of image division and voting structure improves the accuracy result of the model by 5-11 percentage points on average, reducing the level of ambiguity found during the inference of the model.
机译:在这项工作中,我们解决了地标识别的问题。我们扩展了PlaNet,它是一个基于深度神经网络的模型,该模型将地标识别问题作为分类问题进行处理,并进行世界各地的识别。我们提出了PlaNet技术的扩展,其中我们使用表决方案来执行分类,将图像划分为先前定义的区域,并根据这些区域推断地标。模型的预测不仅取决于深度卷积神经网络体系结构在训练过程中学习到的特征信息,而且还使用来自图像中进行分类的每个区域的局部信息。为了验证我们的建议,我们使用由Flickr提供的图像构建的数据库对原始PlaNet模型及其变体进行了训练,并在Paris and Oxford Buildings数据集中评估了模型。可能注意到,图像分割和投票结构的添加将模型的准确性结果平均提高了5-11个百分点,从而降低了在模型推断过程中发现的歧义程度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号