Patch PlaNet: Landmark Recognition with Patch Classification Using Convolutional Neural Networks

机译：补丁PlaNet：使用卷积神经网络进行补丁分类的地标识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this work we address the problem of landmark recognition. We extend PlaNet, a model based on deep neural networks that approaches the problem of landmark recognition as a classification problem and performs the recognition of places around the world. We propose an extension of the PlaNet technique in which we use a voting scheme to perform the classification, dividing the image into previously defined regions and inferring the landmark based on these regions. The prediction of the model depends not only on the information of the features learned by the deep convolutional neural network architecture during training, but also uses local information from each region in the image for which the classification is made. To validate our proposal, we performed the training of the original PlaNet model and our variation using a database built with images from Flickr, and evaluated the models in the Paris and Oxford Buildings datasets. It was possible to notice that the addition of image division and voting structure improves the accuracy result of the model by 5-11 percentage points on average, reducing the level of ambiguity found during the inference of the model.

机译：在这项工作中，我们解决了地标识别的问题。我们扩展了PlaNet，它是一个基于深度神经网络的模型，该模型将地标识别问题作为分类问题进行处理，并进行世界各地的识别。我们提出了PlaNet技术的扩展，其中我们使用表决方案来执行分类，将图像划分为先前定义的区域，并根据这些区域推断地标。模型的预测不仅取决于深度卷积神经网络体系结构在训练过程中学习到的特征信息，而且还使用来自图像中进行分类的每个区域的局部信息。为了验证我们的建议，我们使用由Flickr提供的图像构建的数据库对原始PlaNet模型及其变体进行了训练，并在Paris and Oxford Buildings数据集中评估了模型。可能注意到，图像分割和投票结构的添加将模型的准确性结果平均提高了5-11个百分点，从而降低了在模型推断过程中发现的歧义程度。

著录项

来源
《SIBGRAPI Conference on Graphics, Patterns and Images》|2018年|126-133|共8页
会议地点
作者
Kelvin B. da Cunha; Lucas Maggi; Veronica Teichrieb; João Paulo Lima; Jonysberg Peixoto Quintino; Fabio Q.B. da Silva; André L.M. Santos; Helder Pinho;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Planets; Flickr; Training; Task analysis; Image recognition; Feature extraction; Computer architecture;

机译：行星; Flickr;训练;任务分析;图像识别;特征提取;计算机体系结构;

相似文献

外文文献
中文文献
专利

1. Data-level information enhancement: Motion-patch-based Siamese Convolutional Neural Networks for human activity recognition in videos [J] . Zhang Yujia, Po Lai Man, Liu Mengyang, Expert systems with applications . 2020,第Juna期

机译：数据级信息增强：视频中的运动补丁暹罗卷积神经网络，用于视频中的人类活动识别
2. Large patch convolutional neural networks for the scene classification of high spatial resolution imagery [J] . Zhong Yanfei, Fe Feng, Zhang Liangpei Journal of Applied Remote Sensing . 2016,第2期

机译：大补丁卷积神经网络用于高分辨率空间图像的场景分类
3. Biased face patching approach for age invariant face recognition using convolutional neural network [J] . International Journal of Intelligent Systems Technologies and Applications . 2020,第2期

机译：使用卷积神经网络的年龄不变性面部识别的偏置面部修补方法
4. Patch PlaNet: Landmark Recognition with Patch Classification Using Convolutional Neural Networks [C] . Kelvin B. da Cunha, Lucas Maggi, Veronica Teichrieb, SIBGRAPI Conference on Graphics, Patterns and Images . 2018

机译：补丁行星：使用卷积神经网络与补丁分类进行界标识别
5. Combining Convolutional Neural Networks and Graph Neural Networks for Image Classification [D] . Trivedy, Vivek. 2021

机译：结合卷积神经网络和图形神经网络的图像分类
6. Deep convolutional neural network-based patch classification for retinal nerve fiber layer defect detection in early glaucoma [O] . Rashmi Panda, Niladri B. Puhan, Aparna Rao, 2018

机译：基于深度卷积神经网络的斑块分类用于青光眼早期视网膜神经纤维层缺损的检测
7. Discriminant Patch Representation for RGB-D Face Recognition using Convolutional Neural Networks [O] . Nesrine Grati, Achraf Ben-Hamadou, Mohamed Hammami 2019

机译：使用卷积神经网络的RGB-D面部识别判别补丁表示

Patch PlaNet: Landmark Recognition with Patch Classification Using Convolutional Neural Networks

摘要

著录项

相似文献

相关主题

期刊订阅