Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimizing Global Loss Functions

Abstract

Recent innovations in training deep convolutional neural network (ConvNet) models have motivated the design of new methods to automatically learn local image descriptors. The latest deep ConvNets proposed for this task consist of a siamese network that is trained by penalising misclassification of pairs of local image patches. Current results from machine learning show that replacing this siamese by a triplet network can improve the classification accuracy in several problems, but this has yet to be demonstrated for local image descriptor learning. Moreover, current siamese and triplet networks have been trained with stochastic gradient descent that computes the gradient from individual pairs or triplets of local image patches, which can make them prone to overfitting. In this paper, we first propose the use of triplet networks for the problem of local image descriptor learning. Furthermore, we also propose the use of a global loss that minimises the overall classification error in the training set, which can improve the generalisation capability of the model. Using the UBC benchmark dataset for comparing local image descriptors, we show that the triplet network produces a more accurate embedding than the siamese network in terms of the UBC dataset errors. Moreover, we also demonstrate that a combination of the triplet and global losses produces the best embedding in the field, using this triplet network. Finally, we also show that the use of the central-surround siamese network trained with the global loss produces the best result of the field on the UBC dataset.
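
The contrast the abstract draws between a per-triplet loss and a loss computed over the whole training set can be sketched in a few lines. The abstract gives no formulas, so both forms below are assumptions made for illustration: a standard hinge on each triplet, and a batch-level loss that penalises the variance of matching and non-matching distances plus any overlap between their means. The function names, the `margin` and `lam` parameters, and the toy descriptors are all hypothetical.

```python
import math
from statistics import mean, pvariance


def l2_distance(u, v):
    """Euclidean distance between two equal-length descriptor vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_hinge_loss(anchor, positive, negative, margin=1.0):
    """Per-triplet loss (assumed form): zero once the non-matching distance
    exceeds the matching distance by at least `margin`."""
    d_pos = l2_distance(anchor, positive)
    d_neg = l2_distance(anchor, negative)
    return max(0.0, margin + d_pos - d_neg)


def global_loss(triplets, margin=1.0, lam=1.0):
    """Batch-level loss on distance statistics (assumed form): the variance
    of matching and of non-matching distances, plus a hinge that keeps the
    mean matching distance below the mean non-matching distance by `margin`."""
    d_pos = [l2_distance(a, p) for a, p, _ in triplets]
    d_neg = [l2_distance(a, n) for a, _, n in triplets]
    spread = pvariance(d_pos) + pvariance(d_neg)
    overlap = max(0.0, mean(d_pos) - mean(d_neg) + margin)
    return spread + lam * overlap


# Hypothetical 2-D descriptors (anchor, positive, negative), made up for illustration.
batch = [
    ((0.0, 0.0), (0.0, 1.0), (3.0, 4.0)),  # d_pos = 1, d_neg = 5
    ((1.0, 1.0), (1.0, 3.0), (4.0, 5.0)),  # d_pos = 2, d_neg = 5
]
per_triplet = [triplet_hinge_loss(a, p, n) for a, p, n in batch]
batch_level = global_loss(batch)
```

In this toy batch both per-triplet hinge terms are already zero, while the batch-level value is 0.25 because the matching distances still differ from one another. This is one way a loss over aggregate statistics can keep providing a training signal after every individual triplet satisfies its margin.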

Bibliographic details

Source
IEEE Conference on Computer Vision and Pattern Recognition | 2016 | pp. 5385-5394 | 10 pages
Authors
Vijay Kumar B G; Gustavo Carneiro; Ian Reid;
Original format: PDF
Keywords
Training; Measurement; Transforms; Optimization; Linear programming; Benchmark testing; Eigenvalues and eigenfunctions;

Similar literature

1. Compact descriptors for sketch-based image retrieval using a triplet loss convolutional neural network [J]. T. Bui, L. Ribeiro, M. Ponti. Computer Vision and Image Understanding. 2017, issue NOVa
2. Denoising of 3D Brain MR Images with Parallel Residual Learning of Convolutional Neural Network Using Global and Local Feature Extraction [J]. Liang Wu, Shunbo Hu, Changchun Liu. Computational Intelligence and Neuroscience. 2021, issue a
3. Anatomical-functional image fusion based on deep convolution neural networks in local Laplacian pyramid domain [J]. Huang Yuping, Li Weisheng, Du Jiao. International Journal of Imaging Systems and Technology. 2021, issue 3
4. Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimizing Global Loss Functions [C]. Vijay Kumar B G, Gustavo Carneiro, Ian Reid. IEEE Conference on Computer Vision and Pattern Recognition. 2016
5. The Effect of an Additive Loss Function on a Siamese Convolutional Neural Network for Re-Identification Systems [D]. Millar, Matthew Charles. 2020
6. Denoising of 3D Brain MR Images with Parallel Residual Learning of Convolutional Neural Network Using Global and Local Feature Extraction [O]. Liang Wu, Shunbo Hu, Changchun Liu. 2021
7. Compact descriptors for sketch-based image retrieval using a triplet loss convolutional neural network [O]. T. Bui, L. Ribeiro, M. Ponti. 2017
