首页> 外文会议>International Joint Conference on Neural Networks >The neural-SIFT feature descriptor for visual vocabulary object recognition
【24h】

The neural-SIFT feature descriptor for visual vocabulary object recognition

机译:用于视觉词汇对象识别的神经SIFT特征描述符

获取原文

摘要

Recognizing the semantic content of an image is a challenging problem in computer vision. Many researchers attempt to apply local image descriptors to extract features from an image, but choosing the best type of feature to use is still an open problem. Some of these systems are only trained once using a fixed descriptor, like the Scale Invariant Feature Transform (SIFT). In most cases these algorithms show good performance, but they do not learn from their mistakes once training is completed. In this paper a continuous deep neural network feedback system is proposed which consists of an adaptive neural network feature descriptor, the bag of visual words approach and a neural classifier. Two initialization methods for the neural network feature descriptor were compared, one where it was trained on SIFT descriptor output and one where it was randomly initialized. After initial training, the system propagates the classification error from the neural network classifier through the entire pipeline, updating not only the classifier itself, but also the type of features to extract. Results show that for both initialization methods the feedback system increased accuracy substantially when regular training was not able to increase it any further. The proposed neural-SIFT feature descriptor performs better than the SIFT descriptor itself even with a limited number of training instances. Initializing on an existing feature descriptor is beneficial when not a lot of training samples are available. However, when there are a lot of training samples the system is able to construct a well-performing descriptor, solely based on classifier feedback.
机译:在计算机视觉中,识别图像的语义内容是一个具有挑战性的问题。许多研究人员试图应用局部图像描述符从图像中提取特征,但是选择要使用的最佳特征类型仍然是一个悬而未决的问题。其中一些系统仅使用固定描述符(例如尺度不变特征变换(SIFT))进行了一次训练。在大多数情况下,这些算法表现出良好的性能,但是一旦训练完成,它们就不会从错误中吸取教训。本文提出了一种连续的深度神经网络反馈系统,该系统由自适应神经网络特征描述符,视觉词袋方法和神经分类器组成。比较了神经网络特征描述符的两种初始化方法,一种是在SIFT描述符输出上训练的,另一种是随机初始化的。初步训练后,系统将神经网络分类器的分类错误传播到整个管道,不仅更新分类器本身,还更新要提取的特征类型。结果表明,对于两种初始化方法,当常规训练无法进一步提高反馈系统的准确性时,反馈系统就会大大提高其准确性。所提出的神经SIFT特征描述符即使在训练实例数量有限的情况下也比SIFT描述符本身具有更好的性能。当没有足够的训练样本时,对现有特征描述符进行初始化是有益的。但是,当有很多训练样本时,系统仅基于分类器反馈就能够构造出性能良好的描述符。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号