【24h】

High Dimensional Image Categorization

机译:高维图像分类

获取原文

摘要

We are interested in varying the vocabulary size in the image categorization task with a bag-of-visualwords to investigate its influence on the classification accuracy in two cases: in the first one, both the test-set and the training set contains the same objects (with only different view points in the test-set) and the second one where objects in the test-set do not appear at all in the training set (only other objects from the same category appear). In order to perform these tasks, we need to scale-up the algorithms used to deal with millions data points in hundred of thousand dimensions. We present k-means (used in the quantization step) and SVM (used in the classification step) algorithms extended to deal with very large datasets. These new incremental and parallel algorithms can be used on various distributed architectures, like multi-thread computer, cluster or GPU (graphics processing units). The efficiency of the approach is shown with the categorization of the 3D-Dataset from Savarese and Fei-Fei containing about 6700 images of 3D objects from 10 different classes. The obtained incremental and parallel SVM algorithm is several orders of magnitude faster than usual ones (like lib-SVM, SVMperf or CB-SVM) and the incremental and parallel kmeans is at least one order of magnitude faster than usual implementations.
机译:我们有兴趣在图像分类任务中使用袋visualword在两种情况下调查其对分类准确性的影响:在第一个,测试集和训练集中包含相同的对象(测试集中只有不同的视点),并且在训练集中根本不会出现测试集中的对象(仅显示来自同一类别的其他对象)。为了执行这些任务,我们需要扩展用于处理数百万维维度的数百万数据点的算法。我们呈现K-means(在量化步骤中使用)和SVM(在分类步骤中使用)算法扩展以处理非常大的数据集。这些新的增量和并行算法可用于各种分布式架构,如多线程计算机,群集或GPU(图形处理单元)。从Savarese和Fei-Fei的3D-DataSet分类显示了该方法的效率,其中包含来自10个不同类的3D对象的约6700张图像。获得的增量和并行SVM算法是比通常的速度快(如lib-svm,svmperf或cb-svm)更快的数量级,并且增量和平行kemeans比通常的实现更快地至少一个幅度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号