...
首页> 外文期刊>Journal of Computers >Bag-of-Words and Region-Based Feature Representations in Object Categorization: A Comparative Study
【24h】

Bag-of-Words and Region-Based Feature Representations in Object Categorization: A Comparative Study

机译:对象分类中的词袋和基于区域的特征表示:比较研究

获取原文
           

摘要

The aim of object categorization is to find a given object in an image and the performance of object categorization heavily depends on the extracted features as the image descriptor. In the literature, feature representation can be broadly classified into block/region-based and bag-of-words (BoW) features. However, there is no a comparative study of using these different feature representations over different datasets and different image scales since the image sizes for object recognition are varying from different datasets. Our experimental results using the Corel and PASCAL datasets show that when images contain more complex scenes like Corel images, the block-based feature is a better choice. In addition, the larger the image scales, the better the recognition performance. On the contrary, when images contain fewer objects like PASCAL images, it is better to consider the region-based feature representation. Particularly, reducing the image scale does not degrade the recognition performance; it even shows some level of improvement. On the other hand, although the BoW feature does not perform better than the block/region based features, it shows stable performances over different datasets and different image scales. This indicates that when the chosen dataset contains a large amount of images having various types of contents, which is difficult to decide what features to be extracted, the BoW feature can be extracted as the baseline feature representation.
机译:对象分类的目的是在图像中找到给定的对象,并且对象分类的性能在很大程度上取决于提取的特征作为图像描述符。在文献中,特征表示可大致分为基于块/区域的特征和词袋(BoW)特征。但是,由于在不同的数据集上用于对象识别的图像大小有所不同,因此没有在不同的数据集和不同的图像比例上使用这些不同的特征表示的比较研究。我们使用Corel和PASCAL数据集的实验结果表明,当图像包含Corel图像等更复杂的场景时,基于块的功能是更好的选择。另外,图像比例越大,识别性能越好。相反,当图像包含较少的对象(如PASCAL图像)时,最好考虑基于区域的特征表示。特别地,减小图像尺寸不会降低识别性能。它甚至显示出一定程度的改进。另一方面,尽管BoW功能的性能不比基于块/区域的功能好,但它在不同的数据集和不同的图像比例上显示出稳定的性能。这表示当所选择的数据集包含具有各种内容类型的大量图像时,这很难决定要提取的特征,可以将BoW特征提取为基线特征表示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号