To alleviate the problem that spatial position information is completely ignored in Bags of Words representa-tion, an optimized method based on multi-direction spatial Bags of Words is proposed. Firstly, the representation of spatial sub region of an image is modeled by spatial pyramid. Then, projection method is applied to local features of image blocks in horizontal, vertical and inclined ±45° , spatial structure information is well expressed in multi-direction. Fur-thermore, it reduces redundant effects of different object and the dimension of features by means of samples visual code-book. Finally, the proposed method is evaluated on two object databases, the results of comparative experiment show that the proposed algorithm has inspiring performance.%针对词袋模型完全忽略空间位置信息的问题,提出了一种多方向空间词袋模型的物体识别方法.该算法通过空间金字塔划分,形成图像的空间子区域特征表达;分别在水平、垂直和倾斜±45°上对图像局部特征向量进行投影,得到图像在多方向上的空间结构信息;采用样本视觉词典方法,既减少了不同物体类别样本带来的冗余影响,又降低了特征维数.在Caltech101和Caltech256物体库上进行了对比实验,实验结果验证了算法的有效性.
展开▼