【24h】

Images as bags of pixels

机译:图像作为像素的袋子

获取原文

摘要

We propose modeling images and related visual objects as bags of pixels or sets of vectors. For instance, gray scale images are modeled as a collection or bag of (X, Y, I) pixel vectors. This representation implies a permutational invariance over the bag of pixels, which is naturally handled by endowing each image with a permutation matrix. Each matrix permits the image to span a manifold of multiple configurations, capturing the vector set's invariance to orderings or permutation transformations. Permutation configurations are optimized while jointly modeling many images via maximum likelihood. The solution is a uniquely solvable convex program, which computes correspondence simultaneously for all images (as opposed to traditional pairwise correspondence solutions). Maximum likelihood performs a nonlinear dimensionality reduction, choosing permutations that compact the permuted image vectors into a volumetrically minimal subspace. This is highly suitable for principal components analysis which, when applied to the permutationally invariant bag of pixels representation, outperforms PCA on appearance-based vectorization by orders of magnitude. Furthermore, the bag of pixels subspace benefits from automatic correspondence estimation, giving rise to meaningful linear variations such as morphings, translations, and jointly spatio-textural image transformations. Results are shown for several datasets.
机译:我们建议将图像和相关的视觉物体建立为像素或矢量套件。例如,灰度图像被建模为(x,y,i)像素向量的集合或袋子。该表示暗示通过赋予置换矩阵的每个图像自然地处理的像素袋上的置性不变性。每个矩阵允许图像跨越多个配置的歧管,捕获向量集的传送到排序或排列变换的不变性。优化置换配置,同时通过最大可能性共同建模许多图像。该解决方案是一种独特的凸形程序,其同时计算所有图像(与传统成对对应解决方案相反)。最大可能性执行非线性维度降低,选择将允许的图像向量紧凑的排列到体积最小的子空间中。这是高度适用于主要成分分析,当应用于允许不变的像素表示袋时,通过数量级的基于外观的矢量化优异。此外,像素子空间袋来自自动对应估计,从而产生了有意义的线性变化,例如变形,翻译和共同的时空图像变换。结果显示为几个数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号