首页> 外文期刊>Journal of the Optical Society of America, A. Optics, image science, and vision >Enhancing spatio-chromatic representation with more-than-three color coding for image description
【24h】

Enhancing spatio-chromatic representation with more-than-three color coding for image description

机译:用更多三种颜色编码增强两种色彩表示,用于图像描述

获取原文
获取原文并翻译 | 示例
           

摘要

The extraction of spatio-chromatic features from color images is usually performed independently on each color channel. Usual 3D color spaces, such as RGB, present a high inter-channel correlation for natural images. This correlation can be reduced using color-opponent representations, but the spatial structure of regions with small color differences is not fully captured in two generic Red-Green and Blue-Yellow channels. To overcome these problems, we propose new color coding that is adapted to the specific content of each image. Our proposal is based on two steps: (a) setting the number of channels to the number of distinctive colors we find in each image (avoiding the problem of channel correlation), and (b) building a channel representation that maximizes contrast differences within each color channel (avoiding the problem of low local contrast). We call this approach more-than- three color coding (MTT) to emphasize the fact that the number of channels is adapted to the image content. The higher the color complexity of an image, the more channels can be used to represent it. Here we select distinctive colors as the most predominant in the image, which we call color pivots, and we build the new color coding strategy using these color pivots as a basis. To evaluate the proposed approach, we measure the efficiency in an image categorization task. We show how a generic descriptor improves performance at the description level when applied to the MTT coding. (C) 2017 Optical Society of America
机译:从彩色图像提取来自彩色图像的时空特征通常在每个颜色通道上独立地执行。通常的3D颜色空间,例如RGB,为自然图像提供高间间相关性。使用颜色对比表示可以减少这种相关性,但是在两个通用的红色和蓝色黄色通道中没有完全捕获具有小的颜色差异的区域的空间结构。为了克服这些问题,我们提出了适应每个图像的特定内容的新颜色编码。我们的提案基于两个步骤:(a)将通道数设置为在每个图像中发现的独特颜色的数量(避免信道相关问题),(b)构建最大化每个图像的信道表示,以最大化每个频道表示颜色通道(避免局部对比度低)。我们称之为三于三种颜色编码(MTT),以强调通道数适于图像内容的事实。图像的颜色复杂度越高,可以使用越多的通道代表它。在这里,我们选择独特的颜色作为图像中最主要的颜色,我们呼叫颜色枢轴,我们使用这些颜色枢轴构建新的颜色编码策略作为基础。为了评估所提出的方法,我们测量图像分类任务中的效率。我们展示了在应用于MTT编码时,通用描述符如何在描述级别提高性能。 (c)2017年光学学会

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号