首页> 外文学位 >Image sequence segmentation using multiple features and edge fusion: Its algorithm and VLSI architecture.
【24h】

Image sequence segmentation using multiple features and edge fusion: Its algorithm and VLSI architecture.

机译:使用多种功能和边缘融合的图像序列分割:其算法和VLSI架构。

获取原文
获取原文并翻译 | 示例

摘要

Semantic object representation is an important step for digital multimedia applications such as object-based coding, content-based access, and manipulations. This dissertation presents an image sequence segmentation algorithm and its VLSI architecture which provides initial region information for the video coding and the semantic object representation in image sequences. Our objective is to develop a hardware-friendly segmentation algorithm and its architecture by combining static and dynamic features simultaneously in one scheme.; In the initial stage of the algorithm, a multiple feature space is transformed to a label space by using the self-organizing feature maps (SOFM) neural networks. The next stage is an edge fusion in which edge information is incorporated into the neural network outputs to generate more precisely located boundaries of segmentation.; Pixel-based feature vectors consisting of three color, motion, and two texture features are extracted from two frames of an image sequence. These feature vectors are smoothed and normalized. A soft weighting scheme is applied to the normalized features. The weighting scheme suppresses unreliable feature components in a feature vector by making their values low. In order to generate the segmentation label space, the weighted multiple feature space is transformed to the one-dimensional label space using the SOFM neural networks. The oversegmented segmentation labels are further processed by incorporating edge information in order to generate segmented region boundaries closer to edges. The edge fusion is an iterative region merging process using a similarity criterion consisting of color difference, region geometry, and edge information between two regions. Experimental results for a variety of MPEG image sequences are evaluated and compared with an existing segmentation method to clarify the advantages of the proposed algorithm objectively.; The proposed algorithm differs from existing methods as followings: (1) it can segment textured images with low-dimensional texture features, (2) it leads to more meaningful segmentation region boundaries, and (3) it is easier to be mapped into hardware than existing methods.; Also, this dissertation proposes a VLSI segmentation architecture of the proposed algorithm. The proposed segmentation scheme is mapped into a dedicated hardware system. The dedicated special-purpose system consists of motion estimation, edge detection, edge linking, median and min filters, feature normalization and weighting, the systolic feature labeling, and edge fusion subsystems which can be easily mapped into systolic and pipelined architectures. Computational and hardware complexities of the proposed system architecture are estimated in terms of the number of clock cycles, arithmetic components and memory requirement. The proposed VLSI architecture makes it possible to perform image sequence segmentation in real-time.
机译:语义对象表示对于数字多媒体应用(例如,基于对象的编码,基于内容的访问和操作)而言,是重要的一步。本文提出了一种图像序列分割算法及其VLSI架构,为视频编码和图像序列中的语义对象表示提供了初始区域信息。我们的目标是通过在一个方案中同时组合静态和动态特征,开发一种硬件友好的分割算法及其体系结构。在算法的初始阶段,通过使用自组织特征图(SOFM)神经网络将多特征空间转换为标签空间。下一步是边缘融合,其中将边缘信息合并到神经网络输出中,以生成更精确定位的分割边界。从图像序列的两个帧中提取包含三个颜色,运动和两个纹理特征的基于像素的特征向量。这些特征向量经过平滑和归一化。将软加权方案应用于归一化特征。加权方案通过降低特征向量的值来抑制其不可靠。为了生成分割标签空间,使用SOFM神经网络将加权的多个特征空间转换为一维标签空间。通过合并边缘信息进一步处理过分割的分割标签,以便生成更靠近边缘的分割区域边界。边缘融合是使用包含色差,区域几何形状和两个区域之间的边缘信息的相似性准则进行的迭代区域合并过程。评估了各种MPEG图像序列的实验结果,并将其与现有的分割方法进行比较,从而客观地阐明了该算法的优势。所提出的算法与现有方法的不同之处在于:(1)它可以分割具有低维纹理特征的纹理图像;(2)它导致更有意义的分割区域边界;(3)比起映射,它更容易映射到硬件中现有方法。此外,本文提出了该算法的VLSI分割架构。所提出的分割方案被映射到专用硬件系统中。专用的专用系统包括运动估计,边缘检测,边缘链接,中值和最小值过滤器,特征归一化和权重,收缩期特征标记以及可以轻松映射到收缩期和流水线体系结构中的边缘融合子系统。根据时钟周期数,算术组件和内存需求,估算了所提出系统体系结构的计算和硬件复杂性。提出的VLSI架构使实时执行图像序列分割成为可能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号