首页> 外文期刊>International Journal on Document Analysis and Recognition >Mean-Shift segmentation and PDE-based nonlinear diffusion: toward a common variational framework for foreground/background document image segmentation
【24h】

Mean-Shift segmentation and PDE-based nonlinear diffusion: toward a common variational framework for foreground/background document image segmentation

机译:均值漂移分割和基于PDE的非线性扩散:朝着前景/背景文档图像分割的通用变体框架

获取原文
获取原文并翻译 | 示例
           

摘要

The presence of noise in images of degraded documents limits the direct application of segmentation approaches and can lead to the presence of a number of different artifacts in the final segmented image. A possible solution is the integration of a pre-filtering step which may improve the segmentation quality through the reduction of such noise. This study demonstrated that combining the Mean-Shift clustering algorithm and the tensor-driven diffusion process into a joint iterative framework produced promising results. For instance, this framework generates segmented images with reduced edge and background artifacts when compared to results obtained after applying each method separately. This improvement is explained by the mutual interaction of global and local information, introduced, respectively, by the Mean-Shift and the anisotropic diffusion. Another point of note is that the anisotropic diffusion process smoothed images while preserving edge continuities. The convergence of this framework was defined automatically under a stopping criterion not previously defined when the diffusion process was applied alone. To obtain a fast convergence, the common framework utilizes the speedup algorithm of the Fukunaga and Hostetler Mean-Shift formulation already proposed by Lebourgeois et al. (International Conference on Document Analysis and Recognition (ICDAR), pp 52-56, 2013). This new variant of the Mean-Shift algorithm produced similar results to the original one, but ran faster due to the application of the integral volume. The first application of this framework was document ink bleed-through removal where noise is stemmed from the interference of the verso side on the recto side, thus perturbing the legibility of the original text. Other categories of images could also be subjected to the proposed framework application.
机译:降级文档图像中噪声的存在限制了分割方法的直接应用,并可能导致最终分割图像中存在许多不同的伪像。可能的解决方案是集成预过滤步骤,该步骤可以通过减少此类噪声来提高分割质量。这项研究表明,将均值漂移聚类算法和张量驱动的扩散过程组合到联合迭代框架中可产生可喜的结果。例如,与分别应用每种方法后获得的结果相比,此框架可生成具有减少的边缘和背景伪像的分割图像。整体信息和局部信息的相互影响解释了这种改进,分别通过均值漂移和各向异性扩散引入。另一个要注意的是,各向异性扩散过程使图像平滑,同时保留了边缘连续性。当单独应用扩散过程时,该框架的收敛是根据先前未定义的停止条件自动定义的。为了获得快速收敛,通用框架利用Lebourgeois等人已经提出的Fukunaga和Hostetler Mean-Shift公式的加速算法。 (国际文档分析和识别会议(ICDAR),第52-56页,2013年)。 Mean-Shift算法的这一新变体产生了与原始算法相似的结果,但由于积分体积的应用,运行速度更快。该框架的第一个应用是消除文档墨水的渗色,其中噪音是由于背面的正反面干扰而产生的,从而干扰了原始文本的可读性。其他类别的图像也可以接受建议的框架应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号