首页> 外文会议> >Color document image segmentation for automated document entry systems
【24h】

Color document image segmentation for automated document entry systems

机译:用于自动文档输入系统的彩色文档图像分割

获取原文

摘要

Monochrome document image segmentation has been studied for over ten years. On the other hand, how to segment color document images is still an open research field. We propose an approach for segmenting color document images. Unlike the common practice in monochrome documents that objects are black on a white background the components in color documents can be any color. To cope with their variety, the first step of our approach is to create a binary image of edge-representation. Then page segmentation is carried out in the binary image using the CRLA procedure. Finally, we utilize the geometric features and the color information to classify the segmented blocks into text lines and picture components. The identified text lines are then further transformed into the white-background/black-text format for OCR processing. The proposed approach was implemented on a Pentium/l33 PC and the experimental results have demonstrated its feasibility.
机译:单色文档图像分割已经研究了十多年。另一方面,如何分割彩色文档图像仍然是一个开放的研究领域。我们提出了一种分割彩色文档图像的方法。与单色文档中的对象在白色背景上为黑色的单色做法不同,彩色文档中的成分可以是任何颜色。为了应对它们的多样性,我们方法的第一步是创建边缘表示的二进制图像。然后使用CRLA程序在二进制图像中执行页面分割。最后,我们利用几何特征和颜色信息将分割后的块分类为文本行和图片成分。然后将识别出的文本行进一步转换为白色背景/黑色文本格式,以进行OCR处理。所提出的方法是在奔腾/ 133 PC上实现的,实验结果证明了其可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号