...
首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >A multi-plane approach for text segmentation of complex document images
【24h】

A multi-plane approach for text segmentation of complex document images

机译:用于复杂文档图像的文本分割的多平面方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This study presents a new method, namely the multiplane segmentation approach, for segmenting and extracting textual objects from various real-life complex document images. The proposed multi-plane segmentation approach first decomposes the document image into distinct object planes to extract and separate homogeneous objects including textual regions of interest, non-text objects such as graphics and pictures, and background textures. This process consists of two stages-localized histogram multilevel thresholding and multi-plane region matching and assembling. Then a text extraction procedure is applied Oil the resultant planes to detect and extract textual objects with different characteristics in the respective planes. The proposed approach processes document images regionally and adaptively according to their respective local features. Hence detailed characteristics of the extracted textual objects, Particularly small characters with thin strokes, as well as gradational illuminations of characters, can be well-preserved. Moreover, this way also allows background objects with uneven, gradational, and sharp variations in contrast, illumination, and texture to be handled easily and well. Experimental results on real-life complex document images demonstrate that the proposed approach is effective in extracting textual objects with Various illuminations, sizes, and font styles from various types of complex document images.
机译:这项研究提出了一种新方法,即多平面分割方法,用于从各种现实生活中的复杂文档图像中分割和提取文本对象。提出的多平面分割方法首先将文档图像分解为不同的对象平面,以提取和分离同构对象,包括感兴趣的文本区域,诸如图形和图片之类的非文本对象以及背景纹理。该过程包括两个阶段:局部直方图多级阈值化和多平面区域匹配与组装。然后,应用文本提取程序为所得平面上油,以检测和提取在各个平面中具有不同特征的文本对象。所提出的方法根据其各自的局部特征来局部地和自适应地处理文档图像。因此,可以很好地保留所提取文本对象的详细特征,特别是笔触细小的小字符以及字符的层次照亮。而且,这种方式还允许容易且良好地处理对比度,照度和纹理具有不均匀,渐变和急剧变化的背景对象。在现实生活中的复杂文档图像上的实验结果表明,该方法可有效地从各种类型的复杂文档图像中提取具有各种照明,大小和字体样式的文本对象。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号