【24h】

Page segmentation and classification based on pattern-list analysis

机译:基于模式列表分析的页面细分和分类

获取原文

摘要

In this paper, a new algorithm based on pattern-list analysis is proposed for page segmentation and classification. There are three steps in the algorithm: the bounding rectangle location, the pattern formation and the pattern classification, after which the patterns that may be wrongly classified are further classified by their contextual information. Experimental results show the accuracy of the algorithm in segmenting text and non-text regions, especially for the case of document images with irregular-shaped halftone regions. The algorithm is valid only for binary document images.
机译:本文提出了一种基于模式列表分析的新算法,用于页面分割和分类。该算法包括三个步骤:边界矩形位置,图案形成和图案分类,然后根据其上下文信息对可能被错误分类的图案进行进一步分类。实验结果证明了该算法在分割文本和非文本区域时的准确性,特别是对于具有不规则形状的半色调区域的文档图像而言。该算法仅对二进制文档图像有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号