Conference: IAPR International Conference on Pattern Recognition

A fast and efficient method for extracting text paragraphs and graphics from unconstrained documents


Abstract

The paper outlines a fast and efficient method for extracting graphics and text paragraphs from printed documents. The method is based on a bottom-up approach to document analysis and achieves very good performance in most cases. During preprocessing, characters are linked together to form blocks. The resulting blocks are segmented, labelled, and merged into paragraphs. Simultaneously, graphics are extracted from the image. Algorithms for each processing step are presented, and experimental results are included.
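The abstract does not give the algorithms themselves, so the following is only a minimal sketch of a generic bottom-up pipeline of the kind described, not the authors' method. It assumes a binary page image stored as a list of rows (1 = ink), and it swaps in two standard techniques: horizontal run-length smoothing to link characters into blocks, and 4-connected component labelling to obtain block bounding boxes. The function names and the thresholds `gap`, `max_text_height`, and `max_line_gap` are hypothetical, chosen for illustration.

```python
from collections import deque

def smooth_rows(img, gap):
    """Link nearby characters: fill background runs of at most `gap` pixels
    that lie between two ink pixels on the same row (assumed linking rule)."""
    out = [row[:] for row in img]
    for row in out:
        last = None  # column of the most recent ink pixel
        for x, v in enumerate(row):
            if v:
                if last is not None and 0 < x - last - 1 <= gap:
                    for k in range(last + 1, x):
                        row[k] = 1
                last = x
    return out

def label_blocks(img):
    """4-connected component labelling; returns boxes (x0, y0, x1, y1)."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if img[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue = deque([(x, y)])
                x0 = x1 = x
                y0 = y1 = y
                while queue:
                    cx, cy = queue.popleft()
                    x0, x1 = min(x0, cx), max(x1, cx)
                    y0, y1 = min(y0, cy), max(y1, cy)
                    for nx, ny in ((cx + 1, cy), (cx - 1, cy),
                                   (cx, cy + 1), (cx, cy - 1)):
                        if (0 <= nx < w and 0 <= ny < h
                                and img[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
                boxes.append((x0, y0, x1, y1))
    return boxes

def split_text_graphics(boxes, max_text_height):
    """Assumed heuristic: blocks taller than a text line count as graphics."""
    text = [b for b in boxes if b[3] - b[1] + 1 <= max_text_height]
    graphics = [b for b in boxes if b[3] - b[1] + 1 > max_text_height]
    return text, graphics

def merge_paragraphs(lines, max_line_gap):
    """Merge text-line boxes that overlap horizontally and are separated
    by at most `max_line_gap` blank rows."""
    paragraphs = []
    for box in sorted(lines, key=lambda b: b[1]):
        for i, p in enumerate(paragraphs):
            overlaps = box[0] <= p[2] and p[0] <= box[2]
            close = box[1] - p[3] - 1 <= max_line_gap
            if overlaps and close:
                paragraphs[i] = (min(p[0], box[0]), min(p[1], box[1]),
                                 max(p[2], box[2]), max(p[3], box[3]))
                break
        else:
            paragraphs.append(box)
    return paragraphs

def extract(img, gap=2, max_text_height=2, max_line_gap=1):
    """Run the sketch end to end; returns (paragraph boxes, graphic boxes)."""
    blocks = label_blocks(smooth_rows(img, gap))
    text, graphics = split_text_graphics(blocks, max_text_height)
    return merge_paragraphs(text, max_line_gap), graphics
```

Calling `extract(page)` on a binary image returns two lists of bounding boxes, one for paragraphs and one for graphics. Separating text from graphics by block height alone is a deliberate simplification; a practical system would likely also use features such as aspect ratio or ink density.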
