首页> 外文期刊>Pattern Analysis and Applications >Image-based logical document structure recognition
【24h】

Image-based logical document structure recognition

机译:基于图像的逻辑文档结构识别

获取原文
获取原文并翻译 | 示例

摘要

The paper presents a complete solution for recognition of textual and graphic structures in various types of documents acquired from the Internet. In the proposed approach, the document structure recognition problem is divided into sub-problems. The first one is localizing logical structure elements within the document. The second one is recognizing segmented logical structure elements. The input to the method is an image of document page, the output is the XML file containing all graphic and textual elements included in the document, preserving the reading order of document blocks. This file contains information about the identity and position of all logical elements in the document image. The paper describes all details of the proposed method and shows the results of the experiments validating its effectiveness. The results of the proposed method for paragraph structure recognition are comparable to the referenced methods which offer segmentation only.
机译:本文提出了一个完整的解决方案,用于识别从Internet获取的各种类型文档中的文本和图形结构。在所提出的方法中,文档结构识别问题被分为子问题。第一个是在文档中本地化逻辑结构元素。第二个是识别分段的逻辑结构元素。该方法的输入是文档页面的图像,输出是包含文档中包含的所有图形和文本元素的XML文件,从而保留了文档块的读取顺序。该文件包含有关文档图像中所有逻辑元素的标识和位置的信息。本文介绍了该方法的所有细节,并显示了验证其有效性的实验结果。所提出的用于段落结构识别的方法的结果与仅提供分段的参考方法具有可比性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号