首页> 外文OA文献 >Correcting the Document Layout: A Machine Learning Approach
【2h】

Correcting the Document Layout: A Machine Learning Approach

机译:纠正文档布局:一种机器学习方法

摘要

In this paper, a machine learning approach to support the user during the correction of the layout analysis is proposed. Layout analysis is the process of extracting a hierarchical structure describing the layout of a page. In our approach, the layout analysis is performed in two steps: firstly, the global analysis determines possible areas containing paragraphs, sections, columns, figures and tables, and secondly, the local analysis groups together blocks that possibly fall within the same area. The result of the local analysis process strongly depends on the quality of the results of the first step. We investigate the possibility of supporting the user during the correction of the results of the global analysis. This is done by allowing the user to correct the results of the global analysis and then by learning rules for layout correction from the sequence of user actions. Experimental results on a set of multi-page documents are reported and commented.
机译:本文提出了一种在布局分析的校正过程中为用户提供支持的机器学习方法。布局分析是提取描述页面布局的层次结构的过程。在我们的方法中,布局分析分两个步骤进行:首先,全局分析确定包含段落,节,列,图形和表格的可能区域,其次,局部分析组将可能属于同一区域的块放在一起。本地分析过程的结果在很大程度上取决于第一步结果的质量。我们调查在更正全局分析结果期间支持用户的可能性。通过允许用户校正全局分析的结果,然后从用户操作序列中学习用于布局校正的规则,可以完成此操作。报告并评论了一组多页文档的实验结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号