首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >ICDAR2017 Competition on Recognition of Documents with Complex Layouts - RDCL2017
【24h】

ICDAR2017 Competition on Recognition of Documents with Complex Layouts - RDCL2017

机译:ICDAR2017以复杂布局识别文件的竞争 - RDCL2017

获取原文

摘要

This paper presents an objective comparative evaluation of page segmentation and region classification methods for documents with complex layouts. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2017, presenting the results of the evaluation of seven methods - five submitted, two state-of-the-art systems (commercial and open-source). Three scenarios are reported in this paper, one evaluating the ability of methods to accurately segment regions and two evaluating both segmentation and region classification (one focusing only on text regions). For the first time, nested region content (table cells, chart labels etc.) are evaluated in addition to the top-level page content. Text recognition was a bonus challenge and was not taken up by all participants. The results indicate that an innovative approach has a clear advantage but there is still a considerable need to develop robust methods that deal with layout challenges, especially with the non-textual content.
机译:本文介绍了具有复杂布局的文件的页面细分和区域分类方法的客观比较评估。它描述了在ICDAR2017的背景下举行的竞争(Modus Operandi,DataSet和评估方法),呈现了七种方法的评估结果 - 五个提交的,两个最先进的系统(商业和开源)。本文报告了三种情况,一个方案,一种评估方法的方法,准确分段区域和两个评估分割和区域分类(仅在文本区域上的一个注重)。首次,除了顶级页面内容之外,还会评估嵌套区域内容(表格单元格,图表等)。文本认可是奖金挑战,所有参与者都没有被占用。结果表明,一种创新的方法具有明显的优势,但仍有相当多的需要开发处理布局挑战的强大方法,特别是在非文本内容中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号