首页> 外文会议>ACM/IEEE-CS joint conference on digital libraries >GROTOAP: GROund Truth for Open Access Publications
【24h】

GROTOAP: GROund Truth for Open Access Publications

机译:GROTOAP:开放访问出版物的地面真相

获取原文

摘要

The field of digital document content analysis includes many important tasks, for example page segmentation or zone classification. It is impossible to build effective solutions for such problems and evaluate their performance without a reliable test set, that contains both input documents and expected results of segmentation and classification. In this paper we present GROTOAP - a test set useful for training and performance evaluation of page segmentation and zone classification tasks. The test set contains input articles in a digital form and corresponding ground truth files. All input documents included in the test set have been selected from DOAJ database, which indexes articles published under CC-BY license. The whole test set is available under the same license.
机译:数字文档内容分析领域包括许多重要任务,例如页面分段或区域分类。对于此类问题构建有效的解决方案并在没有可靠的测试集的情况下评估其性能,其中包含输入文档和分段和分类的预期结果。在本文中,我们呈现Grotoap - 一种用于培训和绩效评估的测试集,可以对页面分割和区域分类任务。测试集包含数字表单和相应的地面真实文件中的输入文章。测试集中包含的所有输入文档已选自DOAJ数据库,该数据库,其索引在CC-By许可下发布的文章。整个测试集可根据同一许可提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号