首页> 外文会议>ACM/IEEE-CS joint conference on digital libraries >GROTOAP: GROund Truth for Open Access Publications
【24h】

GROTOAP: GROund Truth for Open Access Publications

机译:GROTOAP:适用于开放获取出版物的GROund真相

获取原文

摘要

The field of digital document content analysis includes many important tasks, for example page segmentation or zone classification. It is impossible to build effective solutions for such problems and evaluate their performance without a reliable test set, that contains both input documents and expected results of segmentation and classification. In this paper we present GROTOAP - a test set useful for training and performance evaluation of page segmentation and zone classification tasks. The test set contains input articles in a digital form and corresponding ground truth files. All input documents included in the test set have been selected from DOAJ database, which indexes articles published under CC-BY license. The whole test set is available under the same license.
机译:数字文档内容分析领域包括许多重要任务,例如页面分割或区域分类。如果没有可靠的测试集,其中既包含输入文档又包含细分和分类的预期结果,则无法针对此类问题构建有效的解决方案并评估其性能。在本文中,我们介绍了GROTOAP-一种可用于培训和评估页面分割和区域分类任务的性能的测试集。测试集包含数字形式的输入文章和相应的基本信息文件。测试集中包含的所有输入文档均已从DOAJ数据库中选择,该数据库对CC-BY许可下发布的文章进行了索引。整个测试仪可在同一许可证下获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号