International Workshop on Document Analysis Systems

Ground Truth for Layout Analysis Performance Evaluation

Abstract

Over the past two decades a significant number of layout analysis (page segmentation and region classification) approaches have been proposed in the literature. Each approach has been devised for and/or evaluated using (usually small) application-specific datasets. While the need for objective performance evaluation of layout analysis algorithms is evident, there does not exist a suitable dataset with ground truth that reflects the realities of everyday documents (widely varying layouts, complex entities, colour, noise etc.). The most significant impediment is the creation of accurate and flexible (in representation) ground truth, a task that is costly and must be carefully designed. This paper discusses the issues related to the design, representation and creation of ground truth in the context of a realistic dataset developed by the authors. The effectiveness of the ground truth discussed in this paper has been successfully shown in its use for two international page segmentation competitions (ICDAR2003 and ICDAR2005).