首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >ICDAR2017 Competition on Layout Analysis for Challenging Medieval Manuscripts
【24h】

ICDAR2017 Competition on Layout Analysis for Challenging Medieval Manuscripts

机译:ICDAR2017挑战中世纪手稿版面分析大赛

获取原文

摘要

This paper reports on the ICDAR2017 Competition on Layout Analysis for Challenging Medieval Manuscripts (HisDoc-Layout-Comp) and provides further details and discussions. In this competition we introduce a new challenging dataset and state-of-the-art benchmark results for pixel-labelling and text line segmentation. The DIVA-HisDB comprises medieval manuscripts with complex layout in contrast to previous datasets, where rectangular text blocks and only a few decorative elements exist. In particular, the images of this competition contain many interlinear and marginal glosses as well as texts in various sizes and decorated letters. This makes the distinction of the four target labels (text, comment, decoration, and background) more difficult. In addition, to reflect the needs of scholars in the humanities, we request multi-labeling of certain regions (decorated text as text and decoration). Furthermore, we measure not just the accuracy, but the Intersection over Union (IU) of pixel sets, which better reflects the real performance. Indeed, in our results we observe that the accuracy appears to be rather high, but the IU reveals, that there is still room for improvement. For the task of line segmentation, the recognition results are rather low (overall error higher than 5%). Noteworthy, a combination of the best layout analysis method with an adapted seam-carving based method achieves better results than the best contestant.
机译:本文报道了ICDAR2017挑战性中世纪手稿布局分析大赛(HisDoc-Layout-Comp),并提供了更多详细信息和讨论。在本次比赛中,我们引入了一个新的具有挑战性的数据集以及用于像素标记和文本行分割的最新基准测试结果。 DIVA-HisDB包含中世纪手稿,与以前的数据集相比,中世纪的手稿具有复杂的布局,在以前的数据集中,矩形文本块和少量装饰元素存在。特别是,这场比赛的图像包含许多线性的和边缘的光泽感,以及各种大小和修饰字母的文字。这使得区分四个目标标签(文本,注释,修饰和背景)更加困难。此外,为了反映人文学科领域的学者的需求,我们要求对某些区域进行多重标记(装饰文字作为文字和装饰)。此外,我们不仅测量精度,而且还测量像素集的并集交集(IU),它可以更好地反映真实性能。确实,在我们的结果中,我们观察到准确性似乎很高,但是IU显示,仍有改进的空间。对于线段分割任务,识别结果相当低(总误差高于5%)。值得注意的是,最佳布局分析方法与基于缝隙雕刻的适应方法相结合,可获得比最佳参赛者更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号