首页> 外文期刊>International Journal on Document Analysis and Recognition >Combination of deep neural networks and logical rules for record segmentation in historical handwritten registers using few examples
【24h】

Combination of deep neural networks and logical rules for record segmentation in historical handwritten registers using few examples

机译:利用少数例子,历史手写寄存器中录制分割的深度神经网络和逻辑规则的组合

获取原文
获取原文并翻译 | 示例
       

摘要

This work focuses on the layout analysis of historical handwritten registers, in which local religious ceremonies were recorded. The aim of this work is to delimit each record using few available training data. To this end, two approaches are proposed. Firstly, three state-of-the-art object detection networks are explored and compared. Further experiments are then conducted on Mask R-CNN, as it yields the best performance. Secondly, we introduce and investigate Deep&Syntax, a hybrid system that takes advantages of recurrent patterns to delimit each record, by combining u-shaped networks and logical rules. Finally, these two approaches are evaluated on 3708 French records (sixteenth-eighteenth centuries), as well as on the Esposalles public database, containing 253 Spanish records (seventeenth century). While both systems perform well on homogeneous documents, we observe a significant drop in performance with Mask R-CNN on more challenging documents, especially when trained on a small, non-representative subset. By contrast, Deep&Syntax relies on steady patterns and is therefore able to process a wider range of documents with less training data. When both systems are trained on 120 documents, Deep&Syntax produces 15% more match configurations and reduces the ZoneMap surface error metric by 30%. It also outperforms Mask R-CNN when trained on a database three times smaller. As Deep&Syntax generalizes better, we believe it can be used for massive parish register processing, as collecting and annotating a sufficiently large and representative set of training data is not always achievable.
机译:这项工作侧重于历史手写寄存器的布局分析,其中记录了当地宗教仪式。这项工作的目的是使用一些可用的培训数据来分隔每个记录。为此,提出了两种方法。首先,探讨了三个最先进的对象检测网络并进行比较。然后在掩模R-CNN上进行进一步的实验,因为它产生了最佳性能。其次,我们通过组合U形网络和逻辑规则来介绍和调查深度和语法,这是一种混合系统,该混合系统采用反复模式来分隔每个记录。最后,这两种方法是在3708法国记录(第十八世纪)以及esposalles公共数据库上进行评估,其中包含253名西班牙语记录(十七世纪)。虽然两个系统在同类文件上表现良好,但我们在更具挑战性文件中观察到掩模R-CNN的性能显着下降,特别是当培训在小型非代表性的子集上时。相比之下,深度和语法依赖于稳定模式,因此能够处理具有较少培训数据的更广泛的文档。当两个系统接受120个文档培训时,Deep&Syntax会产生15%的匹配配置,并将Zonemap表面误差度量减少30%。当在数据库中训练三次时,它还优于掩模R-CNN。随着深度和语法呈现更好,我们认为它可以用于大规模教区寄存器处理,因为收集和注释了足够大而代表性的训练数据并不总是可以实现的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号