Combination of deep neural networks and logical rules for record segmentation in historical handwritten registers using few examples

Tarride Solene; Lemaitre Aurelie; Couasnon Bertrand; Tardivel Sophie


Abstract

This work focuses on the layout analysis of historical handwritten registers in which local religious ceremonies were recorded. The aim is to delimit each record using little available training data. To this end, two approaches are proposed. Firstly, three state-of-the-art object detection networks are explored and compared. Further experiments are then conducted on Mask R-CNN, as it yields the best performance. Secondly, we introduce and investigate Deep&Syntax, a hybrid system that takes advantage of recurrent patterns to delimit each record by combining U-shaped networks and logical rules. Finally, these two approaches are evaluated on 3708 French records (sixteenth to eighteenth centuries), as well as on the public Esposalles database, which contains 253 Spanish records (seventeenth century). While both systems perform well on homogeneous documents, we observe a significant drop in performance with Mask R-CNN on more challenging documents, especially when it is trained on a small, non-representative subset. By contrast, Deep&Syntax relies on steady patterns and is therefore able to process a wider range of documents with less training data. When both systems are trained on 120 documents, Deep&Syntax produces 15% more match configurations and reduces the ZoneMap surface error metric by 30%. It also outperforms Mask R-CNN when trained on a database three times smaller. As Deep&Syntax generalizes better, we believe it can be used for large-scale parish register processing, since collecting and annotating a sufficiently large and representative set of training data is not always achievable.


Bibliographic details

Source
International Journal on Document Analysis and Recognition | 2021, Issue 2 | pp. 77-96 | 20 pages
Authors
Tarride Solene; Lemaitre Aurelie; Couasnon Bertrand; Tardivel Sophie
Author affiliations

Doptim, Rennes, France | Univ Rennes, CNRS, IRISA, Rennes, France;

Univ Rennes, CNRS, IRISA, Rennes, France;

Univ Rennes, CNRS, IRISA, Rennes, France;

Doptim, Rennes, France

Original format: PDF
Language: English
Keywords
Historical handwritten documents; Deep neural networks; Hybrid systems; Layout analysis

