Text alignment in early printed books combining deep learning and dynamic programming

Ziran Zahra; Pic Xavier; Innocenti Simone Undri; Mugnai Daniele; Marinai Simone



Abstract

We describe a technique for transcript alignment in early printed books that combines deep models with dynamic programming algorithms. Two object detection models, based on Faster R-CNN, are trained to locate words. We first train an initial model to recognize generic words and hyphens by using information about the number of words in text lines. Using this model's predictions on pages where a line-by-line ground-truth annotation is available, we train a second model able to detect landmark words. The alignment is then based on the identification of landmark words in pages where we only know the text corresponding to zones in the page. The proposed technique is evaluated on a publicly available digitization of the Gutenberg Bible, while the transcription is based on the Vulgata, a late 4th century Latin translation of the Bible. (C) 2020 Elsevier B.V. All rights reserved.
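The abstract does not state which dynamic programming formulation the authors use, so the following is only a minimal sketch of how detected landmark words could anchor a transcript to a sequence of word boxes. It assumes a Needleman-Wunsch-style global alignment; the names `pair_score` and `align`, the `GENERIC` label, and all score values are hypothetical choices made for illustration, not taken from the paper.

```python
from typing import List, Optional, Tuple

# Assumption: a detection is either a recognised landmark word (its text)
# or a generic word box with unknown text, represented here by None.
GENERIC = None


def pair_score(word: str, label: Optional[str]) -> int:
    """Score for assigning a transcript word to a detected box (illustrative values)."""
    if label is GENERIC:
        return 1                       # any word may occupy a generic box
    return 3 if label == word else -2  # landmark boxes should match their word


def align(transcript: List[str], labels: List[Optional[str]],
          gap: int = -1) -> List[Tuple[int, int]]:
    """Global alignment of transcript words to boxes given in reading order.

    Returns (transcript_index, box_index) pairs; unpaired items are gaps.
    """
    n, m = len(transcript), len(labels)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + pair_score(transcript[i - 1], labels[j - 1]),
                score[i - 1][j] + gap,   # transcript word with no box (missed detection)
                score[i][j - 1] + gap,   # box with no transcript word (false detection)
            )
    # Trace back from the bottom-right cell to recover one optimal alignment.
    pairs: List[Tuple[int, int]] = []
    i, j = n, m
    while i > 0 and j > 0:
        if score[i][j] == score[i - 1][j - 1] + pair_score(transcript[i - 1], labels[j - 1]):
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif score[i][j] == score[i - 1][j] + gap:
            i -= 1
        else:
            j -= 1
    pairs.reverse()
    return pairs


# Example: seven transcript words, six detected boxes (one word was missed),
# two of which were recognised as the landmark words "deus" and "et".
words = ["in", "principio", "creavit", "deus", "caelum", "et", "terram"]
boxes = [GENERIC, GENERIC, "deus", GENERIC, "et", GENERIC]
print(align(words, boxes))
```

In this sketch the landmark boxes pin the alignment in place, and the gap penalty absorbs the missed detection among the preceding generic boxes; which of those words is left unpaired depends on tie-breaking in the traceback.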


Bibliographic record

Source
Pattern Recognition Letters | 2020, Issue 5 | pp. 109-115 | 7 pages
Authors
Ziran Zahra; Pic Xavier; Innocenti Simone Undri; Mugnai Daniele; Marinai Simone
Author affiliations

Univ Florence, Florence, Italy;

Enseirb Matmeca, Bordeaux, France;

Univ Florence, Florence, Italy;

Univ Florence, Florence, Italy;

Univ Florence, Florence, Italy

Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords
Dynamic programming; Early printed books; Faster R-CNN; Object detection

Similar literature

1. Deep Learning Aided Fingerprint-Based Beam Alignment for mmWave Vehicular Communication [J]. Satyanarayana K., El-Hajjar Mohammed, Mourad Alain A. M. IEEE Transactions on Vehicular Technology. 2019, Issue 11
2. MII: A Novel Text Classification Model Combining Deep Active Learning with BERT [J]. Anman Zhang, Bohan Li, Wenhuan Wang. Computers, Materials & Continua. 2020, Issue 3
3. Dynamic Energy Management of a Microgrid Using Approximate Dynamic Programming and Deep Recurrent Neural Network Learning [J]. Zeng Peng, Li Hepeng, He Haibo. IEEE Transactions on Smart Grid. 2019, Issue 4
4. The Early Japanese Books Text Line Segmentation base on Image Processing and Deep Learning [C]. Bing Lyu, Ryo Akama, Hiroyuki Tomiyama. International Conference on Advanced Mechatronic Systems. 2019
5. Learning to draw, drawing to learn: Theory and practice in Italian printed drawing books, 1600--1700. [D]. Greist, Alexandra Arvilla. 2011
6. Comparing Comprehension of a Long Text Read in Print Book and on Kindle: Where in the Text and When in the Story? [O]. Anne Mangen, Gérard Olivier, Jean-Luc Velay 1979
7. The changing dynamics of teaching and learning spaces : where does the printed book fit? [O]. Burch, Tony, Nagy, Judith 2007
