Computerized Counting of Individuals in Ottoman Population Registers with Deep Learning

机译：深度学习奥斯曼人口登记册中个人的计算机计数

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The digitalization of historical documents continues to gain pace for further processing and extract meanings from these documents. Page segmentation and layout analysis are crucial for historical document analysis systems. Errors in these steps will create difficulties in the information retrieval processes. Degradation of documents, digitization errors and varying layout styles complicate the segmentation of historical documents. The properties of Arabic scripts such as connected letters, ligatures, diacritics and different writing styles make it even more challenging to process Arabic historical documents. In this study, we developed an automatic system for counting registered individuals and assigning them to populated places by using a CNN-based architecture. To evaluate the performance of our system, we created a labeled dataset of registers obtained from the first wave of population registers of the Ottoman Empire held between the 1840s-1860s. We achieved promising results for classifying different types of objects and counting the individuals and assigning them to populated places.

机译：历史文献的数字化继续加快步伐，以进行进一步处理并从这些文献中提取含义。页面分段和布局分析对于历史文档分析系统至关重要。这些步骤中的错误将在信息检索过程中造成困难。文档的降级，数字化错误和不同的布局样式使历史文档的分割变得复杂。阿拉伯文字的属性，例如连接的字母，连字，变音符号和不同的写作风格，使得处理阿拉伯历史文献更具挑战性。在这项研究中，我们开发了一种自动系统，该系统可以使用基于CNN的体系结构对注册的个人进行计数并将其分配到人口稠密的地方。为了评估我们系统的性能，我们创建了一个带标签的寄存器数据集，该数据集是从1840年代至1860年代之间举行的奥斯曼帝国的第一批人口登记册获得的。在对不同类型的物体进行分类并对个体进行计数并将其分配到人口稠密的地方方面，我们取得了令人鼓舞的结果。

著录项

来源
《IAPR International Workshop on Document Analysis Systems》|2020年|277-290|共14页
会议地点
作者
Yekta Said Can; Mustafa Erdem Kabadayi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Page segmentation; Historical document analysis Convolutional Neural Networks; Arabic layout analysis;

机译：页面细分;历史文献分析卷积神经网络;阿拉伯语版式分析;

相似文献

外文文献
中文文献
专利

1. Deep learning algorithms use density-based crowd counting to track penguin populations [J] . Dennis Scimeca Vision Systems Design . 2020,第3期

机译：深度学习算法使用基于密度的人群计数来追踪企鹅种群
2. Method for identifying eligible individuals for a prevalence survey in the absence of a disease register or population register [J] . RichardsonA.K., ClarkeG., SabelC.E., Internal medicine journal . 2012,第11期

机译：在没有疾病登记簿或人口登记簿的情况下识别符合条件的个人进行流行病调查的方法
3. Detection of interannual population trends in seven herbivores from a West African savannah: a comparison between dung counts and direct counts of individuals [J] . African Journal of Ecology . 2017,第4期

机译：从西非大草原中检测七个食草动物中持续持续的人口趋势：粪便数量与个人直接计数的比较
4. Curation of Historical Arabic Handwritten Digit Datasets from Ottoman Population Registers: A Deep Transfer Learning Case Study [C] . Yekta Said Can, M. Erdem Kabadayı IEEE International Conference on Big Data . 2020

机译：奥斯曼人群寄存器的历史阿拉伯手写数字数据集的策划：深度转移学习案例研究
5. Empathy profiles and humanistic behavior scores of registered nurses who use computerized and non-computerized nursing documentation. [D] . Maakestad, Martha. 1993

机译：使用计算机化和非计算机化护理文档的注册护士的同理档案和人文行为评分。
6. Pheno‐Deep Counter: a unified and versatile deep learning architecture for leaf counting [O] . Mario Valerio Giuffrida, Peter Doerner, Sotirios A. Tsaftaris -1

机译：Pheno‐Deep计数器：用于叶子计数的统一且通用的深度学习架构
7. Learning to count: A deep learning framework for graphlet count estimation [O] . Xutong Liu, Yu-Zhen Janice Chen, John C. S. Lui, 2020

机译：学习数量：石墨数估计的深度学习框架

Computerized Counting of Individuals in Ottoman Population Registers with Deep Learning

摘要

著录项

相似文献

相关主题

期刊订阅