A Binarization-Free Clustering Approach to Segment Curved Text Lines in Historical Manuscripts

Abstract

Text line segmentation is one of the main parts of document image analysis; it provides crucial information for automated reading, word spotting, alignment between image and transcription, and indexing of documents. Yet it remains an open problem for handwritten historical documents because of complex layouts on the one hand, such as curved and touching text lines, and binarization problems on the other hand, caused by ornaments, wrinkles, stains, holes, etc. In this paper, we propose a binarization-free clustering method for text line segmentation that is able to cope not only with touching text lines but also with complex baseline curvature. Rather than assuming straight baselines, the method groups small interest point clusters into text lines based on their local orientation. Experiments conducted on artificially distorted images of the Saint Gall database show promising results.

Bibliographic record

Source
International Conference on Document Analysis and Recognition | 2013 | 5 pages
Authors
Garz Angelika; Fischer Andreas; Bunke Horst; Ingold Rolf
Original format: PDF
Chinese Library Classification: TP391.41
Keywords
curved lines; historical documents; local features; text line segmentation;

Similar literature

1. A Multi-Agent Approach to Segment Arabic Handwritten Text Lines [J]. Elkhayati Mohsine, Elkettani Youssfi, Mourchid Mohammed. International Journal of Cognitive Informatics and Natural Intelligence. 2020, Issue 4
2. A novel method for segmenting and straightening of text lines in handwritten Telugu documents based on smearing and regression approach [J]. Mslb. Subrahmanyam, V Vijaya Kumar, B Eswara Reddy. International Journal of Engineering & Technology. 2018, Issue 3
3. A graph-based approach for segmenting touching lines in historical handwritten documents [J]. David Fernandez-Mota, Josep Llados, Alicia Fornes. International Journal on Document Analysis and Recognition. 2014, Issue 3
4. A Binarization-Free Clustering Approach to Segment Curved Text Lines in Historical Manuscripts [C]. Garz Angelika, Fischer Andreas, Bunke Horst. International Conference on Document Analysis and Recognition. 2013
5. Manuscripts, Texts and Geographical Writings: A Study of Dunhuang Manuscript P.2005 [D]. Sun, Yingying. 2017
6. Tractatus simplex de cortice peruuiano: A plain treatise on the Peruvian bark (The Stanitz Manuscript): a late seventeenth or early eighteenth century anonymous manuscript account of the Jesuits bark published in its original Latin text with a translation, introduction and notes [O]. Andreas-Holger Maehle. 1993
7. A novel method for segmenting and straightening of text lines in handwritten Telugu documents based on smearing and regression approach [O]. Mslb. Subrahmanyam, V Vijaya Kumar, B Eswara Reddy. 2018
8. Statistical Approach to Retrieving Historical Manuscript Images without Recognition [R]. Rath, T. M., Lavrenko, V., Manmatha, R. 2003
