首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >A Robust and Binarization-Free Approach for Text Line Detection in Historical Documents
【24h】

A Robust and Binarization-Free Approach for Text Line Detection in Historical Documents

机译:一种鲁棒且无二值化的历史文档文本行检测方法

获取原文

摘要

Text line extraction from complex handwritten documents, especially for historical collections, is still an unsolved problem. There is a strong demand for reliable and robust approaches since text line extraction is a crucial pre-processing step for modern text recognition and keyword spotting systems. We propose a binarization-free system which employs a newly developed clustering approach based on so-called 'superpixels'. Although multiple ways of generating superpixels were developed in the past, we demonstrate that even a standard method yields impressive results. Our clustering approach is applicable to various scenarios by making use of general characteristics of text lines (e.g., curvilinearity, interline spacings, local homogeneity), and without adapting its parametrization. State-of-the-art results are achieved by the same parametrization for 8 different well-established benchmarking datasets. These datasets cover historical and modern texts as well as images with diverse resolutions and fonts. The system is developed for detecting text lines in complex scenarios. It is not tuned to assign foreground pixels to detected text lines. Thus, superior performance is achieved for the historical datasets for which no pixel hit accuracy of 95% is required. Remarkably, for the dataset of the ICDAR 2015 Competition on Text Line Detection in Historical Documents, the average cost per text line was reduced from 9.77 (winning team) to 8.19.
机译:从复杂的手写文档中提取文本行,尤其是对于历史收藏,仍然是一个尚未解决的问题。由于文本行提取是现代文本识别和关键字识别系统的关键预处理步骤,因此对可靠和健壮的方法提出了很高的要求。我们提出了一种无二值化的系统,该系统采用了基于所谓的“超像素”的最新开发的聚类方法。尽管过去开发了多种生成超像素的方法,但我们证明,即使是标准方法也可以产生令人印象深刻的结果。我们的聚类方法可以利用文本行的一般特征(例如曲线线性,行间间距,局部均匀性)适用于各种场景,而无需对其参数化进行调整。通过对8个不同的公认基准数据集进行相同的参数化,可获得最先进的结果。这些数据集涵盖了历史和现代文本以及具有不同分辨率和字体的图像。该系统是为检测复杂场景中的文本行而开发的。不调整为将前景像素分配给检测到的文本行。因此,对于不需要像素命中精度为95%的历史数据集,可以获得卓越的性能。值得注意的是,对于ICDAR 2015年历史文档中文本行检测竞赛的数据集,每条文本行的平均成本从9.77(获胜团队)降低到8.19。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号