首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Script-Independent Text Line Segmentation in Freestyle Handwritten Documents
【24h】

Script-Independent Text Line Segmentation in Freestyle Handwritten Documents

机译:自由式手写文档中与脚本无关的文本行分割

获取原文
获取原文并翻译 | 示例

摘要

Text line segmentation in freestyle handwritten documents remains an open document analysis problem. Curvilinear text lines and small gaps between neighboring text lines present a challenge to algorithms developed for machine printed or hand-printed documents. In this paper, we propose a novel approach based on density estimation and a state-of-the-art image segmentation technique, the level set method. From an input document image, we estimate a probability map, where each element represents the probability that the underlying pixel belongs to a text line. The level set method is then exploited to determine the boundary of neighboring text lines by evolving an initial estimate. Unlike connected component based methods ( [1], [2] for example), the proposed algorithm does not use any script-specific knowledge. Extensive quantitative experiments on freestyle handwritten documents with diverse scripts, such as Arabic, Chinese, Korean, and Hindi, demonstrate that our algorithm consistently outperforms previous methods [1]-[3]. Further experiments show the proposed algorithm is robust to scale change, rotation, and noise.
机译:自由式手写文档中的文本行分割仍然是一个开放文档分析问题。曲线文本行和相邻文本行之间的小间隙对为机器打印或手工打印的文档开发的算法提出了挑战。在本文中,我们提出了一种基于密度估计和最新图像分割技术的新方法,即水平集方法。从输入的文档图像中,我们估计一个概率图,其中每个元素代表基础像素属于文本行的概率。然后利用水平集方法通过发展初始估计来确定相邻文本行的边界。与基于连接组件的方法(例如[1],[2])不同,所提出的算法不使用任何脚本特定的知识。在具有多种脚本(例如阿拉伯文,中文,韩文和印地文)的自由式手写文档上的大量定量实验表明,我们的算法始终优于以前的方法[1]-[3]。进一步的实验表明,所提出的算法对于缩放变化,旋转和噪声具有鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号