A method for text-line segmentation for unconstrained Arabic and Persian handwritten text image

Abstract

Text-line segmentation is one of the challenging problems in the recognition of freestyle handwritten text documents. Curvilinear text lines and small gaps between neighboring text lines pose a challenge to algorithms developed for machine-printed or hand-printed documents. In this paper, we propose a novel approach based on a painting algorithm that divides a text image into a number of vertical segments, a step called striping. Because Arabic and Persian scripts contain many dots, we ran the experiments on available scanned pages of historical Nastaliq text. The results show that the proposed algorithm is robust to scale change, rotation, and noise. The proposed method may contribute significantly to the development of OCR-related applications.
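The abstract names only the striping step, so the following is a minimal sketch of how striping might look, not the paper's implementation. The function names (`split_into_stripes`, `paint_stripe`, `painted_runs`), the binary list-of-rows image format, and the density threshold used for "painting" are all assumptions made for illustration.

```python
from typing import List, Tuple

Image = List[List[int]]  # rows of pixels: 0 = background, 1 = ink (assumed format)


def split_into_stripes(image: Image, num_stripes: int) -> List[Image]:
    """Cut a binary image into vertical stripes of near-equal width."""
    if not image or not image[0]:
        return []
    width = len(image[0])
    num_stripes = max(1, min(num_stripes, width))
    # Column boundaries; integer arithmetic spreads any remainder across stripes.
    bounds = [i * width // num_stripes for i in range(num_stripes + 1)]
    return [[row[lo:hi] for row in image] for lo, hi in zip(bounds, bounds[1:])]


def paint_stripe(stripe: Image, threshold: float = 0.0) -> List[int]:
    """Mark each row of a stripe as text (1) or gap (0).

    Assumption: a row is painted when its fraction of ink pixels
    exceeds `threshold`. The paper may use a different rule.
    """
    return [1 if sum(row) / len(row) > threshold else 0 for row in stripe]


def painted_runs(painted: List[int]) -> List[Tuple[int, int]]:
    """Return (start_row, end_row) pairs, end exclusive, for each painted run."""
    runs: List[Tuple[int, int]] = []
    start = None
    for y, value in enumerate(painted):
        if value and start is None:
            start = y
        elif not value and start is not None:
            runs.append((start, y))
            start = None
    if start is not None:
        runs.append((start, len(painted)))
    return runs


# A tiny page: the first text line drifts down by one row from left to right.
page = [
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
]
for index, stripe in enumerate(split_into_stripes(page, 2)):
    print(index, painted_runs(paint_stripe(stripe)))
```

Processing each stripe separately is what lets a curved or skewed line be followed locally: in the toy page, a projection over the full width would report the first line as two rows thick, while each stripe reports it as one row at a different height.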