首页> 外文会议>12th International Conference on Frontiers in Handwriting Recognition >A Novel Arabic Baseline Estimation Algorithm Based on Sub-Words Treatment
【24h】

A Novel Arabic Baseline Estimation Algorithm Based on Sub-Words Treatment

机译:基于子词处理的阿拉伯语基线估计新算法

获取原文

摘要

Baseline detection is an essential preprocessing step for many OCR systems, it has a direct effect on the efficiency and reliability of characters segmentation and features extraction stages, which contribute strongly to yielding higher recognition accuracy. For Arabic handwritten, the conventional methods which extract baseline as straight line are ill-suited because some Arabic words may be contracted from two or more sub-words (PAWs), and the distribution of these sub-words can produce different slant angles within the same word. Focused on the source of the problem, we propose a novel Arabic baseline estimation algorithm in which the PAW level is the real basic block to be processed rather than word level. Experimental results using IFN/ENIT [1] database demonstrate the efficiency of the proposed algorithm.
机译:基线检测对于许多OCR系统来说是必不可少的预处理步骤,它直接影响字符分割和特征提取阶段的效率和可靠性,这对提高识别精度有很大贡献。对于阿拉伯语手写体,将基线提取为直线的常规方法是不合适的,因为某些阿拉伯语单词可能会从两个或更多个子单词(PAW)中收缩,并且这些子单词的分布可能会在汉字内产生不同的倾斜角度。同一个字。针对问题的根源,我们提出了一种新颖的阿拉伯语基线估计算法,其中,PAW级别是要处理的实际基本块,而不是单词级别。使用IFN / ENIT [1]数据库的实验结果证明了该算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号