首页> 外文会议>International Conference on Image Information Processing >Text line segmentation of multilingual handwritten documents using fourier approximation
【24h】

Text line segmentation of multilingual handwritten documents using fourier approximation

机译:使用傅里叶近似的多语言手写文本的文本线分割

获取原文

摘要

The most important and crucial tasks in online/ offline handwritten document recognition is line/word segmentation. As compared to a printed document, line/word segmentation of handwritten document is a complicated task. The typical handwritten document have irregular skews, overlapped lines, variable gaps between lines and different size words. The recent improvements in machine learning algorithms introduced an end-to-end line level recognition of printed and handwritten text with good performance. But still line segmentation of paragraph is required before proceeding for recognition. In this paper, we proposed an improved piece-wise projection based line segmentation method for handwritten documents which is more accurate than existing methods without compromising the execution speed. Our novel methodology applies signal approximation (using fourier series in trigonometric form) and statistical approach for better line segmentation. The proposed method is capable of segmenting lines, independent of language, with performance of 99.53% on in-house CDAC dataset (having 5974 lines) and 98.11% on ICDAR Competition 2013 dataset (having 2649 lines). The dataset used for experiments consists of english, spanish, hindi and bangla handwritten paragraphs.
机译:在线/离线手写文档识别中最重要和最重要的任务是行/单词分段。与打印的文档相比,手写文档的行/单词分段是一个复杂的任务。典型的手写文档具有不规则的偏移,重叠的线,线条之间的可变间隙和不同尺寸的单词。最近机器学习算法的改进引入了具有良好性能的印刷和手写文本的端到端线路级别。但在进行认可之前,需要段落的仍然是段落分割。在本文中,我们提出了一种改进的基于转换的基于分割的线分割方法,用于手写文档,其比现有方法更准确,而不会影响执行速度。我们的新方法应用了信号逼近(使用三角形级别的傅里叶级数)和更好的线路分割的统计方法。该方法能够在内部CDAC数据集(具有5974行)上的内部CDAC数据集(具有5974行)和98.11 %的性能,性能为99.53 %的分割线。用于实验的数据集由英语,西班牙语,印地语和孟加拉手写段组成。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号