首页> 外文会议>International Conference on Frontiers in Handwriting Recognition >A Machine Learning Approach to Detection of Core Region of Online Handwritten Bangla Word Samples
【24h】

A Machine Learning Approach to Detection of Core Region of Online Handwritten Bangla Word Samples

机译:一种在线孟加拉手写单词样本核心区域检测的机器学习方法

获取原文

摘要

Core region detection of handwritten cursive words is an important step towards their automatic recognition. Several preprocessing operations such as height normalization, slant estimation etc. Are often based on this core region. This is particularly useful for word recognition of major Indian scripts, which have large character sets. The main parts of majority of these characters belong to the core region that is bounded above by a headline and bounded below by an imaginary base line. Only a few such characters or their parts appear either above or below the core region. A few approaches are available in the literature for detection of such a core region of offline handwritten word samples of Latin script. Also, a similar region is often determined for recognition of images of printed Indian scripts. However, none of these approaches have studied detection of core region of an unconstrained online handwritten word. In this article, we propose a novel method for detection of the core region of online handwritten word samples of Bangla, a major Indian script. For this we first perform smoothing on the samples and then segment a stroke into sub strokes. We compute certain novel positional features from each such sub stroke. Using these features, a multilayer perceptron (MLP) is trained by back propagation (BP) algorithm. On the basis of the output of the MLP, we determine the position of both the headline and the baseline. We have tested this approach on a recently developed large database of online unconstrained handwriting Bangla word samples. The proposed approach would also work on similar samples of Devanagari, another major Indian script. Experimental results are encouraging.
机译:手写草书单词的核心区域检测是朝着它们的自动识别迈出的重要一步。一些预处理操作(例如高度归一化,倾斜估计等)通常基于此核心区域。这对于具有较大字符集的主要印度文字的单词识别特别有用。这些字符中的大多数主要部分属于核心区域,该核心区域在上方由标题限制,在下方由虚构基线限制。在核心区域的上方或下方仅出现几个这样的字符或其部分。文献中提供了几种方法来检测拉丁文字的离线手写单词样本的核心区域。而且,通常确定相似的区域以识别印刷的印度文字的图像。但是,这些方法都没有研究无约束的在线手写单词的核心区域的检测。在本文中,我们提出了一种新颖的方法来检测印度主要文字孟加拉语在线手写单词样本的核心区域。为此,我们首先对样本进行平滑处理,然后将笔划细分为子笔划。我们从每个这样的子笔划计算某些新颖的位置特征。利用这些功能,可以通过反向传播(BP)算法训练多层感知器(MLP)。根据MLP的输出,我们确定标题和基线的位置。我们已经在最近开发的在线无约束手写孟加拉语单词样本的大型数据库中测试了这种方法。拟议的方法也可以在另一种主要的印度文字“梵文”的相似样本上使用。实验结果令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号