首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals
【24h】

Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals

机译:印度文字的手写数字数据库和混合数字的多阶段识别

获取原文
获取原文并翻译 | 示例

摘要

This article primarily concerns the problem of isolated handwritten numeral recognition of major Indian scripts. The principal contributions presented here are (a) pioneering development of two databases for handwritten numerals of two most popular Indian scripts, (b) a multistage cascaded recognition scheme using wavelet based multiresolution representations and multilayer perceptron classifiers and (c) application of (b) for the recognition of mixed handwritten numerals of three Indian scripts Devanagari, Bangla and English. The present databases include respectively 22,556 and 23,392 handwritten isolated numeral samples of Devanagari and Bangla collected from real-life situations and these can be made available free of cost to researchers of other academic Institutions. In the proposed scheme, a numeral is subjected to three multilayer perceptron classifiers corresponding to three coarse-to-fine resolution levels in a cascaded manner. If rejection occurred even at the highest resolution, another multilayer perceptron is used as the final attempt to recognize the input numeral by combining the outputs of three classifiers of the previous stages. This scheme has been extended to the situation when the script of a document is not known a priori or the numerals written on a document belong to different scripts. Handwritten numerals in mixed scripts are frequently found in Indian postal mails and table-form documents.
机译:本文主要涉及主要印度文字的孤立手写数字识别问题。这里介绍的主要贡献是(a)首次开发了两个数据库,用于两个最受欢迎的印度文字的手写数字;(b)使用基于小波的多分辨率表示和多层感知器分类器的多级级联识别方案,以及(c)应用(b)用于识别三种印度文字梵文,孟加拉语和英语的混合手写数字。目前的数据库分别包括从现实生活中收集的22,556和23,392个手写的孤立的数字梵文和孟加拉语数字样本,这些样本可以免费提供给其他学术机构的研究人员。在所提出的方案中,数字以级联方式经受对应于三个粗糙到精细分辨率级别的三个多层感知器分类器。如果即使在最高分辨率下也会发生拒绝,则通过组合前一级的三个分类器的输出,将另一个多层感知器用作识别输入数字的最终尝试。该方案已扩展到以下情况:文档的脚本不是先验的,或者写在文档上的数字属于不同的脚本。在印度邮政邮件和表格文件中经常发现混合脚本中的手写数字。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号