首页> 外文会议>International Conference on Intelligent Computing, Communication and Devices >Off-line Handwritten Script Identification from Eastern Indian Document Images Using Logistic Model Tree
【24h】

Off-line Handwritten Script Identification from Eastern Indian Document Images Using Logistic Model Tree

机译:使用Logistic模型树从东方印度文档图像的离线手写脚本识别

获取原文
获取外文期刊封面目录资料

摘要

Script identification from document images is a complex real-life problem for a multi-script country like India where 13 official scripts are present. To develop an optical character recognizer for a specific language, it is necessary to identify the script first by which the document is written. In this paper, scripts from the off-line handwritten document images written by any one of the four popular scripts in eastern India, namely Bangla, Roman, Devanagari, and Oriya, are identified. A document-level approach is followed for the same. Using some mathematical, structural, and script-dependent feature, a multi-dimensional feature set is constructed. Finally, logistic model tree (LMT) is applied for classification and an average accuracy rate of 95.5 % is obtained with a fivefold crossvalidation.
机译:文档图像的脚本标识是一个复杂的现实生活问题,适用于像印度这样的多脚本国家/地区,其中有13个官方脚本。要为特定语言开发光学字符识别器,必须首先识别写入文档的脚本。在本文中,识别了来自印度东部的四个流行脚本中的任何一个,即Bangla,Roman,Devanagari和Oroya所写的离线手写文件图像的脚本。遵循相同的文档级方法。使用一些数学,结构和脚本依赖的功能,构建了多维功能集。最后,申请了物流模型树(LMT)的分类,使用五倍的交叉验证获得了95.5%的平均精度率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号