Off-line Handwritten Script Identification from Eastern Indian Document Images Using Logistic Model Tree

机译：使用Logistic模型树从东方印度文档图像的离线手写脚本识别

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Script identification from document images is a complex real-life problem for a multi-script country like India where 13 official scripts are present. To develop an optical character recognizer for a specific language, it is necessary to identify the script first by which the document is written. In this paper, scripts from the off-line handwritten document images written by any one of the four popular scripts in eastern India, namely Bangla, Roman, Devanagari, and Oriya, are identified. A document-level approach is followed for the same. Using some mathematical, structural, and script-dependent feature, a multi-dimensional feature set is constructed. Finally, logistic model tree (LMT) is applied for classification and an average accuracy rate of 95.5 % is obtained with a fivefold crossvalidation.

机译：文档图像的脚本标识是一个复杂的现实生活问题，适用于像印度这样的多脚本国家/地区，其中有13个官方脚本。要为特定语言开发光学字符识别器，必须首先识别写入文档的脚本。在本文中，识别了来自印度东部的四个流行脚本中的任何一个，即Bangla，Roman，Devanagari和Oroya所写的离线手写文件图像的脚本。遵循相同的文档级方法。使用一些数学，结构和脚本依赖的功能，构建了多维功能集。最后，申请了物流模型树（LMT）的分类，使用五倍的交叉验证获得了95.5％的平均精度率。

著录项

来源
《International Conference on Intelligent Computing, Communication and Devices》|2015年||共9页
会议地点
作者
Sk Md Obaidullah; Nibaran Das; Kaushik Roy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-532;
关键词
Document image analysis; Handwritten script identification; Offline documents; Classification; Optical character recognizer;

机译：文档图像分析;手写脚本识别;离线文件;分类;光学字符识别器;

相似文献

外文文献
中文文献
专利

1. AUTOMATIC LINE-LEVEL SCRIPT IDENTIFICATION FROM HANDWRITTEN DOCUMENT IMAGES - A REGION-WISE CLASSIFICATION FRAMEWORK FOR INDIAN SUBCONTINENT [J] . Sk Md Obaidullah, Chayan Halder, K. C. Santosh, Malaysian Journal of Computer Science . 2018,第1期

机译：手写文档图像的自动行级脚本识别-印度次大陆的区域明智分类框架
2. Handwritten Indic Script Identification in Multi-Script Document Images: A Survey [J] . Obaidullah Sk Md, Santosh K. C., Das Nibaran, International Journal of Pattern Recognition and Artificial Intelligence . 2018,第10期

机译：多脚本文档图像中的手写印度文字识别：一项调查
3. Bangla and Oriya Script Lines Identification from Handwritten Document Images in Tri-script Scenario [J] . Sk Md Obaidullah, Chayan Halder, Nibaran Das, International Journal of Service Science, Management, Engineering, and Technology . 2016,第1期

机译：在三脚本方案中从手写文档图像中识别孟加拉语和奥里亚语脚本行
4. Off-line Handwritten Script Identification from Eastern Indian Document Images Using Logistic Model Tree [C] . Sk Md Obaidullah, Nibaran Das, Kaushik Roy International Conference on Intelligent Computing, Communication and Devices . 2015

机译：使用Logistic模型树从东方印度文档图像的离线手写脚本识别
5. Document image analysis techniques for handwritten text segmentation, document image rectification and digital collation. [D] . Salvi, Dhaval. 2014

机译：用于手写文本分割，文档图像校正和数字整理的文档图像分析技术。
6. Shallow Landslide Susceptibility Mapping: A Comparison between Logistic Model Tree Logistic Regression Naïve Bayes Tree Artificial Neural Network and Support Vector Machine Algorithms [O] . Viet-Ha Nhu, Ataollah Shirzadi, Himan Shahabi, 2020

机译：浅层滑坡敏感性图：逻辑模型树逻辑回归朴素贝叶斯树人工神经网络和支持向量机算法之间的比较
7. AUTOMATIC LINE-LEVEL SCRIPT IDENTIFICATION FROM HANDWRITTEN DOCUMENT IMAGES - A REGION-WISE CLASSIFICATION FRAMEWORK FOR INDIAN SUBCONTINENT [O] . Sk Md Obaidullah, Chayan Halder, K. C. Santosh, 2018

机译：手写文档图像的自动线路级脚本识别 - 印度次大陆的一个区域明智的分类框架
8. Script-Independent Text Line Segmentation in Freestyle Handwritten Documents [R] . Li, Y. , Zheng, Y. , Doermann, D. , 2006

机译：自由式手写文档中与脚本无关的文本行分割

Off-line Handwritten Script Identification from Eastern Indian Document Images Using Logistic Model Tree

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅