首页> 外文会议>International Doctoral Symposium on Applied Computation and Security Systems >Line, Word, and Character Segmentation from Bangla Handwritten Text-A Precursor Toward Bangla HOCR

【24h】

Line, Word, and Character Segmentation from Bangla Handwritten Text-A Precursor Toward Bangla HOCR

机译：来自Bangla手写文本的线，单词和字符细分 - 对Bangla Hocr的前兆

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The basic functionalities of optical character recognition (OCR) are to recognize and extract text to digitally editable text from document images. Apart from this, an OCR has other potentials in document image processing such as in automatic document sorter, writer identification/verification. In current situation, various commercially available OCR systems can be found mostly for Roman script. Development of an unconstrained offline handwritten character recognition system is one of the most challenging tasks for the research community. Things get more complicated when we consider Indic scripts like Bangla which contains more than 280 modified and compound characters along with isolated characters. For recognition of handwritten document, the most convenient way is to segment the text into characters or character parts. So line, word and character level segmentation plays a vital role in the development of such a system. In this paper, a scheme for tri-level segmentation (line, word, and character) is presented. Encouraging segmentation results are achieved on a set of 50 handwritten text documents.

机译：光学字符识别（OCR）的基本功能是识别并从文档图像中提取以数字可编辑文本的文本。除此之外，OCR还具有文件图像处理中的其他潜力，例如在自动文档分拣机中，写入器识别/验证。在目前的情况下，可以获得各种可商购的OCR系统，主要用于罗马脚本。开发不受约束的离线手写字符识别系统是研究界最具挑战性的任务之一。当我们考虑Bangla等指示脚本时，它会变得更加复杂，其中包含超过280个修改和复合字符以及隔离字符。为了识别手写文档，最方便的方法是将文本分段为字符或字符部件。因此，Word和字符级分割在这种系统的开发中起着重要作用。本文介绍了三级分段（行，单词和字符）的方案。鼓励细分结果在一套50个手写文本上实现。

著录项

来源
《International Doctoral Symposium on Applied Computation and Security Systems 》|2018年|x 179 pages|共12页
会议地点
作者
Payel Rakshit; Chayan Halder; Subhankar Ghosh; Kaushik Roy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP391.4-532;
关键词
OCR; Bangla handwritten character recognition; Line segmentation; Word segmentation; Character segmentation;

机译：OCR;Bangla手写字符识别;线分割;字分割;字符分割;

相似文献

外文文献
中文文献
专利

1. Word Extraction and Character Segmentation from Text Lines of Unconstrained Handwritten Bangla Document Images [J] . Ram Sarkar, Samir Malakar, Nibaran Das, Journal of Intelligent Systems . 2011 ,第3期

机译：从不受约束的手写孟加拉语文档图像的文本行中提取单词并进行字符分割
2. Segmentation-based recognition system for handwritten Bangla and Devanagari words using conventional classification and transfer learning [J] . Pramanik Rahul, Bag Soumen Image Processing, IET . 2020 ,第5期

机译：基于分割的孟加拉和德比拉语单词使用传统分类和转移学习的分割识别系统
3. A novel segmentation technique for online handwritten Bangla words [J] . Sen Shibaprasad, Chowdhury Shubham, Mitra Mridul, Pattern recognition letters . 2020 ,第Nova期

机译：一种新的在线手写孟加拉词的分段技术
4. Line, Word, and Character Segmentation from Bangla Handwritten Text-A Precursor Toward Bangla HOCR [C] . Payel Rakshit, Chayan Halder, Subhankar Ghosh, International Doctoral Symposium on Applied Computation and Security Systems . 2018

机译：来自Bangla手写文本的线，单词和字符细分 - 对Bangla Hocr的前兆
5. Question Bias and Biased Question Words in Mandarin, German and Bangla [D] . Xu, Beibei. 2017

机译：普通话，德语和孟加拉语的问题偏向和偏向疑问词
6. Handwritten Bangla Character Recognition Using the State-of-the-Art Deep Convolutional Neural Networks [O] . Md Zahangir Alom, Paheding Sidike, Mahmudul Hasan, 2018

机译：使用最先进的深度卷积神经网络进行手写Bangla字符识别
7. Segmentation‐based recognition system for handwritten Bangla and Devanagari words using conventional classification and transfer learning [O] . Rahul Pramanik, Soumen Bag 2020

机译：基于分割的孟加拉和德比拉语单词使用传统分类和转移学习的分割识别系统

Line, Word, and Character Segmentation from Bangla Handwritten Text-A Precursor Toward Bangla HOCR

摘要

著录项

相似文献

相关主题

期刊订阅