A multi-font OCR system for printed Telugu text

机译：用于打印Telugu文本的多字体OCR系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This work describes the design and development of a Telugu Optical Character Recognition system for printed text (TOSP). Pre- processing tasks considered in this paper are: Conversion of a grey scale image to a binary image, image rectification, skew detection and removal, segmentation of text into lines, words and basic symbols. Basic symbols are identified as the fundamental unit of segmentation in this paper which are recognized by the recognizer. The combinations of these basic symbols that together form characters and compound characters of Telugu are also determined to complete the recognition process. The special feature of TOSP is that it is designed to handle multiple sizes and multiple fonts. Further, the output produced by TOSP can directly be opened in any Indian language software that supports transliteration facility into Telugu script and edited. Several such softwares are popular and available.

机译：这项工作描述了用于打印文本（TOSP）的Telugu光学字符识别系统的设计和开发。本文考虑的预处理任务是：将灰度图像转换为二进制图像，图像整流，偏斜检测和删除，文本分段为行，单词和基本符号。基本符号被识别为本文中的分割基本单元，该纸张被识别器识别。还决定了这些基本符号的组合，形成Telugu的字符和复合特征以完成识别过程。 TOSP的特殊功能是它旨在处理多种大小和多个字体。此外，通过TOP生产的输出可以直接在任何印度语言软件中打开，该软件支持转换设施进入Telugu脚本并编辑。几个这样的软件很受欢迎和可用。

著录项

来源
《Language Engineering Conference》|2003年||共11页
会议地点
作者
C. Vasantha Lakshmi; C. Patvardhan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. OCR- The 3 Layered Approach for Decision Making State and Identification of Telugu Hand Written and Printed Consonants and Conjunct Consonants by Using Advanced Fuzzy Logic Controller [J] . B.Rama, Santosh Kumar Henge International Journal of Artificial Intelligence & Applications (IJAIA) . 2016,第3期

机译：OCR-使用高级模糊逻辑控制器对泰卢固语手写，印刷辅音和辅音的决策状态和识别的三层方法
2. An optical character recognition system for printed Telugu text [J] . C. Vasantha Lakshmi, C. Patvardhan Pattern Analysis and Applications . 2004,第2期

机译：用于打印泰卢固语文本的光学字符识别系统
3. SEGMENTATION OF OVERLAPPING TEXT LINES, CHARACTERS IN PRINTED TELUGU TEXT DOCUMENT IMAGES [J] . M Swamy Das, Dr. CRK Reddy, Dr. A Govardhan, International Journal of Engineering Science and Technology . 2010,第11期

机译：打印的泰卢固文本文档图像中重叠的文本行，字符的分段
4. A multi-font OCR system for printed Telugu text [C] . Lakshmi, C.V., Patvardhan, . 2003

机译：用于打印泰卢固语文本的多字体OCR系统
5. A hybrid two-dimensional HMM and MLP OCR system for processing multi-font and low-quality English documents. [D] . Fu, Nenghong. 2004

机译：混合的二维HMM和MLP OCR系统，用于处理多字体和低质量的英语文档。
6. Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users [O] . Leo Neat, Ren Peng, Siyang Qin, -1

机译：场景文本访问：针对盲用户的移动OCR模式的比较
7. OCR of Printed Telugu Text with High Recognition Accuracies [O] . C. Vasantha Lakshmi, Ritu Jain, C. Patvardhan 2014

机译：具有高识别精度的印刷泰卢固语文本的OCR
8. Design, Integration, and Evaluation of Form-Based Handprint and OCR Systems [R] . Wilson, C. L., Geist, J., Garris, M. D., 1996

机译：基于表格的手印和OCR系统的设计，集成和评估

A multi-font OCR system for printed Telugu text

摘要

著录项

相似文献

相关主题

期刊订阅