Morphological Reconstruction for Word Level Script Identification.

B. V. Dhandra; Mallikarjun Hangarge

首页> 外文期刊>International Journal of Computer Science and Security >Morphological Reconstruction for Word Level Script Identification.

【24h】

Morphological Reconstruction for Word Level Script Identification.

机译：词级脚本识别的形态重建。

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A line of a bilingual document page may contain text words in regional languageand numerals in English. For Optical Character Recognition (OCR) of such adocument page, it is necessary to identify different script forms before running anindividual OCR system. In this paper, we have identified a tool of morphologicalopening by reconstruction of an image in different directions and regionaldescriptors for script identification at word level, based on the observation thatevery text has a distinct visual appearance. The proposed system is developedfor three Indian major bilingual documents, Kannada, Telugu and Devnagaricontaining English numerals. The nearest neighbour and k-nearest neighbouralgorithms are applied to classify new word images. The proposed algorithm istested on 2625 words with various font styles and sizes. The results obtained arequite encouraging

机译：双语文档页面的一行可能包含区域语言的文字单词和英语数字。对于此类文档页面的光学字符识别（OCR），在运行单个OCR系统之前，有必要识别不同的脚本形式。在本文中，我们发现每个文本都有明显的视觉外观，因此我们通过在不同方向重建图像和区域描述符来识别单词级别的脚本，从而确定了一种形态学开放工具。拟议的系统是针对三个印度主要的双语文档（卡纳达语，泰卢固语和德夫纳加里语）开发的，其中包含英文数字。最近邻和k最近邻算法被用于对新单词图像进行分类。该算法在2625个具有各种字体样式和大小的单词上进行了测试。获得的结果相当令人鼓舞

著录项

来源
《International Journal of Computer Science and Security》 |2007年第1期|共页
作者
B. V. Dhandra; Mallikarjun Hangarge;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Extraction of Root Words using Morphological Analyzer for Devanagari Script [J] . Sharvari S. Govilkar, J. W. Bakal, Sagar R. Kulkarni International Journal of Information Technology and Computer Science . 2016,第1期

机译：使用形态分析仪提取梵文脚本中的词根
2. Word-Level Multi-Script Indic Document Image Dataset and Baseline Results on Script Identification [J] . Chayan Halder, Nibaran Das, Kaushik Roy, International journal of computer vision and iImage processing . 2017,第2期

机译：Word级多脚本指示文档图像数据集和脚本识别的基准结果
3. Improved word-level handwritten Indic script identification by integrating small convolutional neural networks [J] . Ukil Soumya, Ghosh Swarnendu, Obaidullah Sk Md, Neural computing & applications . 2020,第7期

机译：通过整合小型卷积神经网络改进了单独的手写依据脚本识别
4. Word-wise Script Identification from Bilingual Documents Based on Morphological Reconstruction [C] . Dhandra, B.V., Mallikarjun, . 2006

机译：基于形态重构的双语文档逐字识别
5. THE IDENTIFICATION OF LIFE SCRIPT ELEMENTS BY PERSONS POSSESSING VARYING LEVELS OF TRAINING AND EXPERIENCE IN TRANSACTIONAL ANALYSIS PRINCIPLES AND LIFE SCRIPT THEORY. [D] . PREPURA, WAYNE ANDREW. 1979

机译：在交易分析原理和寿命脚本理论中，通过掌握变化的训练水平和经验的人员来识别寿命脚本元素。
6. Subword segmentation--leveling out morphological variations for medical document retrieval. [O] . U. Hahn, M. Honeck, M. Piotrowski, 2001

机译：子词分割-整理出用于医学文档检索的形态变化。
7. Extraction of Root Words using Morphological Analyzer for Devanagari Script [O] . Sharvari S. Govilkar, J. W. Bakal, Sagar R. Kulkarni 2016

机译：利用形态分析仪提取根词对德文艺术脚本的影响

Morphological Reconstruction for Word Level Script Identification.

摘要

著录项

相似文献

相关主题

期刊订阅