...
首页> 外文期刊>International Journal of Computer Science and Security >Morphological Reconstruction for Word Level Script Identification.
【24h】

Morphological Reconstruction for Word Level Script Identification.

机译:词级脚本识别的形态重建。

获取原文
           

摘要

A line of a bilingual document page may contain text words in regional languageand numerals in English. For Optical Character Recognition (OCR) of such adocument page, it is necessary to identify different script forms before running anindividual OCR system. In this paper, we have identified a tool of morphologicalopening by reconstruction of an image in different directions and regionaldescriptors for script identification at word level, based on the observation thatevery text has a distinct visual appearance. The proposed system is developedfor three Indian major bilingual documents, Kannada, Telugu and Devnagaricontaining English numerals. The nearest neighbour and k-nearest neighbouralgorithms are applied to classify new word images. The proposed algorithm istested on 2625 words with various font styles and sizes. The results obtained arequite encouraging
机译:双语文档页面的一行可能包含区域语言的文字单词和英语数字。对于此类文档页面的光学字符识别(OCR),在运行单个OCR系统之前,有必要识别不同的脚本形式。在本文中,我们发现每个文本都有明显的视觉外观,因此我们通过在不同方向重建图像和区域描述符来识别单词级别的脚本,从而确定了一种形态学开放工具。拟议的系统是针对三个印度主要的双语文档(卡纳达语,泰卢固语和德夫纳加里语)开发的,其中包含英文数字。最近邻和k最近邻算法被用于对新单词图像进行分类。该算法在2625个具有各种字体样式和大小的单词上进行了测试。获得的结果相当令人鼓舞

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号