首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Multi-oriented Bangla and Devnagari text recognition
【24h】

Multi-oriented Bangla and Devnagari text recognition

机译:多向孟加拉语和天哪语文本识别

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

There are printed complex documents where text lines of a single page may have different orientations or the text lines may be curved in shape. As a result, it is difficult to detect the skew of such documents and hence character segmentation and recognition of such documents are a complex task. In this paper, using background and foreground information we propose a novel scheme towards the recognition of Indian complex documents of Bangla and Devnagari script. In Bangla and Devnagari documents usually characters in a word touch and they form cavity regions. To take care of these cavity regions, background information of such documents is used. Convex hull and water reservoir principle have been applied for this purpose. Here, at first, the characters are segmented from the documents using the background information of the text. Next, individual characters are recognized using rotation invariant features obtained from the foreground part of the characters. For character segmentation, at first, writing mode of a touching component (word) is detected using water reservoir principle based features. Next, depending on writing mode and the reservoir base-region of the touching component, a set of candidate envelope points is then selected from the contour points of the component. Based on these candidate points, the touching component is finally segmented into individual characters. For recognition of multi-sized/multi-oriented characters the features are computed from different angular information obtained from the external and internal contour pixels of the characters. These angular information are computed in such a way that they do not depend on the size and rotation of the characters. Circular and convex hull rings have been used to divide a character into smaller zones to get zone-wise features for higher recognition results. We combine circular and convex hull features to improve the results and these features are fed to support vector machines (SVM) for recognition. From our experiment we obtained recognition results of 99.18% (98.86%) accuracy when tested on 7515 (7874) Devnagari (Bangla) characters.
机译:存在打印的复杂文档,其中单页的文本行可能具有不同的方向,或者文本行的形状可能是弯曲的。结果,难以检测这种文档的歪斜,因此,对这些文档进行字符分割和识别是一项复杂的任务。在本文中,利用背景和前景信息,我们提出了一种新的方案,用于识别孟加拉语和德夫纳加里语脚本的印度复杂文档。在Bangla和Devnagari中,文档中的字符通常都是文字触摸的,它们形成空腔区域。为了照顾这些空腔区域,使用了此类文件的背景信息。凸包和蓄水原理已用于此目的。在这里,首先,使用文本的背景信息从文档中分割字符。接下来,使用从字符的前景部分获得的旋转不变特征来识别单个字符。对于字符分割,首先,使用基于蓄水原理的特征来检测触摸组件(单词)的书写模式。接下来,根据书写模式和触摸组件的容器基础区域,然后从组件的轮廓点中选择一组候选包络点。基于这些候选点,最终将触摸组件分割为单个字符。为了识别多尺寸/多方向的字符,根据从字符的外部和内部轮廓像素获得的不同角度信息来计算特征。这些角度信息的计算方式不取决于字符的大小和旋转。圆形和凸形船体环已用于将字符划分为较小的区域,以获取区域特征,以获得更高的识别结果。我们结合使用圆形和凸形船体特征来改善结果,并将这些特征馈入支持向量机(SVM)进行识别。从我们的实验中,当对7515(7874)个Devnagari(Bangla)字符进行测试时,我们获得了99.18%(98.86%)准确度的识别结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号