Multi-oriented Bangla and Devnagari text recognition

Pal U.; Pratim Roy P.; Tripathy N.; Llads J.

首页> 外文期刊>Pattern Recognition: The Journal of the Pattern Recognition Society >Multi-oriented Bangla and Devnagari text recognition

【24h】

Multi-oriented Bangla and Devnagari text recognition

机译：多向孟加拉语和天哪语文本识别

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

There are printed complex documents where text lines of a single page may have different orientations or the text lines may be curved in shape. As a result, it is difficult to detect the skew of such documents and hence character segmentation and recognition of such documents are a complex task. In this paper, using background and foreground information we propose a novel scheme towards the recognition of Indian complex documents of Bangla and Devnagari script. In Bangla and Devnagari documents usually characters in a word touch and they form cavity regions. To take care of these cavity regions, background information of such documents is used. Convex hull and water reservoir principle have been applied for this purpose. Here, at first, the characters are segmented from the documents using the background information of the text. Next, individual characters are recognized using rotation invariant features obtained from the foreground part of the characters. For character segmentation, at first, writing mode of a touching component (word) is detected using water reservoir principle based features. Next, depending on writing mode and the reservoir base-region of the touching component, a set of candidate envelope points is then selected from the contour points of the component. Based on these candidate points, the touching component is finally segmented into individual characters. For recognition of multi-sized/multi-oriented characters the features are computed from different angular information obtained from the external and internal contour pixels of the characters. These angular information are computed in such a way that they do not depend on the size and rotation of the characters. Circular and convex hull rings have been used to divide a character into smaller zones to get zone-wise features for higher recognition results. We combine circular and convex hull features to improve the results and these features are fed to support vector machines (SVM) for recognition. From our experiment we obtained recognition results of 99.18% (98.86%) accuracy when tested on 7515 (7874) Devnagari (Bangla) characters.

机译：存在打印的复杂文档，其中单页的文本行可能具有不同的方向，或者文本行的形状可能是弯曲的。结果，难以检测这种文档的歪斜，因此，对这些文档进行字符分割和识别是一项复杂的任务。在本文中，利用背景和前景信息，我们提出了一种新的方案，用于识别孟加拉语和德夫纳加里语脚本的印度复杂文档。在Bangla和Devnagari中，文档中的字符通常都是文字触摸的，它们形成空腔区域。为了照顾这些空腔区域，使用了此类文件的背景信息。凸包和蓄水原理已用于此目的。在这里，首先，使用文本的背景信息从文档中分割字符。接下来，使用从字符的前景部分获得的旋转不变特征来识别单个字符。对于字符分割，首先，使用基于蓄水原理的特征来检测触摸组件（单词）的书写模式。接下来，根据书写模式和触摸组件的容器基础区域，然后从组件的轮廓点中选择一组候选包络点。基于这些候选点，最终将触摸组件分割为单个字符。为了识别多尺寸/多方向的字符，根据从字符的外部和内部轮廓像素获得的不同角度信息来计算特征。这些角度信息的计算方式不取决于字符的大小和旋转。圆形和凸形船体环已用于将字符划分为较小的区域，以获取区域特征，以获得更高的识别结果。我们结合使用圆形和凸形船体特征来改善结果，并将这些特征馈入支持向量机（SVM）进行识别。从我们的实验中，当对7515（7874）个Devnagari（Bangla）字符进行测试时，我们获得了99.18％（98.86％）准确度的识别结果。

著录项

来源
《Pattern Recognition: The Journal of the Pattern Recognition Society》 |2010年第12期|共13页
作者
Pal U.; Pratim Roy P.; Tripathy N.; Llads J.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
Bangla and Devnagari text; Character segmentation; Complex text recognition; Convex hull; Document analysis; Indian script OCR;

机译：Bangla和Devnagari文本;字符分割;复杂文本识别;凸包;文档分析;印度文字OCR;

相似文献

外文文献
中文文献
专利

1. Multi-oriented Bangla and Devnagari text recognition [J] . Pal U., Pratim Roy P., Tripathy N., Pattern Recognition: The Journal of the Pattern Recognition Society . 2010,第12期

机译：多向孟加拉语和天哪语文本识别
2. Bayesian classifier for multi-oriented video text recognition system [J] . Roy Sangheeta, Shivakumara Palaiahnakote, Roy Partha Pratim, Expert systems with applications . 2015,第13期

机译：用于多方向视频文本识别系统的贝叶斯分类器
3. Structural Features Extraction for Devnagari and Bangla Language Documents [J] . Manoj Kumar Shukla, Haider Banka, K.P. Yadav Indian Journal of Science and Technology . 2015,第13期

机译：Devnagari和Bangla语言文档的结构特征提取
4. OPTICAL CHARACTER RECOGNITION FOR PRINTED HINDI TEXT IN DEVNAGARI USING SOFT-COMPUTING TECHNIQUE [C] . Divakar Yadav, A.K. Sharma, J.P.Gupta Artificial Intelligence and Applications . 2007

机译：利用软计算技术识别印地文中印地语文本的视觉特征
5. Pattern recognition and text dependent voice recognition system for a biped robot. [D] . Gonzalez Solano, Natalia Francisca. 2007

机译：两足动物机器人的模式识别和与文本相关的语音识别系统。
6. Rotation-Invariant Features for Multi-Oriented Text Detection in Natural Images [O] . Cong Yao, Xin Zhang, Xiang Bai, -1

机译：自然图像中多方向文本检测的旋转不变特征
7. A study of multi-oriented text recognition in natural scene images [O] . MONA SAUDAGAR, S.V. JAIN 2014

机译：自然场景图像中多种文本识别研究

Multi-oriented Bangla and Devnagari text recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅