首页> 外文会议>2017 International Conference on Recent Trends in Electrical, Electronics and Computing Technologies >Rule Based Filtering Approach for Detection and Localization of Bangla Text from Scene Images
【24h】

Rule Based Filtering Approach for Detection and Localization of Bangla Text from Scene Images

机译:基于规则的场景图像中孟加拉语文本检测与定位过滤方法

获取原文
获取原文并翻译 | 示例

摘要

Detection, and localization of Bangla text from natural scene images are important prerequisites for developing Bangla OCR as well as many content-based image analyses. But there is no standard Bangla OCR to be used in the daily work. Due to the presence of some unique features, detection and localization of Bangla text have become more challenging than English text. In this paper, we have proposed MSER based method along with rule-based filtering for efficiently detect and localize Bangla texts from scene images. As the MSER based method is the winning method of the benchmark data, such as ICDAR 2011, this algorithm has been applied to get a better result than related existing methods. By using MSER, candidate text regions are detected. False positives are present into the detected regions. To remove the false positives, rule-based filtering technique has been applied. In this process, geometric properties of text like aspect ratio, eccentricity, Euler number, extent, and solidity have been used to filter out non-text regions. As there is no publicly available database containing scene images of Bangla text, we have developed such database to perform the experiment. The proposed method has been evaluated on 50 sample images of our present dataset containing Bangla and also evaluated on 50 images of ICDAR 2013 benchmark dataset and we have got better results in terms of precision, recall, and f-measure in both cases. A comparison has been made among existing related method and the proposed method and found that the proposed method is better.
机译:从自然场景图像中检测和定位孟加拉文本是开发孟加拉OCR以及许多基于内容的图像分析的重要前提。但是在日常工作中没有标准的Bangla OCR。由于存在一些独特的功能,孟加拉语文本的检测和本地化比英语文本更具挑战性。在本文中,我们提出了基于MSER的方法以及基于规则的过滤,可有效地从场景图像中检测和定位孟加拉文本。由于基于MSER的方法是基准数据的获胜方法,例如ICDAR 2011,因此该算法已得到应用,比相关的现有方法可获得更好的结果。通过使用MSER,可以检测候选文本区域。误报出现在检测到的区域中。为了消除误报,已经应用了基于规则的过滤技术。在此过程中,已使用文本的几何属性(如长宽比,偏心率,欧拉数,范围和坚固性)来过滤掉非文本区域。由于没有公开的数据库包含孟加拉文字的场景图像,因此我们开发了这种数据库来进行实验。在我们现有的包含Bangla的数据集的50个样本图像上评估了该方法,并在ICDAR 2013基准数据集的50个图像上进行了评估,在这两种情况下,我们在精度,查全率和f测度方面均取得了更好的结果。对现有的相关方法与提出的方法进行了比较,发现该方法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号