【24h】

Text detection and localization in natural scene images based on text awareness score

机译:基于文本意识分数的自然场景图像中的文本检测与本地化

获取原文
获取原文并翻译 | 示例
           

摘要

Text detection & localization plays an essential role in finding the textual information from natural scene images that can be used in robot navigation, license plate detection, and wearable applications. In this work, we present text detection and localization approach based upon a novel text awareness model that encompasses an improved fast edge preserving and smoothing Maximum Stable Extremal Region (FEPS-MSER) algorithm which uses the fast guided filter to separate the interconnected characters efficiently by removing the mixed pixels around the edges of blurred images. The fast guided filter takes less execution time as compared to other edge-smoothing filters. The combination of five independent and class determining facets namely stroke width deviation, 8-histogram of edge gradients, color variation, occupation ratio, and occupy rate convex area is proposed to differentiate between text and non-text components. The probability of a component to be text is based on Text Awareness Score (TAS) that is calculated by fusing these facets in Naive Bayes using the observation possibility and prior probability of text & non-text components. Naive Bayes classifier helps in accurate and fast determination of the text awareness score and thus helps in the classification of text & non-text components with the help of graph cut algorithm. The text components have been grouped by using the mean-shift clustering algorithm which is a non-parametric technique and does not require the initial knowledge of clusters. The proposed method achieves improved results concerning precision, recall, and f-measure on the ICDAR benchmark datasets for natural scene images.
机译:文本检测和本地化在查找可用于机器人导航,车牌检测和可穿戴应用程序中的自然场景图像中的文本信息扮演基本作用。在这项工作中,我们基于新颖的文本意识模型提供了文本检测和定位方法,其包括改进的快速边缘保留和平滑最大稳定极值区域(FEPS-MSER)算法,该算法使用快速引导滤波器将互连的字符有效地分离在模糊图像的边缘周围移除混合像素。与其他边缘平滑滤波器相比,快速引导滤波器的执行时间较少。提出了五个独立和类别确定方面的组合,即行程宽度偏差,8级直方图,颜色变化,占用比和占用速率凸面区域,以区分文本和非文本组件。要文本的组件的概率基于文本意识得分(TAS),其通过使用文本和非文本组件的观察可能性和现有概率来融合Naive Bayes中的这些方面来计算。 Naive Bayes Classifier有助于准确,快速地确定文本意识评分,从而有助于在图形切割算法的帮助下进行文本和非文本组件的分类。通过使用作为非参数技术的平均移位聚类算法来分组文本组件,并且不需要群集的初始知识。所提出的方法实现了对自然场景图像的ICDAR基准数据集上的精度,召回和F测量的改进结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号