首页> 外文会议>Camera-based document analysis and recognition. >Multi-script and Multi-oriented Text Localization from Scene Images
【24h】

Multi-script and Multi-oriented Text Localization from Scene Images

机译:场景图像中的多脚本和多方向文本本地化

获取原文
获取原文并翻译 | 示例

摘要

This paper describes a new method of color text localization from generic scene images containing text of different scripts and with arbitrary orientations. A representative set of colors is first identified using the edge information to initiate an unsupervised clustering algorithm. Text components are identified from each color layer using a combination of a support vector machine and a neural network classifier trained on a set of low-level features derived from the geometric, boundary, stroke and gradient information. Experiments on camera-captured images that contain variable fonts, size, color, irregular layout, non-uniform illumination and multiple scripts illustrate the robustness of the method. The proposed method yields precision and recall of 0.8 and 0.86 respectively on a database of 100 images. The method is also compared with others in the literature using the ICDAR 2003 robust reading competition dataset.
机译:本文描述了一种新的彩色文本本地化方法,该方法可从包含不同脚本且具有任意方向的文本的通用场景图像中进行定位。首先使用边缘信息识别一组代表性的颜色,以启动无监督的聚类算法。使用支持向量机和神经网络分类器的组合从每个颜色层中识别文本成分,该神经网络分类器在从几何,边界,笔划和渐变信息中导出的一组低级特征上进行训练。在包含可变字体,大小,颜色,不规则布局,不均匀照明和多个脚本的相机捕获图像上进行的实验说明了该方法的鲁棒性。所提出的方法在100个图像的数据库上分别产生0.8和0.86的精度和召回率。使用ICDAR 2003健壮的阅读比赛数据集,将该方法与文献中的其他方法进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号