首页> 外文会议>International Conference on Document Analysis and Recognition >Towards Visual Words to Words Text Detection with a General Bag of Words Representation
【24h】

Towards Visual Words to Words Text Detection with a General Bag of Words Representation

机译:朝着视觉单词用一般的单词表示来单词文本检测

获取原文

摘要

We address the problem of text localization and retrieval in real world images. We are first to study the retrieval of text images, i.e. the selection of images containing text in large collections at high speed. We propose a novel representation, textual visual words, which describe text by generic visual words that geometrically consistently predict bottom and top lines of text. The visual words are discretized SIFT descriptors of Hessian features. The features may correspond to various structures present in the text - character fragments, individual characters or their arrangements. The textual words representation is invariant to affine transformation of the image and local linear change of intensity. Experiments demonstrate that the proposed method outperforms the state-of-the-art on the MS dataset. The proposed method detects blurry, small font, low contrast, noisy text from real world images.
机译:我们解决了现实世界形象中的文本本地化和检索问题。我们首先要研究文本图像的检索,即,在高速中选择包含大集合中的文本的图像。我们提出了一种新颖的表示,文本视觉词语,它通过几何视觉单词描述了几何上一致地预测底部和顶线文本的文本。视觉单词是令人享有的隐式筛选的描述符。特征可以对应于文本字符片段,单个字符或其布置中存在的各种结构。文字字表示是不变的,以归属图像的变换和强度的本地线性变化。实验表明,所提出的方法优于MS DataSet上的最先进。所提出的方法检测来自现实世界图像的模糊,小字体,低对比度,嘈杂的文本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号