
Context information from search engines for document recognition


Abstract

In this work we propose using contextual information provided by web search engine queries to improve text recognition performance. We first describe a framework for automated text recognition from images. It detects text areas in images by analyzing Maximally Stable Extremal Regions (MSERs) and recognizes characters by simple template matching. The main emphasis of the paper is on a novel method for exploiting contextual information to improve the recognition results obtained in this way. We propose analyzing the results of web search engine queries at two levels of detail (word level and sentence level), both of which significantly improve overall text recognition performance. Experimental evaluations on reference data sets show that the approach outperforms dictionary-based methods and that, even when built on a low-quality single-character recognition method, the proposed web search engine extension yields reasonable text recognition results. This work received the "Best Scientific Paper Award" at the International Conference on Pattern Recognition (ICPR), 2008 (Donoser et al., 2008).
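The word-level idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how search results are scored, so the combination rule (recognizer score times log-scaled hit count), the function names `candidate_words` and `rerank_by_hits`, and the hit counts in `FAKE_HITS` are all assumptions made for illustration. A real system would replace `fake_hit_count` with an actual search engine query.

```python
from itertools import product
from math import log


def candidate_words(char_candidates, limit=50):
    """Enumerate candidate words from per-position (char, score) lists.

    Each word's score is the product of its character scores, standing in
    for the output of a weak template-matching recognizer (assumption).
    """
    words = []
    for combo in product(*char_candidates):
        word = "".join(ch for ch, _ in combo)
        score = 1.0
        for _, char_score in combo:
            score *= char_score
        words.append((word, score))
    words.sort(key=lambda pair: pair[1], reverse=True)
    return words[:limit]


def rerank_by_hits(words, hit_count):
    """Pick the word maximizing recognizer score * log(1 + hits).

    The weighting is a hypothetical choice, not taken from the paper.
    """
    def combined(pair):
        word, score = pair
        return score * log(1 + hit_count(word))

    return max(words, key=combined)[0]


# Made-up hit counts standing in for a web search engine (assumption).
FAKE_HITS = {"clear": 900000, "c1ear": 12, "cleor": 3}


def fake_hit_count(word):
    return FAKE_HITS.get(word, 0)


# Ambiguous recognizer output: "1" vs "l" and "o" vs "a" are easily confused.
char_candidates = [
    [("c", 0.9)],
    [("1", 0.6), ("l", 0.4)],
    [("e", 0.9)],
    [("o", 0.55), ("a", 0.45)],
    [("r", 0.9)],
]

words = candidate_words(char_candidates)
best = rerank_by_hits(words, fake_hit_count)
```

Here the recognizer alone ranks the misreading "c1eor" first, while the hit-count reranking selects "clear". A sentence-level variant could apply the same scheme to whole candidate phrases instead of single words.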

Bibliographic record

  • Source
    Pattern Recognition Letters | 2010, Issue 8 | pp. 750-754 | 5 pages
  • Author affiliations

    Institute for Computer Graphics and Vision, Graz University of Technology, Austria;

    Department of Slavic Studies, University of Graz, Austria;

    Institute for Computer Graphics and Vision, Graz University of Technology, Austria;

  • Indexed in: Science Citation Index (SCI); Engineering Index (EI)
  • Original format: PDF
  • Language: English
  • Keywords

    context; text recognition; web search engines;
