首页> 外文会议>Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on >An Efficient Framework for Searching Text in Noisy Document Images
【24h】

An Efficient Framework for Searching Text in Noisy Document Images

机译:在嘈杂的文档图像中搜索文本的有效框架

获取原文
获取原文并翻译 | 示例

摘要

An efficient word spotting framework is proposed to search text in scanned books. The proposed method allows one to search for words when optical character recognition (OCR) fails due to noise or for languages where there is no OCR. Given a query word image, the aim is to retrieve matching words in the book sorted by the similarity. In the offline stage, SIFT descriptors are extracted over the corner points of each word image. Those features are quantized into visual terms (visterms) using hierarchical K-Means algorithm and indexed using an inverted file. In the query resolution stage, the candidate matches are efficiently identified using the inverted index. These word images are then forwarded to the next stage where the configuration of visterms on the image plane are tested. Configuration matching is efficiently performed by projecting the visterms on the horizontal axis and searching for the Longest Common Subsequence (LCS) between the sequences of visterms. The proposed framework is tested on one English and two Telugu books. It is shown that the proposed method resolves a typical user query under 10 milliseconds providing very high retrieval accuracy (Mean Average Precision 0.93). The search accuracy for the English book is comparable to searching text in the high accuracy output of a commercial OCR engine.
机译:提出了一种有效的单词发现框架来搜索扫描书籍中的文本。所提出的方法允许人们在光学字符识别(OCR)由于噪声而失败时搜索单词,或者搜索没有OCR的语言。给定一个查询词图像,目的是检索按相似度排序的书中的匹配词。在离线阶段,在每个单词图像的角点上提取SIFT描述符。使用分层K-Means算法将这些功能量化为视觉术语(视觉术语),并使用反向文件将其索引。在查询解析阶段,使用倒排索引可以有效地识别候选匹配项。然后将这些单词图像转发到下一个阶段,在该阶段测试visterm在图像平面上的配置。通过在水平轴上投影visterms并搜索visterms序列之间的最长公共子序列(LCS),可以有效地执行配置匹配。提议的框架已在一本英语和两本泰卢固语书籍中进行了测试。结果表明,所提出的方法可以在10毫秒内解析出典型的用户查询,从而提供了非常高的检索精度(平均平均精度为0.93)。英文书的搜索精度可与商用OCR引擎的高精度输出中的文本搜索相媲美。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号