首页> 外文期刊>電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding >Retrieval of relevant parts of document images based on density distributions of characters
【24h】

Retrieval of relevant parts of document images based on density distributions of characters

机译:基于字符的密度分布检索文档图像的相关部分

获取原文
获取原文并翻译 | 示例
           

摘要

This report presents a new method of document image retrieval that is capable of spotting parts of page images relevant to a user's Query. This enables us to improve the effectiveness and the usability of retrieval since the method is capable of spotting only relevant parts and thus free from the influence by irrelevant parts. Th( proposed method is based on the assumption that parts of page images which densely contain characters in a query are relevant to it. The characteristics of the proposed method are as follows: (1) Two-dimensional density distributions of a Query are calculated for ranking parts of page images, (2) The method relies only on the distribution of characters in page images so as not to be severely affected by the errors of character recognition and layout analysis. Based on the experimental results of retrieving Japanese newspaper articles, it is shown that the proposed method is superior to a method without the function of dealing with parts.
机译:该报告提出了一种新的文档图像检索方法,该方法能够发现与用户查询相关的部分页面图像。这使我们能够提高检索的效率和可用性,因为该方法仅能发现相关部分,因此不受无关部分的影响。提出的方法是基于这样的假设,即页面图像中在查询中密集包含字符的部分与之相关。该方法的特征如下:(1)计算查询的二维密度分布对页面图像的各个部分进行排序,(2)该方法仅依靠页面图像中的字符分布,不会受到字符识别和版面分析错误的严重影响,基于检索日本报纸文章的实验结果,结果表明,所提出的方法优于没有零件处理功能的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号