首页> 外文期刊>電子情報通信学会技術研究報告. パターン認識·メディア理解. Pattern Recognition and Media Understanding >A study on document retrieval system for large-scale database based on OCR and character shape information
【24h】

A study on document retrieval system for large-scale database based on OCR and character shape information

机译:基于OCR和字符形状信息的大型数据库文档检索系统研究

获取原文
获取原文并翻译 | 示例
           

摘要

Making a large database of electronic documents from paper documents has left a tremendous problem. In order to search the database for an image document, it is necessary for general electronic filing systems to convert the document into texts using OCR. However, the system cannot retrieve documents that do not contain correct character codes. We had before proposed a document retrieval method that reduces false drops and false alarms by using the "shape-feature" technique that describes the outline of the character's shape. We have studied this method for large-scale database by using parallel processing and confirmed its effect.
机译:用纸质文档建立大型电子文档数据库留下了巨大的问题。为了在数据库中搜索图像文档,一般的电子归档系统必须使用OCR将文档转换为文本。但是,系统无法检索不包含正确字符代码的文档。之前,我们已经提出了一种文档检索方法,该方法通过使用描述角色形状轮廓的“形状特征”技术来减少错误掉落和错误警报。我们通过并行处理对大型数据库进行了研究,并证实了其有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号