首页> 外文会议> >A document retrieval method from handwritten characters based on OCR and character shape information
【24h】

A document retrieval method from handwritten characters based on OCR and character shape information

机译:基于OCR和字符形状信息的手写字符文档检索方法

获取原文

摘要

It is a difficult task to create a large database of electronic documents from paper documents. In order to search the database for an image document, it is necessary for general electronic filing systems to convert the document into texts using OCR. However, the system cannot retrieve documents that do not contain correct character codes. We (1999) had previously proposed a document retrieval method that reduces false drops and false alarms by using the "shape-feature" technique that describes the outline of the character's shape. We now apply this method to handwritten Japanese documents. Experimental results reveal that our method has a high recall rate of 88.8% compared to the conventional methods (69.2%: text matching, 78.3%: candidate matching).
机译:从纸质文档创建大型电子文档数据库是一项艰巨的任务。为了搜索图像文档的数据库,常规电子归档系统必须使用OCR将文档转换为文本。但是,系统无法检索不包含正确字符代码的文档。我们(1999)先前已经提出了一种文件检索方法,通过使用描述字符形状轮廓的“形状特征”技术来减少假滴和误报。我们现在将这种方法应用于手写日本文件。实验结果表明,与常规方法相比,我们的方法具有88.8%的高召回率(69.2%:文本匹配,78.3%:候选匹配)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号