首页> 外国专利> Reading document search access technique and the document search

Reading document search access technique and the document search

机译:阅读文档搜索访问技术和文档搜索

摘要

PROBLEM TO BE SOLVED: To provide a method that enables a search and browse of a document image group through the application of a document structure analysis technique and a character recognition technique as searching/browsing means for paper documents and document images.;SOLUTION: A highly functional document image search/browse system separates an OCR and a document processing apparatus, adopts as OCR output formats data (reading hypothesis data) holding multiple hypotheses of character line extraction, character segmentation and character recognition, and document structure data having ruled line information, frame information, character line information, browse attribute information and the like about a document image, and provides a function of important keyword extraction and document search from typed and handwritten character strings using OCR-added data, and of document display intended by a browser using the document structure data.;COPYRIGHT: (C)2005,JPO&NCIPI
机译:解决的问题:提供一种方法,该方法使得能够通过应用作为纸质文档和文档图像的搜索/浏览手段的文档结构分析技术和字符识别技术来搜索和浏览文档图像组。高性能的文档图像搜索/浏览系统将OCR和文档处理设备分开,采用OCR输出格式数据(读取假设数据),该数据包含字符行提取,字符分段和字符识别的多个假设以及具有格线信息的文档结构数据,框架信息,字符行信息,浏览属性信息等有关文档图像的信息,并提供重要的关键字提取和使用OCR附加数据从键入和手写的字符串中搜索文档的功能,以及浏览器预期的文档显示功能使用文档结构数据。;版权所有:(C)2005,JPO&NCIPI

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号