首页> 外国专利> System and method for retrieving electronic documents created by optical character recognition

System and method for retrieving electronic documents created by optical character recognition

机译:检索通过光学字符识别创建的电子文档的系统和方法

摘要

A method, system and computer product for processing search requests in order to compensate for characters and character strings misread during OCR scanning is disclosed. After an alphanumeric search request is entered, the system determines variant words associated with the entered alphanumeric search request according to a predefined table of possible OCR errors, the OCR errors' probability of occurrence and a predefined threshold of probability of occurrences. When the preprocessing is complete, a search engine then uses the variant words to search a database containing OCR scanned documents.
机译:公开了一种用于处理搜索请求以补偿在OCR扫描期间误读的字符和字符串的方法,系统和计算机产品。在输入字母数字搜索请求之后,系统根据可能的OCR错误的预定义表,OCR错误的发生概率和预定义的发生概率阈值,确定与输入的字母数字搜索请求关联的变体词。预处理完成后,搜索引擎将使用变体词来搜索包含OCR扫描文档的数据库。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号