首页> 外国专利> IMAGE DOCUMENT PROCESSING APPARATUS, IMAGE DOCUMENT PROCESSING METHOD, IMAGE PROCESSING PROGRAM, AND RECORDING MEDIUM ON WHICH IMAGE PROCESSING PROGRAM IS RECORDED

IMAGE DOCUMENT PROCESSING APPARATUS, IMAGE DOCUMENT PROCESSING METHOD, IMAGE PROCESSING PROGRAM, AND RECORDING MEDIUM ON WHICH IMAGE PROCESSING PROGRAM IS RECORDED

机译:记录图像处理程序的图像文件处理装置,图像文件处理方法,图像处理程序以及记录介质

摘要

PPROBLEM TO BE SOLVED: To provide an image document processing apparatus, and an image document processing method, in each of which index information is improved to achieve higher search precision. PSOLUTION: An image of a character string composed of M pieces of characters is clipped from an image document, and the image is divided into separate characters, image features of each character image are extracted, based on the image features, N (N1, integer) pieces of character images in descending order of degree of similarity are selected as candidate characters, from a character image feature dictionary which stores the image features of character image in units of character, and a first index matrix of MxN-th cells of the clipped character strings is prepared. A candidate character string composed of a plurality of candidate characters constituting a first column of the first index matrix, is subjected to a lexical analysis according to a predetermined language model, and whereby a second index matrix having adjusted the candidate character string to a character string which makes sense is prepared, in the language model, statistics are taken and then, the lexical analysis is performed. PCOPYRIGHT: (C)2009,JPO&INPIT
机译:

要解决的问题:提供一种图像文档处理设备和图像文档处理方法,其中,每个图像文档处理设备和图像文档处理方法均被改进以实现更高的搜索精度。

解决方案:从图像文档中剪切由M个字符组成的字符串的图像,并将该图像划分为单独的字符,并基于图像特征N(从以字符为单位存储字符图像的图像特征的字符图像特征字典和MxN-的第一索引矩阵中,选择N> 1(整数)个相似度从高到低的字符图像作为候选字符。准备被剪裁的字符串的第th个单元。由构成第一索引矩阵的第一列的多个候选字符组成的候选字符串,根据预定的语言模型进行词法分析,从而将候选字符串调整为字符串的第二索引矩阵这是很有意义的,在语言模型中,进行统计,然后进行词法分析。

版权:(C)2009,日本特许厅&INPIT

著录项

  • 公开/公告号JP2009026288A

    专利类型

  • 公开/公告日2009-02-05

    原文格式PDF

  • 申请/专利权人 SHARP CORP;

    申请/专利号JP20070246158

  • 申请日2007-09-21

  • 分类号G06T1;G06F17/30;

  • 国家 JP

  • 入库时间 2022-08-21 19:40:06

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号