首页> 外国专利> Optical Character Recognition low-resolution camera for documents acquired

Optical Character Recognition low-resolution camera for documents acquired

机译:光学字符识别低分辨率相机,用于获取文档

摘要

A system that facilitates optical character recognition, OCR, symbol low resolution, in which a string of symbols is representative of a word, and in which the symbols represent characters, comprising: a component segmentation to detect spaces between symbols to determine lines of text, and fragmenting the text lines into individual words; and a recognition component for recognizing characters (206) using a character recognizer based on machine learning to scan through each of the individual words to predict what character is likely to occur at a given location, to recognize the punctuation and word recognition; in said recognizing punctuation is used to identify if a final character of a word is a punctuation mark, comprising: determining a most likely position for each possible final character of the word character; generate a score for each character more likely; determine whether the word is punctuated word, in which the word is punctuated word if the most likely character with the highest score is a punctuation mark and if the score of the most likely character with the highest score is above a predetermined threshold; and wherein said word recognition includes: recognize the word using the rest of the word without punctuation, and add punctuation to the recognized word; and recognizing words (208) a sequence of individual reconciling character recognizer outputs with a particular word using dynamic programming and a dictionary.
机译:一种有助于光学字符识别,OCR,符号低分辨率的系统,其中,一串符号代表一个单词,并且其中,符号代表字符,包括:组件分割,用于检测符号之间的空格以确定文本行;将文本行分成单个单词;识别部件(206),其基于机器学习使用字符识别器来识别字符(206),以扫描各个单词以预测在给定位置可能出现的字符,从而识别标点符号和单词识别;在所述识别标点中,用于识别单词的最终字符是否是标点符号,包括:确定单词字符的每个可能的最终字符的最可能位置;为每个角色更可能产生一个分数;确定该单词是否为标点单词,如果得分最高的最可能字符是标点符号,并且得分最高的最可能字符的得分是否高于预定阈值,则将该单词作为标点单词;其中,所述单词识别包括:使用其余单词而不用标点符号来识别单词,并将标点符号添加到所识别的单词;以及利用动态编程和字典识别单词(208)与特定单词的一系列个体协调字符识别器输出。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号