首页> 外国专利> Handwritten text recognition for entire sentences and word series as well as single words

Handwritten text recognition for entire sentences and word series as well as single words

机译:整个句子和单词系列以及单个单词的手写文本识别

摘要

Handwritten text recognition uses flexible text model taking into account occurrence probabilities of series of letters. Symbol models are used e.g. hidden Markov model. User inputs handwriting by digitizer tablet or onto display screen, pen input is detected and values and characteristics vectors stored in memory. User selects recognition mode for handwriting recognition. Two modes available, multi word recognition mode with recognition proceeding over the word boundaries to recognize entire sentence. In second mode, single word recognition, isolated recognition of single words takes into account additional recognition of punctuation marks. Multi-word recognition uses prestored text model (106) held in memory and hidden Markov model (17) trained in previous training phase. Text model describes occurrence probability of a letter under condition of specific series of letters before the letter and determines occurrence probability for a series of characters. To train text model for multi word recognition, entire sentence is used to determine statistic relationships over word boundaries. Second recognition mode, for single words, has further step of single word recognition. It uses another text model for single word recognition (109) and hidden Markov model for single word recognition (110). Independent claims included for data processing unit, computer readable storage medium for program for recognition.
机译:手写文本识别使用灵活的文本模型,同时考虑到一系列字母的出现概率。例如使用符号模型隐藏的马尔可夫模型。用户通过数字化仪输入板或在显示屏上输入笔迹,检测笔输入并将值和特征向量存储在内存中。用户选择用于手写识别的识别模式。有两种可用的模式,多单词识别模式,其中识别在单词边界上进行以识别整个句子。在第二种模式中,单个单词识别,单个单词的隔离识别考虑了标点符号的附加识别。多词识别使用存储在存储器中的预存储文本模型(106)和在先前训练阶段中训练过的隐马尔可夫模型(17)。文本模型描述了字母之前特定字母序列的条件下字母的出现概率,并确定了一系列字符的出现概率。为了训练用于多单词识别的文本模型,整个句子用于确定单词边界上的统计关系。对于单个单词的第二识别模式具有单个单词识别的进一步步骤。它使用另一个文本模型进行单词识别(109),并使用隐马尔可夫模型进行单词识别(110)。独立权利要求包括数据处理单元,用于识别程序的计算机可读存储介质。

著录项

  • 公开/公告号DE19961476A1

    专利类型

  • 公开/公告日2001-07-05

    原文格式PDF

  • 申请/专利权人 KOSMALA ANDREAS;

    申请/专利号DE1999161476

  • 发明设计人 KOSMALA ANDREAS;WILLETT DANIEL;

    申请日1999-12-20

  • 分类号G06K9/78;

  • 国家 DE

  • 入库时间 2022-08-22 01:10:05

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号