首页> 外国专利> METHOD EXECUTED BY COMPUTER FOR AUTOMATICALLY RECOGNIZING TEXT IN ARABIC, AND COMPUTER PROGRAM

METHOD EXECUTED BY COMPUTER FOR AUTOMATICALLY RECOGNIZING TEXT IN ARABIC, AND COMPUTER PROGRAM

机译:由计算机执行的用于自动识别阿拉伯语中的文本的方法和计算机程序

摘要

PROBLEM TO BE SOLVED: To extract a text feature appropriately in recognizing a text in Arabic.;SOLUTION: A two-dimensional array of pixels associated with pixel values each of which is expressed in a binary number is formed as a result of digitalization of a line of Arabic characters. The pixel values are expressed in a binary number. The line of Arabic characters is divided into a plurality of line images, and a plurality of cells are defined in one of the plurality of line images. Each of the plurality of cells has an adjoining pixel group. A two-value cell number is formed as a result of serialization of the pixel value of each pixel of the plurality of cells in one of the plurality of line images. A text feature vector is formed in accordance with the two-value cell number obtained from the plurality of cells in one of the plurality of line images. The text feature vector is sent to a hidden Markov model so that the line of Arabic characters is recognized.;COPYRIGHT: (C)2014,JPO&INPIT
机译:解决的问题:为了在识别阿拉伯语文本时适当提取文本特征;解决方案:由于像素的数字化,形成了与像素值相关联的二维像素阵列,每个像素值均以二进制数表示阿拉伯字符行。像素值以二进制数表示。阿拉伯字符行被分成多个行图像,并且在多个行图像之一中定义了多个单元。多个单元中的每一个具有相邻的像素组。作为对多个线图像之一中的多个单元的每个像素的像素值的序列化的结果,形成二值单元号。根据从多个线图像之一中的多个单元格获得的二值单元格编号形成文本特征向量。文本特征向量被发送到隐藏的马尔可夫模型,以便识别阿拉伯字符行。;版权所有:(C)2014,JPO&INPIT

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号