METHOD, SYSTEM, AND COMPUTER-READABLE RECORDING MEDIUM FOR SEGMENTING CHARACTERS COMPRISED OF A PLURALITY OF LANGUAGES INCLUDED IN A DOCUMENT BY USING LANGUAGE RECOGNITION

Abstract

PURPOSE: A method, a system, and a computer-readable recording medium are provided for segmenting the characters of a document that contains multiple languages by using language recognition, so that character segmentation remains accurate even when different languages are mixed in the document. CONSTITUTION: The method identifies the language of a character string in a document image in which two languages are mixed. The first step recognizes at least one connected component that makes up the character string in the document image. The second step analyzes the connected component to obtain information about the language of the characters.
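The two steps in the abstract can be illustrated with a minimal sketch. The abstract does not say how connected components are found or which features are analyzed, so everything below is an assumption made for illustration: the image is a binary grid (1 = ink), components are found by an 8-connected flood fill, and the hypothetical `guess_script` helper uses a simple width-to-line-height ratio (with an arbitrary `wide_ratio` threshold) to separate full-width glyphs from narrow ones. It is not the patented method.

```python
from collections import deque


def connected_components(image):
    """Step 1 (sketch): find 8-connected foreground regions in a binary image.

    `image` is a list of rows containing 0 (background) or 1 (ink).
    Returns bounding boxes as (top, left, bottom, right), inclusive,
    in raster-scan order of each component's first pixel.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r in range(height):
        for c in range(width):
            if image[r][c] == 0 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited ink pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < height and 0 <= nx < width
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def guess_script(box, line_height, wide_ratio=0.8):
    """Step 2 (hypothetical heuristic, not from the patent).

    A component about as wide as the text line is tall is labeled "wide"
    (typical of full-width glyphs); anything narrower is labeled "narrow".
    """
    _top, left, _bottom, right = box
    component_width = right - left + 1
    return "wide" if component_width >= wide_ratio * line_height else "narrow"


if __name__ == "__main__":
    # A square outline next to a thin vertical stroke on a 4-pixel-high line.
    sample = [
        [1, 1, 1, 1, 0, 1, 0],
        [1, 0, 0, 1, 0, 1, 0],
        [1, 0, 0, 1, 0, 1, 0],
        [1, 1, 1, 1, 0, 1, 0],
    ]
    for box in connected_components(sample):
        print(box, guess_script(box, line_height=len(sample)))
```

A per-component label like this could then guide where to cut character boundaries, for example by merging adjacent narrow components into one word while treating each wide component as its own character. A real system would likely rely on richer features than bounding-box width.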