首页> 外国专利> DETECTION AND RECONSTRUCTION OF RIGHT-TO-LEFT TEXT DIRECTION, LIGATURES AND DIACRITICS IN A FIXED FORMAT DOCUMENT

DETECTION AND RECONSTRUCTION OF RIGHT-TO-LEFT TEXT DIRECTION, LIGATURES AND DIACRITICS IN A FIXED FORMAT DOCUMENT

机译:固定格式文档中从右到左文本方向,连字和书法的检测和重构

摘要

Detection of right-to-left text direction, left-to-right text direction, ligatures and diacritics in fixed format documents for reconstruction of fixed format documents into flow format documents is provided. Each text run of a fixed format document is analyzed for directionality. If text runs contain ligatures, the ligatures are mapped to corresponding characters for proper reading order of the ligatures in context with other characters comprising a text run in which the ligatures are situated or neighboring the ligature. Each text run is collected based on determined text directionality for reconstruction in a flow format document. Proper text directionality for columns of text is determined in the same manner as proper text directionality for text runs in paragraphs of text. If diacritics are present in association with one or more characters or glyphs, a determination may be made as to a carrier character or glyph associated with each diacritic.
机译:提供了在固定格式文档中从右到左文本方向,从左至右文本方向,连字和变音符号的检测,以将固定格式文档重建为流格式文档。分析固定格式文档的每个文本行的方向性。如果文本行包含连字,则将连字映射到相应字符,以在上下文中与其他字符(包括连字位于或邻近连字的文本行)关联,以正确读取连字。基于确定的文本方向性收集每个文本运行,以在流格式文档中进行重建。确定文本列的正确文本方向性的方式与确定文本段落中文本的正确文本方向性的方式相同。如果变音符号与一个或多个字符或字形相关联地存在,则可以确定与每个变音符号相关的载体字符或字形。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号