Detection and Reconstruction of Right-to-Left Text Direction, Ligatures and Diacritics in a Fixed Format Document
Abstract
Right-to-left text direction, left-to-right text direction, ligatures, and diacritics are detected in fixed format documents so that those documents can be reconstructed as flow format documents. Each text run of a fixed format document is analyzed for directionality. If a text run contains ligatures, each ligature is mapped to its corresponding characters so that it is read in the proper order in context with the other characters of the text run in which it is situated, or of a neighboring text run. Text runs are then collected according to their determined directionality for reconstruction in a flow format document. Proper text directionality for columns of text is determined in the same manner as for text runs within paragraphs of text. If diacritics are present in association with one or more characters or glyphs, a determination may be made as to the carrier character or glyph associated with each diacritic.
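The three steps named in the abstract (directionality analysis, ligature mapping, and carrier determination for diacritics) can be illustrated with a minimal Python sketch. It is not the method of the patent, whose details the abstract does not give. It assumes a hypothetical `Glyph` record holding a character and its horizontal position, votes on direction using Unicode bidirectional classes, expands ligatures through Unicode compatibility decomposition, and picks as carrier the base glyph whose center is horizontally nearest to the diacritic. All function names are illustrative.

```python
import unicodedata
from dataclasses import dataclass


@dataclass
class Glyph:
    """Hypothetical record for one positioned glyph in a fixed format page."""
    char: str
    x: float      # left edge of the glyph
    width: float


def run_direction(text):
    """Vote on direction using strong Unicode bidirectional classes."""
    rtl = sum(unicodedata.bidirectional(c) in ("R", "AL") for c in text)
    ltr = sum(unicodedata.bidirectional(c) == "L" for c in text)
    if rtl == 0 and ltr == 0:
        return "neutral"          # digits, punctuation, spaces only
    return "rtl" if rtl > ltr else "ltr"


def expand_ligatures(text):
    """Map ligature code points to their constituent characters.

    Assumption: ligatures are encoded as Unicode characters whose name
    contains LIGATURE; NFKD then yields the parts in logical order.
    """
    out = []
    for c in text:
        if "LIGATURE" in unicodedata.name(c, ""):
            out.append(unicodedata.normalize("NFKD", c))
        else:
            out.append(c)
    return "".join(out)


def find_carrier(mark, bases):
    """Pick the base glyph whose center is horizontally closest to the mark."""
    mark_center = mark.x + mark.width / 2
    return min(bases, key=lambda g: abs(g.x + g.width / 2 - mark_center))


def logical_text(glyphs):
    """Return (direction, text in reading order) for one text run."""
    marks = [g for g in glyphs if unicodedata.combining(g.char)]
    bases = [g for g in glyphs if not unicodedata.combining(g.char)]
    if not bases:
        return "neutral", ""
    direction = run_direction("".join(g.char for g in bases))
    # Right-to-left runs are read from the largest x to the smallest.
    ordered = sorted(bases, key=lambda g: g.x, reverse=(direction == "rtl"))
    attached = {id(b): [] for b in ordered}
    for m in marks:
        attached[id(find_carrier(m, bases))].append(m.char)
    pieces = [expand_ligatures(b.char) + "".join(attached[id(b)])
              for b in ordered]
    return direction, "".join(pieces)
```

Calling `logical_text` on the glyphs of one text run returns the detected direction together with the characters in reading order, with each diacritic placed directly after its carrier. A fuller reconstruction would also have to handle mixed-direction runs and vertical offsets, which this sketch ignores.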