首页>
外国专利>
METHODS AND DEVICES THAT CONVERT THE IMAGES OF DOCUMENTS TO ELECTRONIC DOCUMENTS USING A TRIE-STRUCTURE OF DATA CONTAINING UNPARAMETED SYMBOLS FOR DETERMINING DEFINITIONS
METHODS AND DEVICES THAT CONVERT THE IMAGES OF DOCUMENTS TO ELECTRONIC DOCUMENTS USING A TRIE-STRUCTURE OF DATA CONTAINING UNPARAMETED SYMBOLS FOR DETERMINING DEFINITIONS
展开▼
机译:使用包含参数的无符号符号的数据结构来将文件的图像转换为电子文件的方法和设备
展开▼
页面导航
摘要
著录项
相似文献
摘要
The current application is directed to methods and systems that convert document images, which contain Arabic text and text in other languages in which symbols are joined together to produce continuous words and portions of words, into corresponding electronic documents. In one implementation, a document-image-processing method and system to which the current application is directed employs numerous techniques and features that render efficiently computable an otherwise intractable or impractical document-image-to-electronic-document conversion. These techniques and features include transformation of text-image morphemes and words into feature symbols with associated parameters, efficiently identifying similar morphemes and words in an electronic store of standard-feature-symbol-encoded morphemes and words, and identifying candidate inter-character division points and corresponding traversal paths using the similar morphemes and words identified in the word store.
展开▼