Methods for efficiently and systematically searching stock, image, and other non-word-based documents
Abstract
In one embodiment, a non-word-based information retrieval system searches stock or image documents in a large data source. A non-word-based document is first divided into a series of elements or an array of cells. Each element or cell is then matched against a set of predefined token patterns, and each match generates a named token. The collection of named tokens generated this way is a word-based representation of the non-word-based document. Once the tokens from all documents have been gathered into a master collection of tokens, the non-word-based documents can be searched efficiently and systematically, in a manner analogous to a document search in a word-based search system.
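
The pipeline in the abstract (divide into elements, match against token patterns, collect named tokens, search) might be sketched in Python roughly as follows. This is only an illustration under stated assumptions, not the patented method: the token names (`UP`, `DOWN`, `FLAT`), the 1% threshold, the function names, and the sample price series are all hypothetical, and a plain inverted index stands in for the "master collection of tokens".

```python
from collections import defaultdict

# Hypothetical predefined token patterns: each maps a token name to a
# predicate over one element (here, a day-over-day fractional price change).
TOKEN_PATTERNS = {
    "UP": lambda change: change > 0.01,
    "DOWN": lambda change: change < -0.01,
    "FLAT": lambda change: -0.01 <= change <= 0.01,
}

def split_into_elements(prices):
    """Divide a price series into elements: fractional changes between days."""
    return [(b - a) / a for a, b in zip(prices, prices[1:])]

def tokenize(prices):
    """Match every element against the patterns; each match yields a named token."""
    tokens = []
    for change in split_into_elements(prices):
        for name, matches in TOKEN_PATTERNS.items():
            if matches(change):
                tokens.append(name)
    return tokens

def build_index(documents):
    """Collect tokens from all documents into one index: token -> document ids."""
    index = defaultdict(set)
    for doc_id, prices in documents.items():
        for token in tokenize(prices):
            index[token].add(doc_id)
    return index

def search(index, query_tokens):
    """Return the ids of documents that contain every query token."""
    sets = [index.get(token, set()) for token in query_tokens]
    return set.intersection(*sets) if sets else set()

# Made-up sample data, for illustration only.
documents = {
    "AAA": [100.0, 103.0, 103.5, 99.0],
    "BBB": [50.0, 50.1, 50.2, 50.3],
}
index = build_index(documents)
```

Calling `search(index, ["UP", "DOWN"])` on this sample data looks up documents by token name exactly as a word-based engine looks up documents by word. An image document could presumably be handled the same way by replacing `split_into_elements` with a function that cuts the image into an array of cells and by defining patterns over cell contents.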