首页>
外国专利>
Topic identification and use thereof in information retrieval systems
Topic identification and use thereof in information retrieval systems
展开▼
机译:主题标识及其在信息检索系统中的使用
展开▼
页面导航
摘要
著录项
相似文献
摘要
A technique to determine topics associated with, or classifications for, a data corpus uses an initial domain-specific word list to identify word combinations (one or more words) that appear in the data corpus significantly more often than expected. Word combinations so identified are selected as topics and associated with a user-specified level of granularity. For example, topics may be associated with each table entry, each image, each sentence, each paragraph, or an entire file. Topics may be used to guide information retrieval and/or the display of topic classifications during user query operations.
展开▼