首页>
外国专利>
Document retrieval using internal dictionary-hierarchies to adjust per-subject match results
Document retrieval using internal dictionary-hierarchies to adjust per-subject match results
展开▼
机译:使用内部字典层次结构调整文档的主题检索文档
展开▼
页面导航
摘要
著录项
相似文献
摘要
Techniques for managing big data include retrieval using per-subject dictionaries having multiple levels of sub-classification hierarchy within the subject. Entries may include subject-determining-power (SDP) scores that provide an indication of the descriptive power of the entry term with respect to the subject of the dictionary containing the term. The same term may have entries in multiple dictionaries with different SDP scores in each of the dictionaries. A retrieval request for one or more documents containing search terms descriptive of the one or more documents can be processed by identifying a set of candidate documents tagged with subjects, i.e., identifiers of per-subject dictionaries having entries corresponding to a search term, then using affinity values to adjust the aggregate score for the terms in the dictionaries. Documents are then selected for best match to the subject based on the adjusted scores. Alternatively, the adjustment may be performed after selecting the documents by re-ordering them according to adjusted scores.
展开▼