首页> 外国专利> Document tagging and retrieval using per-subject dictionaries including subject-determining-power scores for entries

Document tagging and retrieval using per-subject dictionaries including subject-determining-power scores for entries

机译:使用每个主题的词典进行文档标记和检索,包括条目的主题确定能力得分

摘要

Techniques for managing big data include tagging of documents and subsequent retrieval using per-subject dictionaries having entries with subject-determining-power scores. The subject-determining-power scores provide an indication of the descriptive power of the term with respect to the subject of the dictionary containing the term. The same term may have entries in multiple dictionaries with different subject-determining-power scores in each of the dictionaries. A retrieval request for one or more documents containing search terms descriptive of the one or more documents can be processed identifying a set of candidate documents tagged with subjects and optional terms, and then applying subject-determining-power scores from the multiple dictionaries for the search term to determine a subject for the search term. The method then selects the one or more documents from the candidate documents according to the subject.
机译:用于管理大数据的技术包括为文档加标签以及使用具有主题确定能力得分的条目的每个主题词典进行后续检索。主题确定能力得分提供了该术语相对于包含该术语的词典主题的描述能力的指示。同一术语可能在多个词典中都有条目,而每个词典中的主题决定能力得分都不同。可以处理对包含描述一个或多个文档的搜索词的一个或多个文档的检索请求,以标识一组标有主题和可选术语的候选文档,然后将来自多个词典的主题确定能力得分应用于搜索确定搜索词的主题。然后,该方法根据主题从候选文档中选择一个或多个文档。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号