Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections

Abstract

Systems and methods for determining the authoritativeness of a document based on textual, non-topical cues. The authoritativeness of a document is determined by evaluating a set of document content features contained within each document to determine a set of document content feature values, processing the set of document content feature values through a trained document textual authority model, and determining a textual authoritativeness value and/or textual authority class for each document evaluated using the predictive models included in the trained document textual authority model. Estimates of a document's textual authoritativeness value and/or textual authority class can be used to re-rank documents previously retrieved by a search, to expand and improve document query searches, to provide a more complete and robust determination of a document's authoritativeness, and to improve the aggregation of rank-ordered lists with numerically-ordered lists.
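
The pipeline described in the abstract (extract content feature values, pass them through a trained model, obtain an authoritativeness value and class, then re-rank search results) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the three features, the weights in `WEIGHTS`, the `0.3` class threshold, and all function names are hypothetical and are not taken from the patent, which does not list its features or model in this abstract. A simple linear scorer stands in for the trained document textual authority model.

```python
import re

# Hypothetical weights standing in for a trained textual authority model.
# A real system would learn these from labeled documents.
WEIGHTS = {
    "avg_sentence_len": 0.02,   # longer sentences: weak positive cue (assumed)
    "exclamation_rate": -5.0,   # exclamation marks per word: negative cue (assumed)
    "citation_rate": 3.0,       # bracketed citations like [1] per word (assumed)
}
BIAS = 0.0


def content_features(text: str) -> dict:
    """Compute non-topical content feature values for one document."""
    words = text.split()
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = max(len(words), 1)
    return {
        "avg_sentence_len": len(words) / max(len(sentences), 1),
        "exclamation_rate": text.count("!") / n_words,
        "citation_rate": len(re.findall(r"\[\d+\]", text)) / n_words,
    }


def authority_value(text: str) -> float:
    """Score a document with the (hypothetical) linear authority model."""
    feats = content_features(text)
    return BIAS + sum(WEIGHTS[name] * value for name, value in feats.items())


def authority_class(value: float, threshold: float = 0.3) -> str:
    """Map a numeric authoritativeness value to a coarse class."""
    return "high" if value >= threshold else "low"


def rerank(docs: list) -> list:
    """Re-rank previously retrieved documents by descending authority value."""
    return sorted(docs, key=authority_value, reverse=True)


if __name__ == "__main__":
    retrieved = [
        "Buy now! Amazing cure! Works great!",
        "Clinical trials show a modest benefit of the treatment over placebo [1]. "
        "Results were replicated in two cohorts [2].",
    ]
    for doc in rerank(retrieved):
        value = authority_value(doc)
        print(f"{authority_class(value):>4}  {value:+.2f}  {doc[:40]}")
```

Because the features ignore topic words entirely, the same scorer can be applied to any result list; in practice the score would likely be combined with the original relevance ranking rather than replace it.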
