首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >s-HITSc: an improved model and algorithm for topic distillation on the Web
【24h】

s-HITSc: an improved model and algorithm for topic distillation on the Web

机译:s-HITSc:Web上主题提炼的改进模型和算法

获取原文
获取原文并翻译 | 示例
           

摘要

Topic distillation on the Web, namely, finding quality information sources related to a given query topic with hyperlink analysis, has been shown to be useful in Web IR. Based on the analysis of three deficiencies of classical topic distillation algorithm HITS, this paper presents an improved model and algorithm named s-HITSc. Given a query topic, the improved algorithm can model a neighborhood graph at site granularity, compute the relevance weights of the nodes to the topic with content analysis, and apply weighted I/O operations in its iterative hyperlink analysis. Theoretical analysis and experimental results show that s-HITSc can control topic drift and identify more reasonable and meaningful authority and hub sites on a given topic.
机译:Web上的主题提炼(即使用超链接分析找到与给定查询主题相关的质量信息源)已显示在Web IR中很有用。在分析经典主题蒸馏算法HITS的三个不足的基础上,提出了一种改进的模型和算法s-HITSc。给定一个查询主题,改进的算法可以在站点粒度上对邻域图建模,通过内容分析计算节点与主题的相关权重,并在其迭代超链接分析中应用加权I / O操作。理论分析和实验结果表明,s-HITSc可以控制主题漂移并确定给定主题上更合理和有意义的权限和中心站点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号