Using ODP metadata to personalize search

Abstract

The Open Directory Project (ODP) is one of the largest collaborative efforts to manually annotate web pages. It involves over 65,000 editors and has produced metadata specifying topic and importance for more than 4 million web pages. Still, given that this number is only about 0.05 percent of the web pages indexed by Google, is the effort enough to make a difference? In this paper we discuss how these metadata can be exploited to achieve high-quality personalized web search. First, we introduce an additional criterion for web page ranking: the distance between a user profile defined using ODP topics and the set of ODP topics covered by each URL returned by a regular web search. We show empirically that this enhancement yields better results than current web search using Google. Then, in the second part of the paper, we investigate the boundaries of biasing PageRank on subtopics of the ODP in order to automatically extend these metadata to the whole web.
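The ranking criterion described in the abstract can be illustrated with a small sketch. The abstract does not say which distance measure the paper uses, so everything below is an assumption made for illustration: ODP topics are written as slash-separated paths, the distance between two topics is the number of edges between them in the topic tree, the distance between a profile and a URL is the minimum over all topic pairs, and the function names (`tree_distance`, `profile_distance`, `rerank`) are hypothetical.

```python
from typing import Iterable, List, Set, Tuple


def tree_distance(a: str, b: str) -> int:
    """Edges between two topics in the topic tree (assumed measure).

    Topics are slash-separated paths such as "Arts/Movies/Action".
    """
    pa, pb = a.split("/"), b.split("/")
    common = 0
    for x, y in zip(pa, pb):
        if x != y:
            break
        common += 1
    # Steps up from a to the deepest common ancestor, then down to b.
    return (len(pa) - common) + (len(pb) - common)


def profile_distance(profile: Iterable[str], url_topics: Iterable[str]) -> float:
    """Smallest tree distance between any profile topic and any URL topic.

    URLs without ODP topics get an infinite distance (an assumption),
    so they sort after every annotated URL.
    """
    profile, url_topics = list(profile), list(url_topics)
    if not profile or not url_topics:
        return float("inf")
    return float(min(tree_distance(p, t) for p in profile for t in url_topics))


def rerank(results: List[Tuple[str, Set[str]]], profile: Set[str]) -> List[Tuple[str, Set[str]]]:
    """Reorder search results by distance to the user profile.

    sorted() is stable, so results at equal distance keep the
    search engine's original order.
    """
    return sorted(results, key=lambda r: profile_distance(profile, r[1]))


if __name__ == "__main__":
    # Hypothetical result list: (URL, ODP topics covering that URL).
    results = [
        ("a.example", {"Science/Physics"}),
        ("b.example", {"Arts/Movies/Action"}),
        ("c.example", set()),
    ]
    for url, topics in rerank(results, {"Arts/Movies"}):
        print(url, sorted(topics))
```

Taking the minimum over topic pairs rewards a URL that matches any one interest in the profile. An average over pairs would instead favor URLs that sit close to the profile as a whole. Either choice is a guess at the intent, not the paper's stated method.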
