Facet-Driven Blog Feed Retrieval

Abstract

The faceted blog distillation task retrieves blogs (i.e., RSS feeds) that are not only relevant to a query but also exhibit a specified facet of interest. The facets under consideration are opinionated vs. factual, personal vs. official, and in-depth vs. shallow. For the opinionated/factual facets, we propose a classifier that uses syntactic and semantic features to determine whether an opinion in a blog document is relevant to a given query. For the personal/official facets, we propose three classifiers, each trained under a different assumption, that categorize a blog document as either personal or official. For the in-depth/shallow facets, we propose to measure how deeply a blog document covers a given query by counting the occurrences of concepts related to the query. Dependencies among the facets are also discussed. Experimental results on the TREC Blogs06 and Blogs08 collections show that our techniques are not only effective in finding faceted blogs but also significantly outperform the best known results on both collections.
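The abstract gives no formula for the in-depth/shallow measure, so the following is only a rough sketch of the general idea of scoring depth from occurrences of query-related concepts. The function name `depth_score`, the single-word concept list, and the weighting (total occurrences multiplied by the fraction of distinct concepts covered) are all assumptions made for illustration, not the authors' method.

```python
import re
from collections import Counter


def depth_score(document: str, concepts: list[str]) -> float:
    """Hypothetical depth score for one blog document.

    Assumption: each concept is a single lowercase word. The score is the
    total number of concept occurrences, scaled by the fraction of distinct
    concepts that appear at least once.
    """
    wanted = {c.lower() for c in concepts}
    if not wanted:
        return 0.0
    # Crude tokenizer: lowercase alphanumeric runs (an assumption).
    counts = Counter(re.findall(r"[a-z0-9]+", document.lower()))
    occurrences = sum(counts[c] for c in wanted)
    covered = sum(1 for c in wanted if counts[c] > 0)
    return occurrences * (covered / len(wanted))


# Illustrative concept list for a query about coffee (made up for this sketch).
concepts = ["coffee", "espresso", "arabica", "roast"]
shallow = "I like coffee."
deep = ("Espresso uses pressure to brew coffee; arabica beans and "
        "roast level shape the flavor of coffee.")
print(depth_score(shallow, concepts), depth_score(deep, concepts))
```

Under these assumptions, a document that mentions many related concepts repeatedly scores higher than one that mentions the query term once. A real system would also need a way to obtain the related concepts for each query, which the abstract does not describe.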