首页> 外文会议>Information access evaluation: multilinguality, multimodality, and interaction >Supporting More-Like-This Information Needs: Finding Similar Web Content in Different Scenarios
【24h】

Supporting More-Like-This Information Needs: Finding Similar Web Content in Different Scenarios

机译:支持更多此类信息需求:在不同情况下查找相似的Web内容

获取原文
获取原文并翻译 | 示例

摘要

We examine more-like-this information needs in different scenarios. A more-like-this information need occurs, when the user sees one interesting document and wants to access other but similar documents. One of our foci is on comparing different strategies to identify related web content. We compare following links (i.e., crawling), automatically generating keyqueries for the seen document (i.e., queries that have the document in the top of their ranks), and search engine operators that automatically display related results. Our experimental study shows that in different scenarios different strategies yield the most promising related results. One of our use cases is to automatically support people who monitor right-wing content on the web. In this scenario, it turns out that crawling from a given set of seed documents is the best strategy to find related pages with similar content. Querying or the related-operator yield much fewer good results. In case of news portals, however, crawling is a bad idea since hardly any news portal links to other news portals. Instead, a search engine's related operator or querying are better strategies. Finally, for identifying related scientific publications for a given paper, all three strategies yield good results.
机译:我们研究了在不同情况下类似的信息需求。当用户看到一个有趣的文档并想要访问其他但相似的文档时,便会出现一种类似信息的需求。我们的重点之一是比较不同的策略来识别相关的Web内容。我们比较以下链接(即抓取),自动为可见的文档生成关键字查询(即,将文档放在其排名最高的查询中)以及自动显示相关结果的搜索引擎运算符。我们的实验研究表明,在不同的情况下,不同的策略会产生最有希望的相关结果。我们的用例之一是自动为监视Web右翼内容的人员提供支持。在这种情况下,事实证明,从一组给定的种子文档中进行爬网是查找具有相似内容的相关页面的最佳策略。查询或相关运算符产生的好结果要少得多。但是,对于新闻门户而言,爬网是个坏主意,因为几乎没有任何新闻门户链接到其他新闻门户。相反,搜索引擎的相关运算符或查询是更好的策略。最后,为了确定给定论文的相关科学出版物,所有三种策略均产生了良好的效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号