【24h】

Efficient Diversity-Aware Search

机译:高效的多样性感知搜索

获取原文

摘要

Typical approaches of ranking information in response to a user's query that return the most relevant results ignore important factors contributing to user satisfaction; for instance, the contents of a result document may be redundant given the results already examined. Motivated by emerging applications, in this work we study the problem of Diversity-Aware Search, the essence of which is ranking search results based on both their relevance, as well as their dissimilarity to other results reported. Diversity-Aware Search is generally a hard problem, and even tractable instances thereof cannot be efficiently solved by adapting existing approaches. We propose DivGen . an efficient algorithm for diversity-aware search, which achieves significant performance improvements via novel data access primitives. Although selecting the optimal schedule of data accesses is a hard problem, we devise the first low-overhead data access prioritization scheme with theoretical quality guarantees, and good performance in practice. A comprehensive evaluation on real and synthetic large-scale corpora demonstrates the efficiency and effectiveness of our approach.
机译:响应返回最相关结果的用户查询的信息排名的典型方法,忽略了有助于提高用户满意度的重要因素。例如,鉴于已经检查过的结果,结果文档的内容可能是多余的。受新兴应用程序的推动,在这项工作中,我们研究了多样性感知搜索的问题,其实质是根据搜索结果的相关性以及与其他报告结果的相似性对搜索结果进行排名。多样性感知搜索通常是一个难题,即使采用现有方法也无法有效解决其难处理的实例。我们建议使用DivGen。一种高效的多样性感知搜索算法,通过新颖的数据访问原语实现了显着的性能提升。尽管选择最佳的数据访问时间表是一个难题,但我们设计了第一个低开销的数据访问优先级排序方案,该方案具有理论上的质量保证,并且在实践中具有良好的性能。对真实和合成的大型语料库的综合评估证明了我们方法的有效性和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号