首页> 外文期刊>ACM SIGIR FORUM >A Utility Theoretic Approach to Determining Optimal Wait Times in Distributed Information Retrieval
【24h】

A Utility Theoretic Approach to Determining Optimal Wait Times in Distributed Information Retrieval

机译:确定分布式信息检索中最佳等待时间的实用理论方法

获取原文
获取原文并翻译 | 示例
           

摘要

Distributed IR systems query a large number of IR servers, merge the retrieved results and display them to users. Since different servers handle collections of different sizes, have different processing and bandwidth capacities, there can be considerable heterogeneity in their response times. The broker in the distributed IR system thus has to make decisions regarding terminating searches based on perceived value of waiting -retrieving more documents — and the costs imposed on users by waiting for more responses. In this paper, we apply utility theory to formulate the broker's decision problem. The problem is a stochastic nonlinear program. We use Monte Carlo simulations to demonstrate how the optimal wait time may be determined in the context of a comparison shopping engine that queries multiple store websites for price and product information. We use data gathered from 30 stores for a set of 60 books. Our research demonstrates how a broker can leverage information about past retrievals regarding distributions of server response time and relevance scores to optimize its performance. Our main contribution is the formulation of the decision model for optimal wait time and proposal of a solution method. Our results suggest that the optimal wait time is highly sensitive to the manner in which users value from a set of retrieved results differs from the sum of user value from each result evaluated independently. We also find that the optimal wait time increases with the size of the distributed collections, but only if user utility from a set of results is nearly equal to the sum of utilities from each result.
机译:分布式IR系统查询大量的IR服务器,合并检索到的结果并将其显示给用户。由于不同的服务器处理不同大小的集合,具有不同的处理能力和带宽容量,因此它们的响应时间可能存在很大的异质性。因此,分布式IR系统中的经纪人必须基于等待的感知价值(决定检索更多文档)以及通过等待更多响应而对用户造成的成本,来决定是否终止搜索。在本文中,我们应用效用理论来制定经纪人的决策问题。问题是随机非线性程序。我们使用蒙特卡洛模拟来演示如何在比较购物引擎的上下文中确定最佳等待时间,该比较购物引擎向多个商店网站查询价格和产品信息。我们将从30家商店收集的数据用于60套书籍。我们的研究表明,经纪人如何利用有关服务器响应时间和相关性得分分布的过去检索信息来优化其性能。我们的主要贡献是为最佳等待时间制定决策模型并提出解决方法。我们的结果表明,最佳等待时间对一组检索结果中的用户价值与独立评估的每个结果中的用户价值总和不同的方式高度敏感。我们还发现,最佳等待时间随分布式集合的大小而增加,但前提是一组结果中的用户实用程序几乎等于每个结果中的实用程序之和。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号