首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Composite Retrieval of Diverse and Complementary Bundles
【24h】

Composite Retrieval of Diverse and Complementary Bundles

机译:复合和互补束的复合检索

获取原文
获取原文并翻译 | 示例
           

摘要

Users are often faced with the problem of finding complementary items that together achieve a single common goal (e.g., a starter kit for a novice astronomer, a collection of question/answers related to low-carb nutrition, a set of places to visit on holidays). In this paper, we argue that for some application scenarios returning item bundles is more appropriate than ranked lists. Thus we define composite retrieval as the problem of finding $k$ bundles of complementary items. Beyond complementarity of items, the bundles must be valid w.r.t. a given budget, and the answer set of $k$ bundles must exhibit diversity. We formally define the problem and show that in its general form is ${bf NP}$ -hard  and that also the special cases in which each bundle is formed by only one item, or only one bundle is sought, are hard. Our characterization however suggests how to adopt a two-phase approach (Produce-and-Choose, or PAC) in which we first produce many valid bundles, and then we choose $k$ among them. For the first phase we devise two ad-hoc clustering algorithms, while for the second phase we adapt heuristics with approximation guarantees for a related problem. We also devise another approach which is based on first finding a $k$ -clustering and then selecting a valid bundle from each of the produced cl- sters (Cluster-and-Pick, or CAP). We compare experimentally the proposed methods on two real-world data sets: the first data set is given by a sample of touristic attractions in 10 large European cities, while the second is a large database of user-generated restaurant reviews from Yahoo! Local. Our experiments show that when diversity is highly important, CAP is the best option, while when diversity is less important, a PAC approach constructing bundles around randomly chosen pivots, is better.
机译:用户经常面临寻找共同实现单个共同目标的补充项目的问题(例如,针对新手天文学家的入门工具包,与低碳水化合物营养相关的问题/答案的集合,一组度假场所) )。在本文中,我们认为对于某些应用场景,返回项目包比排名列表更合适。因此,我们将复合检索定义为以下问题:查找 $ k $ 补充项目的捆绑包。除商品的互补性外,捆绑商品必须有效。给定的预算和 $ k $ 捆绑包必须表现出多样性。我们正式定义问题并显示其一般形式为 $ {bf NP} $ -hard,并且在特殊情况下,每个捆绑包仅由一项构成或仅寻求一个捆绑包也是很难的。但是,我们的表征建议了如何采用两阶段方法(Profuce-and-Choose或PAC),在该方法中,我们首先产生许多有效的束,然后选择 $ k $ <数学图形格式=“ GIF” fileref =“ bonchi-ieq4-2306678.gif” /> 。对于第一阶段,我们设计了两种即席聚类算法,而对于第二阶段,我们对启发式算法进行了调整,以解决相关问题。我们还设计了另一种方法,该方法基于首先找到 $ k $ -聚类,然后从每个生成的聚类(Cluster-and-Pick或CAP)中选择一个有效的软件包。我们在两个现实世界的数据集上实验比较了所提出的方法:第一个数据集是由10个欧洲大城市的旅游景点样本提供的,而第二个数据集是由Yahoo!本地。我们的实验表明,当多样性至关重要时,CAP是最好的选择,而当多样性重要性不那么重要时,采用PAC方法围绕随机选择的枢轴构建捆绑是更好的选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号