首页> 外文期刊>Information systems frontiers >Index partitioning through a bipartite graph model for faster similarity search in recommendation systems
【24h】

Index partitioning through a bipartite graph model for faster similarity search in recommendation systems

机译:通过二部图模型对索引进行分区,以加快推荐系统中的相似度搜索

获取原文
获取原文并翻译 | 示例
           

摘要

Scalability of a recommendation system is an important factor for large e-commerce sites containing millions of products visited by millions of users. Similarity search is the core operation in recommendation systems. In this paper, we explain a framework to alleviate performance bottleneck of similarity search for very large-scale recommendation systems. The framework employs inverted index for real-time similarity search and handles dynamic updates. As the inverted index gets larger, retrieving recommendations become computationally expensive. There are various works devoted to solve this problem, such as clustering and preprocessing to compute recommendations offline. Our solution is based on bipartite graph partitioning for minimizing the affinity between entities in different partitions. Number of operations in similarity search is reduced by executing search within the closest partitions to the query. Parts are balanced, so that computational loads of partitions are almost the same, which is significant for reducing the computational cost. Sequential experiments with several different recommendation approaches and large datasets consisting of millions of users and items validate the scalability of the proposed recommendation framework. Accuracy drops only by a small factor due to partitioning, if any. Even slight improvements in recommendation accuracy are observed in our collaborative filtering experiments.
机译:对于包含数百万用户访问过数百万种产品的大型电子商务站点,推荐系统的可伸缩性是一个重要因素。相似性搜索是推荐系统中的核心操作。在本文中,我们解释了一个缓解大型推荐系统相似搜索性能瓶颈的框架。该框架采用倒排索引进行实时相似性搜索并处理动态更新。随着反向索引变大,检索建议的计算量也越来越大。有许多工作致力于解决此问题,例如聚类和预处理以离线计算推荐。我们的解决方案基于二分图分区,以最小化不同分区中实体之间的亲和力。通过在最接近查询的分区中执行搜索,可以减少相似性搜索中的操作数。部件是平衡的,因此分区的计算负荷几乎相同,这对于降低计算成本非常重要。使用几种不同的推荐方法以及由数百万个用户和项目组成的大型数据集的顺序实验验证了所提出的推荐框架的可伸缩性。由于分区,精度(如果有的话)只会下降一小部分。在我们的协作过滤实验中,甚至可以看到推荐准确性的轻微提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号