首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Collaborative Clustering: Sample Complexity and Efficient Algorithms
【24h】

Collaborative Clustering: Sample Complexity and Efficient Algorithms

机译:协同聚类:样本复杂度和高效算法

获取原文
       

摘要

We study the problem of collaborative clustering. This problem is concerned with a set of items grouped into clusters that we wish to recover from ratings provided by users. The latter are also clustered, and each user rates a random but typical small number of items. The observed ratings are random variables whose distributions depend on the item and user clusters only. Unlike for collaborative filtering problems where one needs to recover both user and item clusters, here we only wish to classify items. The number of items rated by a user can be so small that anyway, estimating user clusters may be hopeless. For the collaborative clustering problem, we derive fundamental performance limits satisfied by any algorithm. Specifically, we identify the number of ratings needed to guarantee the existence of an algorithm recovering the clusters with a prescribed level of accuracy. We also propose SplitSpec, an algorithm whose performance matches these fundamental performance limit order-wise. In turn, SplitSpec is able to exploit, as much as this is possible, the users’ structure to improve the item cluster estimates.
机译:我们研究了协作集群的问题。此问题与我们希望从用户提供的分级中恢复的一组项目组成的群集有关。后者也被聚类,并且每个用户对随机但典型的少量项目进行评分。观察到的评级是随机变量,其分布仅取决于项目和用户群。与需要恢复用户和项目集群的协作过滤问题不同,这里我们只希望对项目进行分类。用户评分的项目数量可能非常少,以至于估计用户群可能毫无希望。对于协作聚类问题,我们得出任何算法都可以满足的基本性能极限。具体来说,我们确定所需的等级数量,以保证存在一种以规定的准确度恢复群集的算法。我们还提出了SplitSpec,该算法的性能与这些基本性能限制按顺序匹配。反过来,SplitSpec能够尽可能利用用户的结构来改善项目聚类估计。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号