首页> 外文期刊>International Journal of Modern Physics, C. Physics and Computers >Information filtering via clustering coefficients of user-object bipartite networks
【24h】

Information filtering via clustering coefficients of user-object bipartite networks

机译:通过用户-对象双向网络的聚类系数进行信息过滤

获取原文
获取原文并翻译 | 示例
       

摘要

The clustering coefficient of user-object bipartite networks is presented to evaluate the overlap percentage of neighbors rating lists, which could be used to measure interest correlations among neighbor sets. The collaborative filtering (CF) information filtering algorithm evaluates a given user's interests in terms of his/her friends' opinions, which has become one of the most successful technologies for recommender systems. In this paper, different from the object clustering coefficient, users' clustering coefficients of user-object bipartite networks are introduced to improve the user similarity measurement. Numerical results for MovieLens and Netflix data sets show that users' clustering effects could enhance the algorithm performance. For MovieLens data set, the algorithmic accuracy, measured by the average ranking score, can be improved by 12.0% and the diversity could be improved by 18.2% and reach 0.649 when the recommendation list equals to 50. For Netflix data set, the accuracy could be improved by 14.5% at the optimal case and the popularity could be reduced by 13.4% comparing with the standard CF algorithm. Finally, we investigate the sparsity effect on the performance. This work indicates the user clustering coefficients is an effective factor to measure the user similarity, meanwhile statistical properties of user-object bipartite networks should be investigated to estimate users' tastes.
机译:提出了用户-对象双向网络的聚类系数,以评估邻居评级列表的重叠百分比,该度量可用于度量邻居集合之间的兴趣相关性。协作过滤(CF)信息过滤算法根据给定用户的朋友的意见来评估其兴趣,这已成为推荐系统最成功的技术之一。本文从对象聚类系数的角度出发,介绍了用户-对象双向网络的用户聚类系数,以提高用户相似度的度量。 MovieLens和Netflix数据集的数值结果表明,用户的聚类效果可以提高算法性能。对于MovieLens数据集,按平均排名得分衡量,算法准确性可以提高12.0%,而推荐列表等于50时,多样性可以提高18.2%,达到0.649。对于Netflix数据集,准确性可以与标准CF算法相比,在最佳情况下可提高14.5%的普及率,而流行度则可降低13.4%。最后,我们研究了稀疏性对性能的影响。这项工作表明,用户聚类系数是衡量用户相似性的有效因素,同时应研究用户-对象双向网络的统计属性,以估计用户的品味。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号