首页> 外文期刊>Arabian Journal for Science and Engineering. Section A, Sciences >An Effective Model for Jaccard Coefficient to Increase the Performance of Collaborative Filtering
【24h】

An Effective Model for Jaccard Coefficient to Increase the Performance of Collaborative Filtering

机译:JAccard系数的有效模型,增加了协同滤波性能

获取原文
获取原文并翻译 | 示例
       

摘要

Due to the advancement of technology and an increased number of digital devices per person, more and more digital data are generated daily. Extracting required data from such big data is a challenging task. Recommender systems help us in finding data that best match one’s taste. Collaborative filtering (CF) is the most popular approach used in recommender systems. Various similarity measure techniques are used in CF to calculate item-to-item and user-to-user similarity. The majority of these methods use common ratings to compute similarity. One of the similarity measurement methods is Jaccard similarity, which ignores both absolute values of ratings and the average rating value of a user. In this paper, we propose an improved measure that considers the ratio between absolute rating values and number of commonly rated items. We further improved the performance of proposed similarity measure by putting some thresholds on the average rating value of a user. An important aspect of ratings provided by a user is the rating preference behavior of a user, which almost all similarity measurement methods ignore. We also incorporated this behavior in our proposed method. The proposed method is tested over five publicly available datasets: Epinions, FilmTrust, Movie Lens-100K, CiaoDVD and MovieTweetings. The proposed method is compared with various modern similarity measures, and results show improvements in terms of prediction quality and accuracy.
机译:由于技术的进步和每个人的数字设备数量增加,每天产生越来越多的数字数据。从这些大数据中提取所需数据是一个具有挑战性的任务。推荐器系统帮助我们查找最佳匹配品味的数据。协作过滤(CF)是推荐系统中最流行的方法。 CF中使用各种相似度测量技术来计算项目到项目和用户到用户的相似性。这些方法的大多数使用共同的评级来计算相似性。其中一个相似度测量方法是Jaccard相似性,它忽略了额定值的绝对值和用户的平均评级值。在本文中,我们提出了一种改进的措施,旨在考虑绝对评级值与常定项目数量之间的比率。我们进一步提高了提出了所提出的相似度测量的性能,通过对用户的平均评级值提出一些阈值。用户提供的评级的一个重要方面是用户的额定偏好行为,几乎所有相似度测量方法忽略。我们还以我们提出的方法纳入了这种行为。所提出的方法在五个公共可用的数据集中进行测试:渗透,薄膜软件,电影镜-100k,Ciaodvd和Movieweetings。该方法与各种现代相似度措施进行比较,结果显示了预测质量和准确性的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号