Collaborative Filtering on Skewed Datasets


Abstract

Many real-life datasets have skewed distributions of events, in which the probability of observing a few events far exceeds that of the others. In this paper, we observe that on skewed datasets the state-of-the-art collaborative filtering methods perform worse than a simple probabilistic model. Our test bench includes a real ad click-stream dataset, which is naturally skewed. The same conclusion is obtained even from the popular movie rating dataset when we pose a binary prediction problem of whether or not a user will give the maximum rating to a movie.
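The binary formulation mentioned in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the 1 to 5 rating scale, the toy data, the function names, and the smoothed per-item frequency estimate. The abstract does not specify the paper's probabilistic model, so this should not be read as the authors' method.

```python
from collections import defaultdict

MAX_RATING = 5  # assumption: ratings on a 1-5 scale


def binarize(ratings, max_rating=MAX_RATING):
    """Turn (user, item, rating) triples into (user, item, label),
    where label is 1 only if the rating equals the maximum."""
    return [(u, i, 1 if r == max_rating else 0) for u, i, r in ratings]


def positive_rate(events):
    """Fraction of positive labels; a low value indicates a skewed dataset."""
    labels = [y for _, _, y in events]
    return sum(labels) / len(labels)


def item_positive_rates(events, prior, strength=1.0):
    """Smoothed per-item estimate of P(label = 1 | item).
    Hypothetical baseline: item frequency shrunk toward the global prior."""
    pos = defaultdict(int)
    tot = defaultdict(int)
    for _, item, y in events:
        pos[item] += y
        tot[item] += 1
    return {
        item: (pos[item] + strength * prior) / (tot[item] + strength)
        for item in tot
    }


# Toy data, invented for illustration only.
ratings = [
    ("u1", "m1", 5), ("u2", "m1", 5), ("u3", "m1", 4),
    ("u1", "m2", 3), ("u2", "m2", 2), ("u3", "m3", 1),
    ("u4", "m3", 3), ("u4", "m2", 4),
]
events = binarize(ratings)
prior = positive_rate(events)
rates = item_positive_rates(events, prior)
```

On this toy data only 2 of the 8 events are positive, so the labels are skewed toward the negative class, and the per-item estimate ranks `m1` above the other items.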


Bibliographic record

Source
Proceedings of the 17th International World Wide Web Conference (WWW08) | 2008 | 2 pages
Authors
Somnath Banerjee; Krishnan Ramanathan
Original format: PDF
Keywords
Collaborative filtering; skewed dataset; pLSA

Date indexed: 2022-08-26 14:16:14

Similar literature


1. Improving collaborative filtering's rating prediction coverage in sparse datasets by exploiting the 'friend of a friend' concept [J]. Dionisis Margaris, Costas Vassilakis. International Journal of Big Data Intelligence. 2020, No. 1
2. IDENTIFYING USER AND GROUP INFORMATION FROM COLLABORATIVE FILTERING DATASETS [J]. JOSEPHINE GRIFFITH, COLM ORIORDAN, HUMPHREY SORENSEN. International Journal of Pattern Recognition and Artificial Intelligence. 2007, No. 2
3. Conglomeration of Instance Filtering's k-Nearest Neighborhood and Collaborative Filtering's Item based Recommendation on Airline Dataset System using Map-Reduce and Mahout [J]. Mrs. D.N.V.S.L.S. Indira, Dr. R. Kiran Kumar. International Journal on Computer Science and Engineering. 2016, No. 6
4. Federated CF: Privacy-Preserving Collaborative Filtering Cross Multiple Datasets [C]. Le Wang, Zijun Huang, Qingqi Pei. IEEE International Conference on Communications. 2020
5. Modeling and local filtering of noise embedded in genome-scale microarray datasets [D]. Fathallah-Shaykh, Hassan M. 2007
6. An improved filtering algorithm for big read datasets and its application to single-cell assembly [O]. Axel Wedemeyer, Lasse Kliemann, Anand Srivastav. 2017
7. Collaborative Filtering on Skewed Datasets [O]. Somnath Banerjee. 2014
