首页> 外文会议>ACMKDD International Conference on Knowledge Discovery and Data Mining;KDD 2008 >Efficient Computation of Personal Aggregate Queries on Blogs
【24h】

Efficient Computation of Personal Aggregate Queries on Blogs

机译:博客上个人综合查询的有效计算

获取原文

摘要

There is an exploding amount of user-generated content on the Web due to the emergence of "Web 2.0" services, such as Blogger, MyS-pace, Flickr, and del.icio.us. The participation of a large number of users in sharing their opinion on the Web has inspired researchers to build an effective "information filter" by aggregating these independent opinions. However, given the diverse groups of users on the Web nowadays, the global aggregation of the information may not be of much interest to different groups of users. In this paper, we explore the possibility of computing personalized aggregation over the opinions expressed on the Web based on a user's indication of trust over the information sources. The hope is that by employing such "personalized" aggregation, we can make the recommendation more likely to be interesting to the users. We address the challenging scalability issues by proposing an efficient method, that utilizes two core techniques: Non-Negative Matrix Factorization and Threshold Algorithm, to compute personalized aggregations when there are potentially millions of users and millions of sources within a system. We show that, through experiments on real-life dataset, our personalized aggregation approach indeed makes a significant difference in the items that are recommended and it reduces the query computational cost significantly, often more than 75%, while the result of personalized aggregation is kept accurate enough.
机译:由于出现了诸如Blogger,MyS-pace,Flickr和del.icio.us之类的“ Web 2.0”服务,因此用户在Web上生成的内容激增。大量用户参与在Web上共享他们的意见,这激发了研究人员通过汇总这些独立意见来构建有效的“信息过滤器”。但是,考虑到当今Web上的用户群体各不相同,信息的全球汇总对于不同的用户群体可能并没有太大的意义。在本文中,我们探讨了基于用户对信息源信任程度的指示,根据Web上表达的观点计算个性化聚合的可能性。希望是通过采用这种“个性化”聚合,我们可以使推荐对用户来说更有趣。我们通过提出一种有效的方法来解决具有挑战性的可伸缩性问题,该方法利用两种核心技术:非负矩阵分解和阈值算法,以在系统中可能有数百万个用户和数百万个源的情况下计算个性化聚合。我们显示,通过对真实数据集进行的实验,我们的个性化聚合方法确实在建议的项目上产生了显着差异,并且显着降低了查询计算成本(通常超过75%),同时保持了个性化聚合的结果足够准确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号