Efficient Computation of Personal Aggregate Queries on Blogs

机译：博客上个人综合查询的有效计算

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

There is an exploding amount of user-generated content on the Web due to the emergence of "Web 2.0" services, such as Blogger, MyS-pace, Flickr, and del.icio.us. The participation of a large number of users in sharing their opinion on the Web has inspired researchers to build an effective "information filter" by aggregating these independent opinions. However, given the diverse groups of users on the Web nowadays, the global aggregation of the information may not be of much interest to different groups of users. In this paper, we explore the possibility of computing personalized aggregation over the opinions expressed on the Web based on a user's indication of trust over the information sources. The hope is that by employing such "personalized" aggregation, we can make the recommendation more likely to be interesting to the users. We address the challenging scalability issues by proposing an efficient method, that utilizes two core techniques: Non-Negative Matrix Factorization and Threshold Algorithm, to compute personalized aggregations when there are potentially millions of users and millions of sources within a system. We show that, through experiments on real-life dataset, our personalized aggregation approach indeed makes a significant difference in the items that are recommended and it reduces the query computational cost significantly, often more than 75%, while the result of personalized aggregation is kept accurate enough.

机译：由于出现了诸如Blogger，MyS-pace，Flickr和del.icio.us之类的“ Web 2.0”服务，因此用户在Web上生成的内容激增。大量用户参与在Web上共享他们的意见，这激发了研究人员通过汇总这些独立意见来构建有效的“信息过滤器”。但是，考虑到当今Web上的用户群体各不相同，信息的全球汇总对于不同的用户群体可能并没有太大的意义。在本文中，我们探讨了基于用户对信息源信任程度的指示，根据Web上表达的观点计算个性化聚合的可能性。希望是通过采用这种“个性化”聚合，我们可以使推荐对用户来说更有趣。我们通过提出一种有效的方法来解决具有挑战性的可伸缩性问题，该方法利用两种核心技术：非负矩阵分解和阈值算法，以在系统中可能有数百万个用户和数百万个源的情况下计算个性化聚合。我们显示，通过对真实数据集进行的实验，我们的个性化聚合方法确实在建议的项目上产生了显着差异，并且显着降低了查询计算成本（通常超过75％），同时保持了个性化聚合的结果足够准确。

著录项

来源
《ACMKDD International Conference on Knowledge Discovery and Data Mining;KDD 2008》|2008年|614-622|共9页
会议地点
作者
Ka Cheung Sia; Junghoo Cho; Yun Chi; Belle L. Tseng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息与知识传播;
关键词
persoanlized recommendation; aggregate queries; matrix factor-ization; web-mining;

机译：个性化推荐;汇总查询;矩阵分解网络挖掘;

相似文献

外文文献
中文文献
专利

1. Efficient Computation of Range Aggregates against Uncertain Location-Based Queries [J] . Zhang Ying Knowledge and Data Engineering, IEEE Transactions on . 2012,第7期

机译：针对不确定的基于位置的查询的范围聚合的有效计算
2. Cache-based Aggregate Query Shipping: An Efficient Scheme Of Distributed Olap Query Processing [J] . Hua, Ming Liao, Guo, Journal of Computer Science & Technology . 2008,第6期

机译：基于缓存的聚合查询传送：分布式Olap查询处理的有效方案
3. Cache-Based Aggregate Query Shipping: An Efficient Scheme of Distributed OLAP Query Processing [J] . Hua-Ming Liao, Guo-Shun Pei 计算机科学技术学报（英文版） . 2008,第006期

机译：基于缓存的聚合查询传送：分布式OLAP查询处理的有效方案
4. Efficient computation of personal aggregate queries on blogs [C] . Ka Cheung Sia, Junghoo Cho, Yun Chi, ACM SIGKDD international conference on Knowledge discovery and data mining . 2008

机译：博客上个人汇总查询的高效计算
5. An efficient index for computation of approximate nearest neighbors with query specified dimension relevance weights [D] . Katz, David 2015

机译：用于计算具有查询指定维相关权重的近似最近邻居的有效索引
6. What Women With Disabilities Write in Personal Blogs About Pregnancy and Early Motherhood: Qualitative Analysis of Blogs [O] . Michelle L Litchman, MJ Tran, Susan E Dearden, 2019

机译：残疾妇女在个人博客中写的有关怀孕和早孕的内容：博客的定性分析
7. Privacy-preserving computation and verification of aggregate queries on outsourced databases [O] . Brian Thompson, Stuart Haber, William G. Horne, 2009

机译：保护隐私的计算和对外包数据库的聚合查询的验证

Efficient Computation of Personal Aggregate Queries on Blogs

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅