首页> 外文学位 >A collaborative filtering approach to predict web pages of interest from navigation patterns of past users within an academic website.
【24h】

A collaborative filtering approach to predict web pages of interest from navigation patterns of past users within an academic website.

机译:一种协作过滤方法,可根据学术网站内过去用户的导航模式来预测感兴趣的网页。

获取原文
获取原文并翻译 | 示例

摘要

This dissertation is a simulation study of factors and techniques involved in designing hyperlink recommender systems that recommend to users, web pages that past users with similar navigation behaviors found interesting. The methodology involves identification of pertinent factors or techniques, and for each one, addresses the following questions: (a) room for improvement; (b) better approach, if any; and (c) performance characteristics of the technique in environments that hyperlink recommender systems operate in. The following four problems are addressed:; Web page classification. A new metric (PageRank x Inverse Links-to-Word count ratio) is proposed for classifying web pages as content or navigation, to help in the discovery of user navigation behaviors from web user access logs. Results of a small user study suggest that this metric leads to desirable results.; Data mining. A new apriori algorithm for mining association rules from large databases is proposed. The new algorithm addresses the problem of scaling of the classical apriori algorithm by eliminating an expensive join step, and applying the apriori property to every row of the database. In this study, association rules show the correlation relationships between user navigation behaviors and web pages they find interesting. The new algorithm has better space complexity than the classical one, and better time efficiency under some conditions and comparable time efficiency under other conditions.; Prediction models for user interests. We demonstrate that association rules that show the correlation relationships between user navigation patterns and web pages they find interesting can be transformed into collaborative filtering data. We investigate collaborative filtering prediction models based on two approaches for computating prediction scores: using simple averages and weighted averages. Our findings suggest that the weighted averages scheme more accurately computes predictions of user interests than the simple averages scheme does.; Clustering. Clustering techniques are frequently applied in the design of personalization systems. We studied the performance of the CLARANS clustering algorithm in high dimensional space in relation to the PAM and CLARA clustering algorithms. While CLARA had the best time performance, CLARANS resulted in clusters with the lowest intra-cluster dissimilarities, and so was most effective in this regard.
机译:本文是对设计超链接推荐系统的因素和技术的仿真研究,这些超链接推荐系统向用户推荐过去具有类似导航行为的用户感兴趣的网页。该方法论涉及确定相关因素或技术,并针对每个因素或技术解决以下问题:(a)改进的空间; (b)更好的办法,如果有的话; (c)该技术在超链接推荐系统运行的环境中的性能特征。解决了以下四个问题:网页分类。提出了一种新的度量标准(PageRank x反向链接与单词计数比率),用于将网页分类为内容或导航,以帮助从Web用户访问日志中发现用户导航行为。一项小型用户研究的结果表明,该指标可带来理想的结果。数据挖掘。提出了一种从大型数据库中挖掘关联规则的新先验算法。新算法通过消除昂贵的连接步骤,并将apriori属性应用于数据库的每一行,解决了经典apriori算法的伸缩问题。在这项研究中,关联规则显示了用户导航行为与他们感兴趣的网页之间的关联关系。与传统算法相比,新算法具有更好的空间复杂度,在某些情况下具有更高的时间效率,而在其他条件下则具有可比的时间效率。用户兴趣的预测模型。我们证明,显示用户导航模式和他们感兴趣的网页之间的关联关系的关联规则可以转换为协作过滤数据。我们研究基于两种用于计算预测分数的方法的协作过滤预测模型:使用简单平均值和加权平均值。我们的发现表明,加权平均方案比简单平均方案更准确地计算用户兴趣的预测。聚类。聚类技术经常用于个性化系统的设计中。我们针对PAM和CLARA聚类算法研究了CLARANS聚类算法在高维空间中的性能。尽管CLARA具有最佳的时间性能,但CLARANS导致集群内集群差异最小的集群,因此在这方面最有效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号