首页> 外文会议> >Similarity-Based Fuzzy Clustering for User Profiling
【24h】

Similarity-Based Fuzzy Clustering for User Profiling

机译:基于相似度的用户聚类模糊聚类

获取原文

摘要

User profiling is a fundamental task in Web personalization. Fuzzy clustering is a valid approach to derive user profiles by capturing similar user interests from web usage data available in log files. Often, fuzzy clustering is based on the assumption that data lay on an Euclidean space; however, clustering based on Euclidean distance can lead the clustering process to find user representations that do not capture the semantic information incorporated in the original Web usage data. In this paper, we propose a different approach to express similarity between Web users. The measure is based on the evaluation of similarity between fuzzy sets. The proposed measure is employed in a relational fuzzy clustering algorithm to discover clusters embedded in the Web usage data and derive profiles modeling the real user preferences. An application example on usage data extracted from logfiles of a sample Web site is reported and a comparison with the results obtained using the cosine measure is shown to demonstrate the effectiveness of the proposed similarity measure.
机译:用户配置文件是Web个性化的一项基本任务。模糊聚类是通过从日志文件中可用的Web使用数据中捕获相似的用户兴趣来导出用户配置文件的有效方法。通常,模糊聚类基于数据位于欧几里得空间上的假设。但是,基于欧几里德距离的聚类可以导致聚类过程找到未捕获原始Web使用数据中包含的语义信息的用户表示。在本文中,我们提出了一种不同的方法来表达Web用户之间的相似性。该度量基于对模糊集之间相似性的评估。所提出的措施用于关系模糊聚类算法中,以发现嵌入在Web使用数据中的聚类,并导出对真实用户偏好进行建模的配置文件。报告了一个有关从示例网站的日志文件中提取的使用情况数据的应用示例,并显示了与使用余弦测度获得的结果的比较,以证明所提出的相似度测度的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号