首页> 外文会议>2011 International Symposium on Artificial Intelligence and Signal Processing >A webpage similarity measure for web sessions clustering using sequence alignment
【24h】

A webpage similarity measure for web sessions clustering using sequence alignment

机译:使用序列比对的Web会话聚类的网页相似性度量

获取原文

摘要

Web sessions clustering is a process of web usage mining task that aims to group web sessions with similar trends and usage patterns into clusters. This process is crucial for effective website management, web personalization and developing web recommender systems. Accurate clustering of web sessions is highly dependent to the similarity measure defined to compare web sessions. In this paper, we propose a similarity measure for comparing web sessions. The sequential order of web navigations in sessions is considered using sequence alignment method. Furthermore, we propose to consider the usage similarity of two web sessions based on the time a user spends on a webpage, and also the frequency of visit of each page within the session. The proposed method is validated by clustering a collection of web sessions using an agglomerative clustering technique and comparing the results with available methods. The experimental results show effectiveness of the proposed method to capture the properties of web session data.
机译:Web会话群集是Web使用情况挖掘任务的过程,旨在将具有类似趋势和使用模式的Web会话分组到群集中。此过程对于有效的网站管理,网站个性化和开发网站推荐系统至关重要。 Web会话的准确群集高度依赖于为比较Web会话而定义的相似性度量。在本文中,我们提出了一种用于比较Web会话的相似性度量。使用序列比对方法考虑会话中Web导航的顺序。此外,我们建议基于用户在网页上花费的时间以及会话中每个页面的访问频率来考虑两个Web会话的用法相似性。通过使用聚集聚类技术对Web会话的集合进行聚类并将结果与​​可用方法进行比较来验证所提出的方法。实验结果表明,该方法可以有效地捕获Web会话数据的属性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号