【24h】

A new system for discovering usage profiles

机译:用于发现使用情况配置文件的新系统

获取原文

摘要

The exponential growth of information available on the web, makes need to intelligence systems can give required information immediately, more than before. Web Usage Mining (WUM) is one of the web mining techniques, that extracts useful web usage patterns from user's accessed web pages. In web personalization based on WUM, a set of objects like text, product, link and etc. with respect to a user's interests and preferences, recommend to active user. The operation is done with adoption active session of user with discovered usage profiles. Clustering is one of the usage profile discovery methods. In the distance based clustering methods, the precision of distance measure is very important. To effectively provide sage profiles, we have proposed a system that process log files of web servers then extract user sessions and prepare them for clustering and discovering usage profiles. We have introduced a new hybrid distance measure that involved both syntactic similarities between URL's of session and time connectivity between all pairs of URL's in a session. Our experimental results on music machine dataset show that by using new distance measure in proposed system, cluster coherency and accuracy of the usage profiles are increased.
机译:Web上可用信息的呈指数增长,使得情报系统可以比以前更多地立即提供所需的信息。 Web用法挖掘(WUM)是一种Web挖掘技术,可从用户访问的网页中提取有用的Web使用模式。在基于WUM的Web个性化中,针对用户兴趣和偏好的一组对象(如文本,产品,链接等)推荐给活动用户。该操作通过具有发现的使用情况配置文件的用户的采用活动会话来完成。群集是使用情况配置文件发现方法之一。在基于距离的聚类方法中,距离测量的精度非常重要。为了有效地提供贤者配置文件,我们提出了一个系统,该系统处理Web服务器的日志文件,然后提取用户会话并为它们进行集群和发现使用情况配置文件做准备。我们引入了一种新的混合距离度量,该度量涉及会话URL之间的语法相似性以及会话中所有URL对之间的时间连接性。我们在音乐机器数据集上的实验结果表明,通过在提出的系统中使用新的距离度量,可以提高群集一致性和使用情况配置文件的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号