...
首页> 外文期刊>Fuzzy sets and systems >Mountain density-based fuzzy approach for discovering web usage clusters from web log data
【24h】

Mountain density-based fuzzy approach for discovering web usage clusters from web log data

机译:基于山密度的模糊方法,可从Web日志数据中发现Web使用集群

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Due to the continuous proliferation of e-businesses, there is intense competition among organizations to attract and retain customers. Analyses of the web server logs of these organizations are critical for obtaining insights into web usage behavior, which can support the design of more attractive web structures. In this study, we propose a mountain density function (MDF)-based fuzzy clustering framework for discovering user session clusters in web log data. The major steps in this framework include web log preprocessing, MDF-based discovery of fuzzy user session clusters, and validation of these clusters. To consider the high dimensionality of user session data, we propose a fuzzy approach for assigning weights to user sessions. Fuzzy c-means (FCM) and fuzzy c-medoids (FCMed) algorithms are used to cluster the user sessions. The selection of suitable initial cluster centers is a major challenge for these methods, so we propose MDF-based FCM (MDFCM) and FCMed (MDFCMed) algorithms to overcome this problem. MDF-based clustering is also used to estimate the number of clusters. Our results clearly indicate that the quality of the clusters formed using the proposed algorithms is much better in terms of various validity measures compared with the FCM and FCMed algorithms. (C) 2015 Elsevier B.V. All rights reserved.
机译:由于电子商务的不断增长,组织之间在吸引和保留客户方面存在激烈的竞争。这些组织的Web服务器日志的分析对于获得对Web使用行为的见解至关重要,这可以支持设计更具吸引力的Web结构。在这项研究中,我们提出了一种基于山区密度函数(MDF)的模糊聚类框架,用于发现Web日志数据中的用户会话聚类。该框架中的主要步骤包括Web日志预处理,基于MDF的模糊用户会话群集的发现以及这些群集的验证。为了考虑用户会话数据的高维性,我们提出了一种为用户会话分配权重的模糊方法。模糊c均值(FCM)和模糊c均值(FCMed)算法用于对用户会话进行聚类。选择合适的初始聚类中心是这些方法的主要挑战,因此我们提出了基于MDF的FCM(MDFC​​M)和FCMed(MDFC​​Med)算法来克服此问题。基于MDF的群集也可用于估计群集的数量。我们的结果清楚地表明,与FCM和FCMed算法相比,使用各种算法形成的聚类质量在各种有效性度量方面要好得多。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号