首页> 外文会议>Pacific-Asia conference on knowledge discovery and data mining >Topic Analysis of Web User Behavior Using LDA Model on Proxy Logs
【24h】

Topic Analysis of Web User Behavior Using LDA Model on Proxy Logs

机译:使用LDA模型在代理日志上的Web用户行为主题分析

获取原文

摘要

We propose a web user profiling and clustering framework based on LDA-based topic modeling with an analogy to document analysis in which documents and words represent users and their actions. The main technical challenge addressed here is how to symbolize web access actions, by words, that are monitored through a web proxy. We develop a hierarchical URL dictionary generated from Yahoo! Directory and a cross-hierarchical matching method that provides the function of automatic abstraction. We apply the proposed framework to 7500 students in Osaka University. The results include, for example, 24 topics such as "Technology Oriented", "Job Hunting", and "SNS-addict." The results reflect the typical interest profiles of University students, while perplexity analysis is employed to confirm the optimality of the framework.
机译:我们提出了一种基于LDA的主题建模的Web用户分析和聚类框架,其模拟文件分析,其中文档和单词代表用户及其操作。这里解决的主要技术挑战是如何通过Web代理监视的单词符号化Web访问操作。我们开发了从Yahoo!生成的分层URL字典目录和交叉层次匹配方法,提供自动抽象的功能。我们将拟议的框架应用于大阪大学7500名学生。结果包括,例如,24个主题,如“技术为导向”,“求职”和“SNS成瘾者”。结果反映了大学生的典型利益概况,而困惑分析用于确认框架的最优性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号