International Conference on Very Large Databases

INSITE: A Tool for Real-Time Knowledge Discovery from Users Web Navigation



Abstract

The major challenges in web mining are a) tracking the data accurately (as not everything is reported to the web server), b) real-time acquisition of the huge volume of data (435 million visits to Yahoo per day, 2-4 GB of clickstream data per hour), c) real-time interpretation of the data (on the order of seconds for personalization and targeting information) without compromising the privacy of the user, and d) visualization of the data to facilitate policy making. To address these challenges, we demonstrate an integrated software platform called INSITE, with the following aims: a) to accurately track users' interactions with a web space with minimum overhead and no voluntary user participation, b) to generate individual and aggregate user profiles in real time (or off-line) through the use of a unique Connectivity Matrix Model (CM-model), c) to show the efficacy and scalability of the CM-model in capturing the essence of the users' participatory attributes in the context of the web, d) to visualize the result of clustering users' navigation paths in real time by leveraging the CM-model, and e) to execute a suite of queries (including temporal ones) and prove the utility of the captured data in making meaningful decisions about user interaction with a web site.
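The abstract does not define the CM-model, so the following is only an illustrative sketch under a stated assumption: that a connectivity matrix can be approximated as a count of page-to-page transitions in a user's navigation path. The function names (`build_connectivity_matrix`, `aggregate`, `similarity`) and the Jaccard-based comparison are hypothetical and are not taken from the paper. The sketch shows how individual profiles, an aggregate profile, and a simple path-similarity score (the kind of measure a clustering step might use) could be derived from clickstream sessions.

```python
from collections import defaultdict


def build_connectivity_matrix(path):
    """Count page-to-page transitions in one navigation path.

    ASSUMPTION: a sparse dict keyed by (source, destination) stands in
    for the paper's connectivity matrix, whose real form is not given.
    """
    matrix = defaultdict(int)
    for src, dst in zip(path, path[1:]):
        matrix[(src, dst)] += 1
    return dict(matrix)


def aggregate(matrices):
    """Sum individual matrices into one aggregate profile."""
    total = defaultdict(int)
    for matrix in matrices:
        for edge, count in matrix.items():
            total[edge] += count
    return dict(total)


def similarity(a, b):
    """Jaccard overlap of the transition sets of two matrices.

    ASSUMPTION: a simple stand-in for whatever distance the paper's
    clustering uses.
    """
    edges_a, edges_b = set(a), set(b)
    union = edges_a | edges_b
    return len(edges_a & edges_b) / len(union) if union else 1.0


# Hypothetical sessions: each is an ordered list of visited pages.
user_1 = build_connectivity_matrix(["home", "news", "sports", "news"])
user_2 = build_connectivity_matrix(["home", "news", "sports"])
site_profile = aggregate([user_1, user_2])
```

Because each profile is updated one transition at a time, a structure like this could be maintained incrementally as clicks arrive, which is one plausible reading of the real-time requirement stated in the abstract.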
